The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2025 TheOutpost.AI All rights reserved
Curated by THEOUTPOST
On Tue, 7 Jan, 8:08 AM UTC
4 Sources
[1]
Nvidia's New Llama Nemotron LLMs Can Build and Deploy AI Agents
They can also be used for fraud detection and product supply chain Nvidia announced the Llama Nemotron family of open large language models (LLMs) on Monday. The company said that with the rise of artificial intelligence (AI) agents, new and more sophisticated AI models were required to handle the workflow of agentic AI. Highlighting the need for more power and higher efficiency, the tech giant stated that the Nemotron family models can create and deploy AI agents across various applications. The company claimed that the AI models will be available for enterprises via the Nvidia NIM microservice. In a blog post, the tech giant announced its new series of open-source LLMs dubbed Nemotron. The series also contains Cosmos Nemotron vision language models (VLMs), and these can be used to build AI agents that analyse and respond to images and videos. Nvidia said the vision-focused agents can be deployed in autonomous machines, hospitals, stores and warehouses, as well as sports events, movies, and news. Built with Meta's Llama foundation models, the Nvidia Llama Nemotron models are said to be optimised to build and develop AI agents. While the company did not reveal the architecture and technical details, it claimed that these models are trained using "latest techniques and high-quality datasets". The models can be used to train agentic capabilities such as instruction following, chat, function calling, coding and mathematics, and more. Nemotron is also said to optimise the AI agents' size to make it easy to deploy. Nvidia stated that SAP, ServiceNow, and other AI agent platform providers will be among the first to use the new Llama Nemotron models. The Nemotron and Cosmos Nemotron models will be available in three parameter sizes -- Nano, Super, and Ultra. Nano is the most cost-effective model built with low latency as the primary focus. Super is a high-accuracy model that can be run on a single GPU. Finally, Ultra is the highest-accuracy model designed for data centre-scale applications. Nvidia highlighted that enterprises can access the Nemotron model family as downloadable models and as NIM. These models will also be available as application programming interfaces (APIs). While the models are open-source, they are only available for academic and research usage.
[2]
NVIDIA Unveils New Llama Nemotron Models to Build AI Agents
The Nemotron families will be offered in Nano, Super, and Ultra sizes to suit deployment needs, from low-latency real-time applications to high-accuracy data center use cases. At CES 2025, NVIDIA CEO Jensen Huang launched new Nemotron models, including the Llama Nemotron large language models (LLMs) and Cosmos Nemotron vision language models (VLMs), to improve agentic AI and boost enterprise productivity. The Llama Nemotron models, built on Llama foundation models, allow developers to create AI agents for applications like customer support, fraud detection, and supply chain optimisation. "Llama 3.1 is a complete phenomenon, with the downloads reaching 650,000 times. It has been derived and turned into other models, about 60,000 different models. It is singularly the reason why every single enterprise and every single industry has been activated to start working on AI," said Huang. "We realized that the Llama models could really be better fine-tuned for enterprise use, so we fine-tuned them using our expertise and capabilities and turned them into the Llama Nemotron suite of open models," he added. The Nemotron families will be offered in Nano, Super, and Ultra sizes to suit deployment needs, from low-latency real-time applications to high-accuracy data center use cases. Optimised for computing efficiency and accuracy, these models support agentic AI tasks like instructions for following, coding, and math. "Agentic AI is the next frontier of AI development, and delivering on this opportunity requires full-stack optimization across a system of LLMs to deliver efficient, accurate AI agents," said Ahmad Al-Dahle, vice president and head of GenAI at Meta. NVIDIA announced that the models will be available as downloadable resources or as microservices for deployment across various computing platforms, including data centers and edge devices. Llama Nemotron and Cosmos Nemotron models will be available soon on build.nvidia.com, Hugging Face, and through the NVIDIA Developer Program. Enterprise-grade deployments will be supported via the NVIDIA AI Enterprise platform on accelerated cloud and data center infrastructure. NVIDIA's Cosmos Nemotron models extend AI capabilities to vision and video tasks, allowing agents to analyse and respond to images and videos. These tools aim to support industries like autonomous systems, healthcare, retail, and media. NVIDIA also unveiled Cosmos world foundation models for physics-aware video generation in robotics and autonomous vehicle applications. NVIDIA NeMo microservices allow enterprises to customise these models for specific domains and workflows. Leading AI platform providers, such as SAP and ServiceNow, have backed the Nemotron models. SAP plans to incorporate them into its Joule platform to improve enterprise user productivity, while ServiceNow seeks to utilise the models for AI agent services across various industries. The models are built using NVIDIA's NeMo platform for distillation, pruning, and alignment, ensuring high accuracy and throughput across various hardware configurations. NVIDIA NeMo Retriever allows integration with enterprise data, boosting model functionality through retrieval-augmented generation capabilities.
[3]
NVIDIA Announces Nemotron Model Families to Advance Agentic AI
Artificial intelligence is entering a new era -- agentic AI -- where teams of specialized agents can help people solve complex problems and automate repetitive tasks. With custom AI agents, enterprises across industries can manufacture intelligence and achieve unprecedented productivity. These advanced AI agents require a system of multiple generative AI models optimized for agentic AI functions and capabilities. This complexity means that the need for powerful, efficient, enterprise-grade models has never been greater. To provide a foundation for enterprise agentic AI, NVIDIA today announced the Llama Nemotron family of open large language models (LLMs). Built with Llama, the models can help developers create and deploy AI agents across a range of applications -- including customer support, fraud detection, and product supply chain and inventory management optimization. To be effective, many AI agents need both language skills and the ability to perceive the world and respond with the appropriate action. With new NVIDIA Cosmos Nemotron vision language models (VLMs) and NVIDIA NIM microservices for video search and summarization, developers can build agents that analyze and respond to images and video from autonomous machines, hospitals, stores and warehouses, as well as sports events, movies and news. For developers seeking to generate physics-aware videos for robotics and autonomous vehicles, NVIDIA today separately announced NVIDIA Cosmos world foundation models. Built with Llama foundation models -- one of the most popular commercially viable open-source model collections, downloaded over 650 million times -- NVIDIA Llama Nemotron models provide optimized building blocks for AI agent development. This builds on NVIDIA's commitment to developing state-of-the-art models, such as Llama 3.1 Nemotron 70B, now available through the NVIDIA API catalog. Llama Nemotron models are pruned and trained with NVIDIA's latest techniques and high-quality datasets for enhanced agentic capabilities. They excel at instruction following, chat, function calling, coding and math, while being size-optimized to run on a broad range of NVIDIA accelerated computing resources. "Agentic AI is the next frontier of AI development, and delivering on this opportunity requires full-stack optimization across a system of LLMs to deliver efficient, accurate AI agents," said Ahmad Al-Dahle, vice president and head of GenAI at Meta. "Through our collaboration with NVIDIA and our shared commitment to open models, the NVIDIA Llama Nemotron family built on Llama can help enterprises quickly create their own custom AI agents." Leading AI agent platform providers including SAP and ServiceNow are expected to be among the first to use the new Llama Nemotron models. "AI agents that collaborate to solve complex tasks across multiple lines of the business will unlock a whole new level of enterprise productivity beyond today's generative AI scenarios," said Philipp Herzig, chief AI officer at SAP. "Through SAP's Joule, hundreds of millions of enterprise users will interact with these agents to accomplish their goals faster than ever before. NVIDIA's new open Llama Nemotron model family will foster the development of multiple specialized AI agents to transform business processes." "AI agents make it possible for organizations to achieve more with less effort, setting new standards for business transformation," said Jeremy Barnes, vice president of platform AI at ServiceNow. "The improved performance and accuracy of NVIDIA's open Llama Nemotron models can help build advanced AI agent services that solve complex problems across functions, in any industry." The NVIDIA Llama Nemotron models use NVIDIA NeMo for distilling, pruning and alignment. Using these techniques, the models are small enough to run on a variety of computing platforms while providing high accuracy as well as increased model throughput. The Llama Nemotron model family will be available as downloadable models and as NVIDIA NIM microservices that can be easily deployed on clouds, data centers, PCs and workstations. They offer enterprises industry-leading performance with reliable, secure and seamless integration into their agentic AI application workflows. The Llama Nemotron and Cosmos Nemotron model families are coming in Nano, Super and Ultra sizes to provide options for deploying AI agents at every scale. Enterprises can also customize the models for their specific use cases and domains with NVIDIA NeMo microservices to simplify data curation, accelerate model customization and evaluation, and apply guardrails to keep responses on track. With NVIDIA NeMo Retriever, developers can also integrate retrieval-augmented generation capabilities to connect models to their enterprise data. And using NVIDIA Blueprints for agentic AI, enterprises can quickly create their own applications using NVIDIA's advanced AI tools and end-to-end development expertise. In fact, NVIDIA Cosmos Nemotron, NVIDIA Llama Nemotron and NeMo Retriever supercharge the new NVIDIA Blueprint for video search and summarization, announced separately today. NeMo, NeMo Retriever and NVIDIA Blueprints are all available with the NVIDIA AI Enterprise software platform. Llama Nemotron and Cosmos Nemotron models will be available soon as hosted application programming interfaces and for download on build.nvidia.com and Hugging Face. Access for development, testing and research is free for members of the NVIDIA Developer Program. Enterprises can run Llama Nemotron and Cosmos Nemotron NIM microservices in production with the NVIDIA AI Enterprise software platform on accelerated data center and cloud infrastructure.
[4]
Nvidia's Nemotron Model Families will advance AI agents
Available as Nvidia NIM microservices, open Llama Nemotron large language models and Cosmos Nemotron vision language models can supercharge AI agents on any accelerated system. Artificial intelligence is entering a new era -- agentic AI -- where teams of specialized agents can help people solve complex problems and automate repetitive tasks. Nvidia made the announcement as part of Nvidia CEO Jensen Huang's opening keynote today at CES 2025. With custom AI agents, enterprises across industries can manufacture intelligence and achieve unprecedented productivity. These advanced AI agents require a system of multiple generative AI models optimized for agentic AI functions and capabilities. This complexity means that the need for powerful, efficient, enterprise-grade models has never been greater. "AI agents is the next robotic industry and likely to be a multibillion-dollar opportunity," Huang said. To provide a foundation for enterprise agentic AI, Nvidia today announced the Llama Nemotron family of open large language models (LLMs). Built with Llama, the models can help developers create and deploy AI agents across a range of applications -- - including customer support, fraud detection, and product supply chain and inventory management optimization. To be effective, many AI agents need both language skills and the ability to perceive the world and respond with the appropriate action. With new Nvidia Cosmos Nemotron vision language models (VLMs) and Nvidia NIM microservices for video search and summarization, developers can build agents that analyze and respond to images and video from autonomous machines, hospitals, stores and warehouses, as well as sports events, movies and news. For developers seeking to generate physics-aware videos for robotics and autonomous vehicles, Nvidia today separately announced Nvidia Cosmos world foundation models. Open Llama Nemotron Models Optimize Compute Efficiency, Accuracy for AI Agents Built with Llama foundation models -- one of the most popular commercially viable open source model collections, downloaded over 650 million times -- Nvidia Llama Nemotron models provide optimized building blocks for AI agent development. Llama Nemotron models are pruned and trained with Nvidia's latest techniques and high-quality datasets for enhanced agentic capabilities. They excel at instruction following, chat, function calling, coding and math, while being size-optimized to run on a broad range of Nvidia accelerated computing resources. "Agentic AI is the next frontier of AI development, and delivering on this opportunity requires full-stack optimization across a system of LLMs to deliver efficient, accurate AI agents," said Ahmad Al-Dahel, vice president and head of GenAI at Meta, in a statement. "Through our collaboration with Nvidia and our shared commitment to open models, the Nvidia Llama Nemotron family built on Llama can help enterprises quickly create their own custom AI agents." Leading AI agent platform providers including SAP and ServiceNow are expected to be among the first to use the new Llama Nemotron models. "AI agents that collaborate to solve complex tasks across multiple lines of the business will unlock a whole new level of enterprise productivity beyond today's generative AI scenarios," said Philipp Herzig, chief AI officer at SAP, in a statement. "Through SAP's Joule, hundreds of millions enterprise users will interact with these agents to accomplish their goals faster than ever before. Nvidia's new open Llama Nemotron model family will foster the development of multiple specialized AI agents to transform business processes." "AI agents make it possible for organizations to achieve more with less effort, setting new standards for business transformation," said Jeremy Barnes, vice president of platform AI at ServiceNow, in a statement. "The improved performance and accuracy of Nvidia's open Llama Nemotron models can help build advanced AI agent services that solve complex problems across functions, in any industry." The Nvidia Llama Nemotron models use Nvidia NeMo for distilling, pruning and alignment. Using these techniques, the models are small enough to run on a variety of computing platforms while providing high accuracy as well as increased model throughput. The Llama Nemotron model family will be available as downloadable models and as Nvidia NIM microservices that can be easily deployed on clouds, data centers, PCs and workstations. They offer enterprises industry-leading performance with reliable, secure and seamless integration into their agentic AI application workflows. Customize and Connect to Business Knowledge With Nvidia NeMo The Llama Nemotron and Cosmos Nemotron model families are coming in Nano, Super and Ultra sizes to provide options for deploying AI agents at every scale. ● Nano: The most cost-effective model optimized for real-time applications with low latency, ideal for deployment on PCs and edge devices. ● Super: A high-accuracy model offering exceptional throughput on a single GPU. ● Ultra: The highest-accuracy model, designed for data-center-scale applications demanding the highest performance. Enterprises can also customize the models for their specific use cases and domains with Nvidia NeMo microservices to simplify data curation, accelerate model customization and evaluation, and apply guardrails to keep responses on track. With Nvidia NeMo Retriever, developers can also integrate retrieval-augmented generation (RAG) capabilities to connect models to their enterprise data. And using Nvidia Blueprints for agentic AI, enterprises can quickly create their own applications using Nvidia's advanced AI tools and end-to-end development expertise. In fact, Nvidia Cosmos Nemotron, Nvidia Llama Nemotron and NeMo Retriever supercharge the new Nvidia Blueprint for video search and summarization, announced separately today. NeMo, NeMo Retriever and Nvidia Blueprints are all available with the Nvidia AI Enterprise software platform. Availability Llama Nemotron and Cosmos Nemotron models will be available as hosted APIs and for download on build.nvidia.com and on Hugging Face. Access for development, testing and research is free for members of the Nvidia Developer Program. Enterprises can run Llama Nemotron and Cosmos Nemotron NIM microservices in production with the Nvidia AI Enterprise software platform on accelerated data center and cloud infrastructure.
Share
Share
Copy Link
NVIDIA announces new Llama Nemotron and Cosmos Nemotron model families designed to enhance AI agent capabilities and boost enterprise productivity across various applications.
NVIDIA has unveiled its latest innovation in artificial intelligence: the Nemotron model families, including Llama Nemotron large language models (LLMs) and Cosmos Nemotron vision language models (VLMs). Announced at CES 2025, these new models are designed to propel the development of agentic AI, ushering in a new era of enterprise productivity and problem-solving capabilities 12.
Built on Meta's Llama foundation models, the Llama Nemotron family is optimized for creating and deploying AI agents across various applications. These open-source LLMs excel in tasks such as instruction following, chat functionality, function calling, coding, and mathematics 13. NVIDIA claims that the models are trained using "latest techniques and high-quality datasets," although specific architectural details were not disclosed 1.
Complementing the Llama Nemotron models, NVIDIA introduced Cosmos Nemotron VLMs. These models enable AI agents to analyze and respond to images and videos, making them suitable for deployment in autonomous machines, healthcare settings, retail environments, and media applications 13.
Both Nemotron families will be available in three parameter sizes:
NVIDIA is offering the Nemotron models as downloadable resources and as NVIDIA NIM microservices, allowing for easy deployment across various computing platforms 2. Enterprises can customize these models for specific use cases using NVIDIA NeMo microservices, which simplify data curation, accelerate model customization, and apply guardrails 3.
Leading AI platform providers, including SAP and ServiceNow, have expressed support for the Nemotron models. SAP plans to incorporate them into its Joule platform to enhance enterprise user productivity, while ServiceNow aims to utilize the models for AI agent services across various industries 24.
The Llama Nemotron and Cosmos Nemotron models will soon be available through build.nvidia.com, Hugging Face, and the NVIDIA Developer Program. While the models are open-source, they are currently limited to academic and research usage 12. Enterprise-grade deployments will be supported via the NVIDIA AI Enterprise platform on accelerated cloud and data center infrastructure 2.
As AI continues to evolve, NVIDIA's Nemotron model families represent a significant step towards more sophisticated and capable AI agents, potentially revolutionizing how enterprises approach complex problem-solving and task automation across various industries.
Reference
[1]
[2]
[3]
[4]
Nvidia introduces Llama Nemotron, a family of open-source AI models with enhanced reasoning capabilities, designed to provide a foundation for building advanced AI agents. The models offer improved accuracy and inference speed, targeting various deployment scenarios from edge devices to multi-GPU servers.
5 Sources
5 Sources
NVIDIA quietly released a new open-source AI model, Llama-3.1-Nemotron-70B-Instruct, which has reportedly outperformed leading models from OpenAI and Anthropic in benchmark tests, signaling a shift in NVIDIA's AI strategy.
6 Sources
6 Sources
NVIDIA introduces new AI models and blueprints for building agentic AI applications, partnering with leading tech companies to simplify the development and deployment of AI agents for enterprises.
7 Sources
7 Sources
Nvidia releases new NIM microservices as part of NeMo Guardrails to improve security, control, and performance of AI agents, addressing critical concerns in enterprise AI adoption.
5 Sources
5 Sources
NVIDIA introduces AI Agent Blueprints, a new tool designed to simplify the creation of AI-powered enterprise applications. This release aims to democratize AI development and enable businesses to build custom AI experiences efficiently.
3 Sources
3 Sources