4 Sources
[1]
Nvidia unveils new Cosmos world models, infra for robotics and physical uses | TechCrunch
Nvidia on Monday unveiled a set of new world AI models, libraries and other infrastructure for robotics developers, most notable of which is Cosmos Reason, a 7-billion-parameter "reasoning" vision language model for physical AI applications and robots. Also joining the existing batch of Cosmos world models are Cosmos Transfer-2, which can accelerate synthetic data generation from 3D simulation scenes or spatial control inputs, and a distilled version of Cosmos Transfers that is more optimized for speed. During its announcement at the SIGGRAPH conference on Monday, Nvidia noted that these models are meant to be used to create synthetic text, image and video data sets for training robots and AI agents. Cosmos Reason, per Nvidia, allows robots and AI agents to "reason" thanks to its memory and physics understanding, which lets it "serve as a planning model to reason what steps an embodied agent might take next." The company says it can be used for data curation, robot planning, and video analytics. The company also unveiled new neural reconstruction libraries, which includes one for a rendering technique that lets developers simulate the real world in 3D using sensor data. This rendering capability is also being integrated into open source simulator CARLA, a popular developer platform. There's even an update to the Omniverse software development kit. There's new servers for robotics workflows, too. The Nvidia RTX Pro Blackwell Servers offers a single architecture for robotic development workloads, while Nvidia DGX Cloud is a cloud-based management platform. These announcements come as the semiconductor giant is pushing further into robotics as it looks toward the next big use case for its AI GPUs beyond AI data centers.
[2]
CrowdStrike, Uber, Zoom Among Industry Pioneers Building Smarter Agents With NVIDIA Nemotron and Cosmos Reasoning Models for Enterprise and Physical AI Applications
Open reasoning models provide faster and extended thinking to generate smarter outcomes for AI agents across customer service, cybersecurity, manufacturing, logistics and robotics. AI agents are poised to deliver as much as $450 billion from revenue gains and cost savings by 2028, according to Capgemini. Developers building these agents are turning to higher-performing reasoning models to improve AI agent platforms and physical AI systems. At SIGGRAPH, NVIDIA today announced an expansion of two model families with reasoning capabilities -- NVIDIA Nemotron and NVIDIA Cosmos -- that leaders across industries are using to drive productivity via teams of AI agents and humanoid robots. CrowdStrike, Uber, Magna, NetApp and Zoom are among some of the enterprises tapping into these model families. New NVIDIA Nemotron Nano 2 and Llama Nemotron Super 1.5 models offer the highest accuracy in their size categories for scientific reasoning, math, coding, tool-calling, instruction-following and chat. These new models give AI agents the power to think more deeply and work more efficiently -- exploring broader options, speeding up research and delivering smarter results within set time limits. Think of the model as the brain of an AI agent -- it provides the core intelligence. But to make that brain useful for a business, it must be embedded into an agent that understands specific workflows, in addition to industry and business jargon, and operates safely. NVIDIA helps enterprises bridge that gap with leading libraries and AI blueprints for onboarding, customizing and governing AI agents at scale. Cosmos Reason is a new reasoning vision language model (VLM) for physical AI applications that excels in understanding how the real world works, using structured reasoning to understand concepts like physics, object permanence and space-time alignment. Cosmos Reason is purpose-built to serve as the reasoning backbone to a robot vision language action (VLA) model, or critique and caption training data for robotics and autonomous vehicles, and equip runtime visual AI agents with spatial-temporal understanding and reasoning of physical operations, like in factories or cities. Nemotron: Highest Accuracy and Efficiency for Agentic Enterprise AI As enterprises develop AI agents to tackle complex, multistep tasks, models that can provide strong reasoning accuracy with efficient token generation enable intelligent, autonomous decision-making at scale. NVIDIA Nemotron is a family of advanced open reasoning models that use leading models, NVIDIA-curated open datasets and advanced AI techniques to provide an accurate and efficient starting point for AI agents. The latest Nemotron models deliver leading efficiency in three ways: a new hybrid model architecture, compact quantized models and a configurable thinking budget that provides developers with control over token generation, resulting in 60% lower reasoning costs. This combination lets the models reason more deeply and respond faster, without needing more time or computing power. This means better results at a lower cost. Nemotron Nano 2 provides as much as 6x higher token generation compared with other leading models of its size. Llama Nemotron Super 1.5 achieves leading performance and the highest reasoning accuracy in its class, empowering AI agents to reason better, make smarter decisions and handle complex tasks independently. It's now available in NVFP4, or 4-bit floating point, which delivers as much as 6x higher throughput on NVIDIA B200 GPUs compared with NVIDIA H100 GPUs. The chart above shows the Nemotron model delivers top reasoning accuracy in the same timeframe and on the same compute budget, delivering the highest accuracy per dollar. Along with the two new Nemotron models, NVIDIA is also announcing its first open VLM training dataset -- Llama Nemotron VLM dataset v1 -- with 3 million samples of optical character recognition, visual QA and captioning data that power the previously released Llama 3.1 Nemotron Nano VL 8B model. In addition to the accuracy of the reasoning models, agents also rely on retrieval-augmented generation to fetch the latest and most relevant information from connected data across disparate sources to make informed decisions. The recently released Llama 3.2 NeMo Retriever embedding model tops three visual document retrieval leaderboards -- ViDoRe V1, ViDoRe V2 and MTEB VisualDocumentRetrieval -- for boosting agentic system accuracy. Using these reasoning and information retrieval models, a deep research agent built using the AI-Q NVIDIA Blueprint is currently No. 1 for open and portable agents on DeepResearch Bench. NVIDIA NeMo and NVIDIA NIM microservices support the entire AI agent lifecycle -- from development and deployment to monitoring and optimization of the agentic systems. Cosmos Reason: A Breakthrough in Physical AI VLMs marked a breakthrough for computer vision and robotics, empowering machines to identify objects and patterns. However, nonreasoning VLMs lack the ability to understand and interact with the real world -- meaning they can't handle ambiguity or novel experiences, nor solve complex multistep tasks. NVIDIA Cosmos Reason is a new open, customizable, 7-billion-parameter reasoning VLM for physical AI and robotics. Cosmos Reason lets robots and vision AI agents reason like humans, using prior knowledge, physics understanding and common sense to understand and act in the physical world. Cosmos Reason enables advanced capabilities across robotics and physical AI applications such as training data critiquing and captioning, robot decision-making and video analytics AI agents. It can help automate the curation and annotation of large, diverse training datasets, accelerating the development of high-accuracy AI models. It can also serve as a sophisticated reasoning engine for robot planning, parsing complex instructions into actionable steps for VLA models, even in new environments. It also powers video analytics AI agents built on the NVIDIA Blueprint for video search and summarization (VSS), enabled by the NVIDIA Metropolis platform, gleaning valuable insights from massive volumes of stored or live video data. These visually perceptive and interactive AI agents can help streamline operations in factories, warehouses, retail stores, airports, traffic intersections and more by spotting anomalies. NVIDIA's robotics research team uses Cosmos Reason for data filtration and curation, and as the "System 2" reasoning VLM behind VLA models such as the next versions of NVIDIA Isaac GR00T NX. Now Serving: NVIDIA Reasoning Models for AI Agents and Robots Everywhere Diverse enterprises and consulting leaders are adopting NVIDIA's latest reasoning models. Leaders spanning cybersecurity to telecommunications are among those working with Nemotron to build enterprise AI agents. Zoom plans to harness Nemotron reasoning models with Zoom AI Companion to make decisions and manage multistep tasks to take action for users across Zoom Meetings, Zoom Chat and Zoom documents. CrowdStrike is testing Nemotron models to enable its Charlotte AI agents to write queries on the CrowdStrike Falcon platform. Amdocs is using NVIDIA Nemotron models in its amAIz Suite to drive AI agents to handle complex, multistep automation spanning care, sales, network and customer support. EY is adopting Nemotron Nano 2, given its high throughput, to support agentic AI in large organizations for tax, risk management and finance use cases. NetApp is currently testing Nemotron reasoning models so that AI agents can search and analyze business data DataRobot is working with Nemotron models for its Agent Workforce Platform for end-to-end agent lifecycle management. Tabnine is working with Nemotron models for suggesting and automating coding tasks on behalf of developers. Automation Anywhere, CrewAI and Dataiku are among the additional agentic AI software developers integrating Nemotron models into their platforms. Leading companies across transportation, safety and AI intelligence are using Cosmos Reason to advance autonomous driving, video analytics, and road and workplace safety. Uber is exploring Cosmos Reason to analyze autonomous vehicle behavior. In addition, Uber is post-training Cosmos Reason to summarize visual data and analyze scenarios like pedestrians walking across highways to perform quality analysis and inform autonomous driving behavior. Cosmos Reason can also serve as the brain of autonomous vehicles. It lets robots interpret environments and, given complex commands, break them down into tasks and execute them using common sense, even in unfamiliar environments. Centific is testing Cosmos Reason to enhance its AI-powered video intelligence platform. The VLM enables the platform to process complex video data into actionable insights, helping reduce false positives and improve decision-making efficiency. VAST is advancing real-time urban intelligence using NVIDIA Cosmos Reason with its AI operating system to process massive video streams at scale. With the VSS Blueprint, VAST can build agents that can identify incidents and trigger responses, turning video streams and metadata into actionable, proactive public safety tools. Ambient.ai is working with Cosmos Reason's temporal, physics-aware reasoning, to enable automated detection of missing personal protection equipment and monitoring of hazardous conditions, helping enhance environmental health and safety across construction, manufacturing, logistics and other industrial settings. Magna is developing with Cosmos Reason as part of its City Delivery Platform -- a fully autonomous, low-cost solution for instant delivery -- to help vehicles adapt more quickly to new cities. The model adds world understanding to the vehicles' long-term trajectory planning. These models are expected to be available as NVIDIA NIM microservices for secure, reliable deployment on any NVIDIA-accelerated infrastructure for maximum privacy and control. They are planned to be available soon through Amazon Bedrock and Amazon SageMaker AI for Nemotron models, as well as through Azure AI Foundry, Oracle Data Science Platform and Google Vertex AI. Try Cosmos Reason on build.nvidia.com or download it from Hugging Face or GitHub. Nemotron Nano 2 and Llama Nemotron Super 1.5 (NVFP4) will be available soon for download. Meanwhile, learn more about Nemotron models and download previous versions. Download the Llama Nemotron VLM Dataset v1 from Hugging Face. Watch the NVIDIA Research special address at SIGGRAPH and learn more about how graphics and simulation innovations come together to drive industrial digitalization by joining NVIDIA at the conference, running through Thursday, Aug. 14. See notice regarding software product information.
[3]
Nvidia debuts next-gen agentic AI and reasoning robotic models at SIGGRAPH 2025 - SiliconANGLE
Nvidia debuts next-gen agentic AI and reasoning robotic models at SIGGRAPH 2025 Nvidia Corp. announced today that it's expanding its offerings of smarter AI models, physical intelligence for robotics and powerful enterprise AI servers. Leading the news at the ACM SIGGRAPH 2025 computer graphics conference in Vancouver, Nvidia unveiled that the Nvidia RTX Pro 6000 Blackwell Server Edition GPU, a graphics processing unit designed for servers, is now coming to enterprise servers. This new addition will allow organizations to run large language models at high speed and these 2U form-factor rack-mountable servers will use the Blackwell architecture to deliver high-performance AI inference workloads. "AI is reinventing computing for the first time in 60 years -- what started in the cloud is now transforming the architecture of on-premises data centers," said Jensen Huang, founder and chief executive of Nvidia. The new Blackwell RTX Pro Servers bring GPU acceleration to traditional CPU-based workloads -- including data analytics, simulation, video processing and graphics rendering -- enabling up to 45 times better performance. According to Nvidia, this results in 18 times higher energy efficiency and significantly lower cost compared with CPU-only systems. Nvidia is partnering with Cisco Systems Inc., Dell Technologies Inc., Hewlett Packard Enterprise Co., Lenovo Group Ltd. and Super Micro Computer, Inc. to offer the new servers in a variety of configurations. "With the world's leading server providers, we're making Nvidia Blackwell RTX Pro Servers the standard platform for enterprise and industrial AI," added Huang. Artificial intelligence agents are forming the foundation of a growing market, as more organizations adopt their autonomous capabilities. These agents can reason through complex tasks and plan across longer time horizons. Nvidia announced an expansion of its Nemotron model family, introducing two new models with advanced reasoning capabilities for building smarter AI agents: Nemotron Nano 2 and Llama Nemotron Super 1.5. The company said these models deliver high accuracy for their size categories in areas such as scientific reasoning, coding, tool use, instruction following and chat. Designed to empower agents with deeper cognitive abilities, the models help AI systems explore options, weigh decisions and deliver results within defined constraints. Nemotron Nano 2 achieves up to six times higher token generation throughput compared to other models in its class. Llama Nemotron Super 1.5 offers top-tier performance and leads in reasoning accuracy, making it suitable for handling complex enterprise tasks. Nvidia is working with enterprise partners and consulting leaders to deploy these reasoning models. These include Zoom Video Communications Inc., which plans to integrate Nemotron into its AI Companion, as well as CrowdStrike Holdings Inc., which is testing the models in its Charlotte AI agents for writing cybersecurity queries, and Ernst & Young Global Ltd., adopting Nemotron Nano 2 to empower agents analyzing business data. Nvidia is empowering robotics and machines to "see" and reason about the world with new AI models that combine both the ability to ingest visual information and additionally think about that information. Vision language models, or VLMs, provided computer vision for robotics, allowing them to understand and interact with the world, but they lacked the ability to think about their actions. Today, Nvidia announced Cosmos Reason, a new open, customizable 7 billion-parameter reasoning VLM for physical AI vision agents and robotics. It allows robots and vision agents to think about what they see similar to humans and plan about what's in a scene using intelligence such as physics knowledge and common sense from training data. The company said it can help automate the curation and annotation of large, diverse training datasets, accelerating the development of high-accuracy AI models. It added that it can also serve as a sophisticated reasoning engine for robot planning, parsing complex instructions into steps for VLA models, even in new environments. These new advances form the foundation for smart cities, facilities and industrial processes. Nvidia said it's working with companies that include Accenture plc, Belden Inc., DeepHow Inc., Milestone Systems A/C and Telit Cinterion Ltd. to increase productivity and safety at their locations using physical AI-based perception and reasoning. Infrastructure that can perceive, reason and react that relies on sensors and vision AI using the Nvidia Metropolis platform, which makes it possible to develop and deploy video analytics AI agents and services for campuses and facilities.
[4]
NVIDIA Opens Portals to World of Robotics With New Omniverse Libraries, Cosmos Physical AI Models and AI Computing Infrastructure
Powered by new RTX PRO™ Servers and DGX™ Cloud, the libraries and models let developers anywhere develop physically accurate digital twins, capture and reconstruct the real world in simulation, generate synthetic data for training physical AI models and build AI agents that understand the physical world. "Computer graphics and AI are converging to fundamentally transform robotics," said Rev Lebaredian, vice president of Omniverse and simulation technologies at . "By combining AI reasoning with scalable, physically accurate simulation, we're enabling developers to build tomorrow's robots and autonomous vehicles that will transform trillions of dollars in industries." New Omniverse Libraries Advance Applications for World Composition New Omniverse software development kits (SDKs) and libraries are now available for building and deploying industrial AI and robotics simulation applications. Omniverse NuRec rendering is now integrated in CARLA, a leading open-source simulator used by over 150,000 developers. Autonomous vehicle (AV) toolchain leader Foretellix is integrating NuRec, Omniverse Sensor RTX™ and Cosmos Transfer to enhance its scalable synthetic data generation with physically accurate scenarios. Voxel51's data engine for visual and multimodal AI, FiftyOne, supports NuRec to ease data preparation for reconstructions. FiftyOne is used by customers such as and Porsche. Boston Dynamics, Figure AI, Hexagon, , Lightwheel and Skild AI are adopting Omniverse libraries, and to accelerate their AI robotics development, while Amazon Devices & Services is using them to power a new manufacturing solution. Cosmos Advances World Generation for Robotics Cosmos WFMs, downloaded over 2 million times, let developers generate diverse data for training robots at scale using text, image and video prompts. New models announced at SIGGRAPH deliver major advances in synthetic data generation speed, accuracy, language support and control: Lightwheel, Moon Surgical and Skild AI are using Cosmos Transfer to accelerate physical AI training by simulating diverse conditions at scale. Cosmos Reason Breaks Through World Understanding Since the introduction of OpenAI's CLIP model, vision language models (VLMs) have transformed computer-vision tasks like object and pattern recognition. However, they have not yet been able to solve multistep tasks nor handle ambiguity or novel experiences. Cosmos Reason -- a new open, customizable, 7-billion-parameter reasoning VLM for physical AI and robotics -- lets robots and vision AI agents reason like humans, using prior knowledge, physics understanding and common sense to understand and act in the real world. can be used for robotics and physical AI applications including: NVIDIA's robotics and DRIVE™ teams are using for data curation and filtering, annotation and VLA post-training. Uber is using it to annotate and caption AV training data. Magna is developing with as part of its City Delivery platform -- a fully autonomous, low-cost solution for instant delivery -- to help vehicles adapt more quickly to new cities. adds world understanding to the vehicles' long-term trajectory planner. VAST Data, Milestone Systems and Linker Vision are adopting to automate traffic monitoring, improve safety and enhance visual inspection in cities and industrial settings. New AI Infrastructure Powers Robotics Workloads Anywhere To enable developers to take full advantage of these advanced technologies and software libraries, announced AI infrastructure designed for the most demanding workloads. Accelerating the Developer Ecosystem To help robotics and physical AI developers advance 3D and simulation technology adoption, also announced: Certain statements in this press release including, but not limited to, statements as to: computer graphics and AI converging to fundamentally transform robotics; by combining AI reasoning with scalable, physically accurate simulation, enabling developers to build tomorrow's robots and autonomous vehicles that will transform trillions of dollars in industries; the benefits, impact, performance, and availability of NVIDIA's products, services, and technologies; expectations with respect to NVIDIA's third party arrangements, including with its collaborators and partners; expectations with respect to technology developments; and other statements that are not historical facts are forward-looking statements within the meaning of Section 27A of the Securities Act of 1933, as amended, and Section 21E of the Securities Exchange Act of 1934, as amended, which are subject to the "safe harbor" created by those sections based on management's beliefs and assumptions and on information currently available to management and are subject to risks and uncertainties that could cause results to be materially different than expectations. Important factors that could cause actual results to differ materially include: global economic and political conditions; NVIDIA's reliance on third parties to manufacture, assemble, package and test NVIDIA's products; the impact of technological development and competition; development of new products and technologies or enhancements to NVIDIA's existing product and technologies; market acceptance of NVIDIA's products or NVIDIA's partners' products; design, manufacturing or software defects; changes in consumer preferences or demands; changes in industry standards and interfaces; unexpected loss of performance of NVIDIA's products or technologies when integrated into systems; and changes in applicable laws and regulations, as well as other factors detailed from time to time in the most recent reports files with the , or , including, but not limited to, its annual report on Form 10-K and quarterly reports on Form 10-Q. Copies of reports filed with the are posted on the company's website and are available from without charge. These forward-looking statements are not guarantees of future performance and speak only as of the date hereof, and, except as required by law, disclaims any obligation to update these forward-looking statements to reflect future events or circumstances. Many of the products and features described herein remain in various stages and will be offered on a when-and-if-available basis. The statements above are not intended to be, and should not be interpreted as a commitment, promise, or legal obligation, and the development, release, and timing of any features or functionalities described for our products is subject to change and remains at the sole discretion of . will have no liability for failure to deliver or delay in the delivery of any of the products, features or functions set forth herein. © 2025 . All rights reserved. , the logo, Cosmos, DGX, DRIVE, Isaac Sim, Omniverse, Omniverse Cloud Sensor RTX, RTX and RTX PRO are trademarks and/or registered trademarks of in the and/or other countries. Other company and product names may be trademarks of the respective companies with which they are associated. Features, pricing, availability, and specifications are subject to change without notice. A photo accompanying this announcement is available at https://www.globenewswire.com/NewsRoom/AttachmentNg/e81778ef-d0e2-4944-9889-02609ac378cd
Share
Copy Link
Nvidia announces new AI models and infrastructure for robotics and enterprise applications, including Cosmos Reason for physical AI and Nemotron models for improved reasoning capabilities in AI agents.
Nvidia has unveiled a suite of new AI models and infrastructure aimed at revolutionizing robotics and enterprise applications. The announcement, made at the SIGGRAPH conference, showcases the company's commitment to advancing AI capabilities across various sectors 1.
At the forefront of Nvidia's announcements is Cosmos Reason, a 7-billion-parameter "reasoning" vision language model (VLM) designed for physical AI applications and robotics 1. This open, customizable model enables robots and AI agents to "reason" with human-like capabilities, utilizing prior knowledge, physics understanding, and common sense to interact with the real world 2.
Source: SiliconANGLE
Cosmos Reason's applications include:
Companies like Uber, Magna, and Milestone Systems are already adopting Cosmos Reason for various applications, from annotating autonomous vehicle training data to enhancing city delivery platforms 4.
Nvidia also introduced two new models in its Nemotron family: Nemotron Nano 2 and Llama Nemotron Super 1.5 2. These models offer:
Major companies like CrowdStrike, Uber, and Zoom are leveraging these models to enhance their AI agent platforms 2.
Source: TechCrunch
To support these new AI capabilities, Nvidia announced the RTX Pro Blackwell Servers, designed for high-performance AI inference workloads 3. These servers offer:
Additionally, Nvidia introduced DGX Cloud, a cloud-based management platform for robotics workflows 1.
Nvidia has also released new Omniverse software development kits (SDKs) and libraries for building and deploying industrial AI and robotics simulation applications 4. These include:
Companies like Boston Dynamics, Figure AI, and Amazon Devices & Services are adopting these libraries to accelerate their AI robotics development 4.
Source: NVIDIA Blog
Jensen Huang, founder and CEO of Nvidia, emphasized the transformative potential of these technologies: "AI is reinventing computing for the first time in 60 years -- what started in the cloud is now transforming the architecture of on-premises data centers" 3.
With AI agents projected to deliver up to $450 billion in revenue gains and cost savings by 2028, according to Capgemini, Nvidia's new offerings are poised to play a crucial role in shaping the future of AI across industries 2.
As these technologies continue to evolve, they promise to revolutionize sectors ranging from customer service and cybersecurity to manufacturing, logistics, and robotics, opening new frontiers in AI-driven innovation and productivity.
GitHub CEO Thomas Dohmke steps down, marking the end of GitHub's independence as Microsoft integrates it into its CoreAI organization, signaling a shift towards AI-focused development.
8 Sources
Business and Economy
12 hrs ago
8 Sources
Business and Economy
12 hrs ago
xAI, Elon Musk's AI company, has made its advanced Grok 4 model available to all users, including those on the free tier, for a limited time. This move comes as competition intensifies in the AI industry, particularly following the release of OpenAI's GPT-5.
6 Sources
Technology
20 hrs ago
6 Sources
Technology
20 hrs ago
Elon Musk accuses Apple of antitrust violations, claiming the company unfairly favors OpenAI's ChatGPT in App Store rankings. Musk's xAI threatens immediate legal action, escalating tensions in the AI industry.
10 Sources
Policy and Regulation
4 hrs ago
10 Sources
Policy and Regulation
4 hrs ago
NVIDIA announces the integration of RTX Pro 6000 Blackwell Server Edition GPUs into 2U rack mount servers, offering enhanced AI performance and efficiency for enterprise data centers.
4 Sources
Technology
12 hrs ago
4 Sources
Technology
12 hrs ago
Reddit has begun blocking the Internet Archive's Wayback Machine from indexing most of its content, citing concerns over AI companies scraping data without permission. This move has significant implications for digital preservation and raises questions about data access in the AI era.
8 Sources
Technology
12 hrs ago
8 Sources
Technology
12 hrs ago