Nvidia launches Nemotron 3 open-source AI models as Meta pulls back from the field

Reviewed byNidhi Govil

10 Sources

Share

Nvidia unveiled its Nemotron 3 family of open-source AI models, ranging from 30 billion to 500 billion parameters, with full training data transparency. The chipmaker is positioning itself to lead open-source AI development as Meta signals a potential shift away from its Llama open models toward proprietary systems.

News article

Nvidia Steps Into Open-Source AI Leadership With Nemotron 3 Family

Nvidia has released its Nemotron 3 family of open-source AI models, marking a strategic shift for the chipmaker from hardware supplier to serious model maker in the AI development ecosystem

1

. The new large language models come in three sizes: Nano with 30 billion parameters, Super with 100 billion, and Ultra with 500 billion parameters

1

. CEO Jensen Huang emphasized that "open innovation is the foundation of AI progress," positioning Nemotron as a platform that gives developers transparency and efficiency for building agentic AI systems at scale

1

.

The timing proves significant as Meta, which pioneered advanced open models with Llama in February 2023, appears to be retreating from open-source AI

2

. Llama models no longer appear in the top 100 on LMSYS's LMArena Leaderboard, and Bloomberg reports suggest Meta's forthcoming "Avocado" project may launch as a closed model

2

. Venture capital firm Menlo Ventures noted that enterprise open-source share declined from 19% to 11%, partly blaming Llama's stagnation with no major releases since April

2

.

Hybrid Architecture Delivers Efficiency and Scalability

Nvidia built the Nemotron 3 models using a breakthrough hybrid latent mixture-of-experts architecture combined with Mamba-Transformer design

4

. Kari Briski, Nvidia's vice president of generative AI software for enterprise, explained that this architecture achieves up to 4x higher throughput compared to Nemotron 2 Nano while reducing reasoning token generation by up to 60%

4

. The Nano model extends the context window to one million tokens, seven times larger than its predecessor

2

.

The mixture-of-experts architecture allows experts within the model to share a common core while keeping only small parts private, improving both scalability and efficiency

4

. Using Nvidia's ultra-efficient 4-bit NVFP4 training format, the larger Super and Ultra models can train on existing infrastructure without compromising accuracy

4

. This design compresses memory needs and optimizes compute while delivering exceptional intelligence for size

5

.

Full Transparency Sets Nvidia Apart in Open-Source Competition

Nvidia is taking a more transparent approach than many US rivals by releasing the training data used to build Nemotron 3, along with tools for customization and fine-tuning

1

. The company is releasing three trillion tokens of pretraining, post-training, and reinforcement learning dataβ€”samples that are "orders of magnitude larger than any available post-training data set," according to Briski

4

. This level of openness addresses enterprise concerns about understanding how models are trained and where training data originates

4

.

Nvidia is also launching NeMo Gym, a reinforcement learning lab where users can test their models and agents in simulated environments

4

. Reinforcement learning involves giving models simulated rewards and punishments, creating reasoning systems capable of operating in complex, dynamic conditions

5

. Early adopters include Accenture, CrowdStrike, Oracle Cloud Infrastructure, Palantir, Perplexity, ServiceNow, Siemens, and Zoom

4

.

Strategic Response to Chinese Dominance and Hardware Competition

The move addresses the growing dominance of Chinese open-source models from DeepSeek and Alibaba's Qwen, which currently lead in popularity on platforms like Hugging Face as American tech giants have become more secretive

1

. Briski told reporters that while she agrees with the decline of Llama, she doesn't agree with claims about open-source decline overall, pointing to the popularity of Qwen and DeepSeek models

2

. Nvidia emphasized it had the most contributions and repositories on HuggingFace this year

2

.

The chipmaker's push into model development may also hedge against AI companies like OpenAI, Google, and Anthropic developing their own chips, potentially reducing reliance on Nvidia hardware over time

1

. By providing open models optimized for its silicon, Nvidia aims to ensure new models built worldwide remain aligned with its hardware empire rather than emerging competitors in China

3

. "When we're the best development platform, people are going to choose us, choose our platform, choose our GPU," Briski said

3

.

Release Timeline and Enterprise Focus

Nemotron 3 Nano is available now on HuggingFace, with Super expected in January and Ultra due in March or April

2

. Briski emphasized that reliability and a consistent roadmap of releases are critical for developers who "can't rely on a model when there's only been just one model release and no road map"

3

. She noted that OpenAI's last open model release was over five years ago before its recent gpt-oss models in August

3

.

The AI ecosystem increasingly demands customization for particular tasks, with enterprises needing to hand queries off to different models and squeeze more intelligent responses through simulated reasoning

1

. Nemotron 3 addresses what Briski called "a tough trifecta"β€”finding models that are ultra-open, extremely intelligent, and highly efficient without forcing painful trade-offs between token costs, latency, and throughput

4

. The models aim to power the next generation of agentic AI operations that automate multiple models for proactive and intelligent agents, moving beyond single-model chatbots

5

.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Β© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo