Nvidia Unveils Rubin CPX: A Game-Changer for Long-Context AI Inference

Reviewed by Nidhi Govil

7 Sources


Nvidia has announced the Rubin CPX, a new GPU designed for AI workloads with context windows of more than 1 million tokens. Set for release in late 2026, the chip targets video generation, software development, and other long-context AI tasks.

Nvidia's Rubin CPX: Powering Next-Gen AI

Nvidia has unveiled the Rubin CPX GPU, engineered for AI workloads demanding context windows exceeding 1 million tokens. This next-generation chip, part of the new Rubin architecture, represents a significant leap for long-context inference, which is crucial for evolving AI applications like video generation and complex software development [1].

Source: Benzinga


Technical Innovation and Performance

The Rubin CPX features a monolithic die with 128GB of GDDR7 memory, components optimized for large language model attention mechanisms, and hardware support for video encoding and decoding. Key performance gains include 3x faster attention processing and the ability to handle one million tokens of context [3]. Nvidia's "disaggregated inference" approach further optimizes AI processing by separating input analysis from response generation, with the CPX handling the initial context phase [3].
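The split between the context phase and the generation phase can be illustrated with a toy sketch. It is not Nvidia's implementation: the names `KVCache`, `context_phase`, `generation_phase`, and `disaggregated_inference` are hypothetical, and the "attention state" and next-token rule are arithmetic placeholders chosen only to show how the first phase builds state that the second phase consumes.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class KVCache:
    """Toy stand-in for the attention state built from the prompt (assumption)."""
    entries: list[int] = field(default_factory=list)


def context_phase(prompt_tokens: list[int]) -> KVCache:
    """Process the whole prompt once and build the cache.

    Per the article, this input-analysis phase is the one the CPX focuses on.
    """
    cache = KVCache()
    for tok in prompt_tokens:
        # Placeholder state; a real system would store key/value tensors.
        cache.entries.append(tok * 2)
    return cache


def generation_phase(cache: KVCache, max_new_tokens: int) -> list[int]:
    """Produce the response one token at a time, reading and extending the cache."""
    output: list[int] = []
    for _ in range(max_new_tokens):
        # Toy next-token rule that depends on everything cached so far.
        next_tok = sum(cache.entries) % 100
        output.append(next_tok)
        cache.entries.append(next_tok * 2)
    return output


def disaggregated_inference(prompt_tokens: list[int], max_new_tokens: int) -> list[int]:
    """Run the two phases as separate steps, handing the cache from one to the other."""
    cache = context_phase(prompt_tokens)             # would run on context-phase hardware
    return generation_phase(cache, max_new_tokens)   # would run on generation-phase hardware


if __name__ == "__main__":
    print(disaggregated_inference([1, 2, 3], 3))
```

The point of the sketch is the hand-off: the context phase touches every prompt token once, while the generation phase is a sequential loop, so the two can in principle be placed on hardware tuned for each.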

Source: Analytics India Magazine


Market Impact and Future Outlook

Nvidia projects that a $100 million investment in Rubin CPX infrastructure could generate up to $5 billion in token revenue [2]. Despite Nvidia's market dominance, competition from Broadcom and Google's TPUs is rising [5]. The Rubin CPX, slated for release in late 2026 as part of the Vera Rubin NVL144 CPX system, promises over 8 exaflops of computing capacity, seven times that of current top-end systems, to meet the escalating demands of sophisticated AI applications [3].
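For a sense of scale, the figures quoted above can be turned into ratios. This is only back-of-the-envelope arithmetic on the article's numbers: the variable names are illustrative, the revenue figure is an "up to" projection, and the implied baseline is derived from the "seven times" claim rather than stated by any source.

```python
# Figures as quoted in the article.
investment_usd = 100e6                 # $100 million infrastructure investment
projected_token_revenue_usd = 5e9      # up to $5 billion in token revenue

# Upper-bound revenue multiple implied by the projection.
revenue_multiple = projected_token_revenue_usd / investment_usd

system_exaflops = 8.0                  # "over 8 exaflops" for the NVL144 CPX system
speedup_vs_current = 7.0               # "seven times" current top-end systems

# Derived, not quoted: the capacity of a current top-end system implied by the two figures.
implied_baseline_exaflops = system_exaflops / speedup_vs_current

print(f"Revenue multiple: up to {revenue_multiple:.0f}x")
print(f"Implied current top-end capacity: about {implied_baseline_exaflops:.2f} exaflops")
```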

Source: SiliconANGLE


TheOutpost.ai

© 2025 Triveous Technologies Private Limited