OpenAI Cerebras $10 Billion AI Chip Deal Explained

OpenAI and Cerebras Forge Massive Computing Partnership

OpenAI announced Wednesday that it reached a multi-year agreement with AI chipmaker Cerebras in a $10 billion deal that will deliver 750 megawatts of computing power starting this year and continuing through 20281

. The partnership marks a strategic move by OpenAI to diversify its supply chain beyond traditional GPU providers and accelerate inference capabilities for ChatGPT users.

Source: SiliconANGLE

According to sources familiar with the matter, the AI chip deal is valued at more than $10 billion, making it one of the largest compute agreements in the AI industry3

. Cerebras will take on the risk of building and leasing datacenters equipped with its wafer-scale AI accelerators to serve OpenAI's inference workloads2

. The AI infrastructure will be deployed in multiple stages, with Cerebras hosting the systems dedicated to delivering low-latency responses.

Why Speed Matters for Real-Time AI Applications

Both companies emphasized that the partnership centers on delivering faster outputs for OpenAI's customers. "Integrating Cerebras into our mix of compute solutions is all about making our AI respond much faster," OpenAI explained in a blog post2

. When users ask complex questions, generate code, create images, or run AI agents, there's a loop happening behind the scenes where the model processes requests and sends responses back.

Andrew Feldman, co-founder and CEO of the semiconductor startup, drew a powerful analogy: "Just as broadband transformed the internet, real-time inference will transform AI"1

. He emphasized that AI's inference stage—the process of getting AI models to respond to queries—is crucial to advancement, and that's where Cerebras' products excel. Recent work showed Cerebras running OpenAI's GPT-OSS-120B model 15 times faster than conventional hardware3

Sachin Katti of OpenAI stated that "Cerebras adds a dedicated low-latency inference solution to our platform. That means faster responses, more natural interactions, and a stronger foundation to scale real-time AI to many more people"1

. In the age of reasoning models and AI agents, faster inference means models can "think" for longer without compromising on interactivity2

Technical Advantages of Cerebras' Wafer-Scale Architecture

By integrating Cerebras' wafer-scale compute architecture into its inference pipeline, OpenAI can leverage the chip's massive SRAM capacity to accelerate processing power. Each WSE-3 accelerator measures 46,225 mm² and is equipped with 44 GB of SRAM2

. Compared to the HBM found on modern GPUs, SRAM operates several orders of magnitude faster. While a single Nvidia Rubin GPU can deliver around 22 TB/s of memory bandwidth, Cerebras' chips achieve nearly 1,000x that at 21 petabytes per second2

Source: The Register

Running models like OpenAI's gpt-oss 120B, Cerebras' chips can reportedly achieve single-user performance of 3,098 tokens per second compared to 885 tok/s for competitor Together AI, which uses Nvidia GPUs2

. This performance advantage positions Cerebras as a compelling alternative in the market dominance currently held by Nvidia.

OpenAI's Strategy to Diversify Beyond Nvidia

The partnership represents OpenAI's continued effort to diversify supply chain dependencies as it pursues an aggressive expansion plan. "OpenAI's compute strategy is to build a resilient portfolio that matches the right systems to the right workloads," said Katti1

. While Nvidia CEO Jensen Huang boasted in November that "everything that OpenAI does runs on Nvidia today," competition is clearly emerging4

Last year, OpenAI committed more than $1.4 trillion to infrastructure deals with companies including Nvidia, Advanced Micro Devices, and Broadcom4

. In September, Nvidia announced it would invest as much as $100 billion in OpenAI to build AI infrastructure with at least 10 gigawatts of power capacity. In October, AMD said it would deploy 6 gigawatts worth of graphics processing units over multiple years for OpenAI3

. OpenAI is also developing its own chip with Broadcom3

Implications for Cerebras' IPO and Market Position

For Cerebras, this high-profile win moves it closer to tapping into the tens of billions of dollars being poured into new AI infrastructure. The company has been around for over a decade, but its star has risen significantly since the launch of ChatGPT in 2022 and the AI boom that followed1

. Feldman noted that "this transaction launches us into the big league and launches high-speed inference into the mainstream"3

Source: ET

Cerebras filed for an IPO in September 2024, revealing that revenue in the second quarter approached $70 million, up from about $6 million in the second quarter of 20235

. However, the company withdrew the paperwork in October after announcing a $1.1 billion funding round that valued it at $8.1 billion. The company is now in talks to raise another billion dollars at a $22 billion valuation1

. The OpenAI deal will help diversify Cerebras away from G42, which accounted for 87% of revenue in the first half of 20245

It's worth noting that Sam Altman is already an investor in Cerebras, and OpenAI once considered acquiring the company1

. The relationship between the two companies dates back to 2017, when they first began exploring collaboration3

. Greg Brockman, OpenAI co-founder and president, stated that "this partnership will make ChatGPT not just the most capable but also the fastest AI platform in the world," helping unlock "the next generation of use cases and onboard the next billion users to AI"3

OpenAI strikes $10 billion AI chip deal with Cerebras to power faster ChatGPT responses

OpenAI and Cerebras Forge Massive Computing Partnership

Why Speed Matters for Real-Time AI Applications

Technical Advantages of Cerebras' Wafer-Scale Architecture

OpenAI's Strategy to Diversify Beyond Nvidia

Implications for Cerebras' IPO and Market Position

References

OpenAI signs deal, worth $10 billion, for compute from Cerebras | TechCrunch

OpenAI to serve ChatGPT on Cerebras' AI dinner plates

OpenAI Forges $10 Billion Deal With Cerebras for AI Computing

OpenAI has committed billions to recent chip deals. Some big names have been left out

Cerebras scores OpenAI deal worth over $10 billion ahead of AI chipmaker's IPO

Related Stories

OpenAI Partners with Broadcom for Custom AI Chips in Massive Infrastructure Expansion

AMD Challenges Nvidia's AI Dominance with Massive OpenAI Deal

Nvidia and OpenAI Forge $100 Billion Partnership to Revolutionize AI Infrastructure

Recent Highlights

Google Gemini 3.1 Pro doubles reasoning score, beats rivals in key AI benchmarks

Meta strikes up to $100 billion AI chips deal with AMD, could acquire 10% stake in chipmaker

Pentagon threatens Anthropic with supply chain risk label over AI safeguards for military use

Recent Highlights

Today's Top Stories

Wayve Secures $1.5B From Nvidia, Uber, and Automakers to Scale Self-Driving AI Globally

Meta locks in $100B AI chips deal with AMD, secures option for 10% stake to fuel AI ambitions

ChatGPT Health fails critical emergency and suicide safety tests, study finds

Anthropic launches enterprise agents with plugins for finance, HR, and engineering workflows