OpenAI unveils Jalapeño chip to reduce dependence on Nvidia and make ChatGPT cheaper to run

Reviewed byNidhi Govil

3 Sources

Share

OpenAI has unveiled Jalapeño, its first custom-built AI inference chip developed with Broadcom and Celestica. Designed specifically for running large language models like ChatGPT, the processor could make AI responses faster and cheaper while reducing OpenAI's reliance on Nvidia. Engineering samples are already running workloads including GPT-5.3-Codex-Spark, with full deployment planned by the end of 2026.

News article

OpenAI's Custom AI Chip Targets Inference Efficiency

OpenAI has unveiled Jalapeño, its first custom-built AI inference chip developed in partnership with Broadcom and system partner Celestica

1

2

. Unlike general-purpose graphics processors from Nvidia that dominate today's AI infrastructure, the Jalapeño chip was designed from the ground up for one specific job: answering user prompts as quickly and efficiently as possible. The application-specific integrated circuit (ASIC) focuses on AI inference, the process that allows large language models like ChatGPT to generate responses after training is complete

2

.

OpenAI designed the chip from scratch in just nine months, with Broadcom handling silicon implementation, networking, and connectivity, while Celestica developed the physical computing hardware including circuit boards, hardware frames, and system integration

2

. Engineering samples are already running machine learning workloads, including GPT-5.3-Codex-Spark, at target performance and power levels

1

2

.

Strategic Move to Reduce Dependence on Nvidia

The Sam Altman-led company has been one of Nvidia's largest customers for AI chips, but surging demand for its models is pushing it to diversify its compute stack

2

. While Nvidia remains the undisputed leader in AI hardware, those chips were designed to tackle a wide variety of computing workloads, not exclusively the task of serving billions of chatbot responses every day

1

. By developing OpenAI's custom AI chip, the company aims to reduce dependence on Nvidia and gain greater control over its supply chain

3

.

According to OpenAI and Broadcom, the new chip was architected around the memory, networking, and computing patterns used by modern large language models, allowing it to perform those workloads more efficiently than conventional hardware

1

. Early internal testing suggests it delivers better performance per watt than today's leading AI accelerators, although independent benchmarks have not yet been released

1

. OpenAI has claimed the chip could lower costs by up to 50% while achieving significant energy savings and faster processing speeds

3

.

Making ChatGPT Cheaper to Run and More Scalable

If OpenAI can answer more questions while using less electricity and fewer expensive chips, the economics of running ChatGPT begin to change

1

. Lower infrastructure costs could allow OpenAI to respond to prompts faster, support more users during peak demand, reduce outages caused by hardware shortages, and roll out more capable AI models without dramatically increasing operational costs

1

. The custom-built AI inference chip is designed to optimize AI inference tasks by aligning hardware with the specific requirements of OpenAI's AI models, eliminating critical bottlenecks in AI deployment

3

.

Jalapeño marks the first generation of OpenAI's in-house compute platform, with initial deployment planned by the end of 2026

2

. OpenAI has described Jalapeño as the first step in a multi-generation compute platform, suggesting this won't be its last custom processor

1

. The companies first disclosed the project last October, aiming at deployment of over 10 gigawatts of custom AI accelerators used for large language model inference

2

.

Vertical Integration and AI-Assisted Design

By designing its own processors while continuing to build its own models and products, OpenAI is gradually taking control of more of the entire AI stack through vertical integration

1

. Companies that design both their hardware and software can optimize them to work together, which helps improve performance and cost efficiency while reducing long-term operational costs

1

.

In a notable innovation, OpenAI utilized its own AI models to assist in the design and optimization of Jalapeño, demonstrating how AI can streamline hardware development and reduce both costs and timelines

3

. Manufacturing and assembly are being handled by TSMC and Celestica, ensuring a streamlined production process

3

. This collaborative approach highlights the importance of partnerships in achieving innovative advancements in AI hardware

3

.

Broader Industry Shift Toward Custom Silicon

Custom ASICs are alternatives to general-purpose processors such as the graphics processing units produced by giants such as Nvidia. While Nvidia's H100 and A100 GPUs remain the industry's general-purpose AI processors, major cloud and AI companies are increasingly developing custom chips tailored to their own workloads

2

. Research firm 650 Group estimated that the market for custom data center AI chips was expected to double to $20 billion in 2025

2

.

OpenAI has also struck deals with Amazon Web Services to use its Trainium AI chips and has partnered with Advanced Micro Devices and AI chipmaker Cerebras to expand its access to AI hardware

2

. Other AI companies are following similar paths, with Anthropic signing a 10-year, $100 billion deal with AWS to use Trainium chips and announcing support for Google TPU

2

. Google introduced its inference-optimized Ironwood TPU last year and has ordered more than three million TPUs from Intel for delivery in 2028

2

. The shift toward custom silicon reflects growing demand for energy efficiency and cost optimization as AI companies scale their operations and seek to maintain competitive advantages in an increasingly crowded market.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved