OpenAI and Broadcom unveil Jalapeño chip to power next-generation AI inference at scale

Reviewed byNidhi Govil

28 Sources

Share

OpenAI and Broadcom have announced Jalapeño, a custom AI processor designed specifically for large language model inference in data centers. The chip promises better performance per watt than current options and marks OpenAI's push toward vertical integration. With deployment planned by year-end, this move signals OpenAI's strategy to reduce dependence on Nvidia while controlling its entire AI infrastructure stack.

OpenAI and Broadcom Reveal Custom-Built Inference Processor

OpenAI and Broadcom have jointly announced Jalapeño, a custom AI processor designed exclusively for large language model inference at scale. The chip represents OpenAI's first foray into custom silicon, developed in partnership with Broadcom as the implementation and integration partner

1

. This ASIC (Application-Specific Integrated Circuit) was built from scratch based on detailed insights from OpenAI researchers, with the design process informed by OpenAI's roadmap for future models and products

1

. The development cycle took just nine months from initial design to manufacturing tape-out, which OpenAI claims is the fastest ASIC development cycle ever in the high-performance semiconductor space

5

.

Source: Axios

Source: Axios

Performance Per Watt Claims and Technical Specifications

While final performance metrics remain under evaluation, OpenAI states that early testing shows the Jalapeño chip will deliver performance per watt substantially better than current state-of-the-art alternatives

1

. Broadcom CEO Hock Tan told Reuters that Jalapeño matches the performance of Nvidia's Blackwell chips and Google's Tensor Processing Units

3

. The chip architecture was optimized around kernels, memory movement, networking, and serving patterns that matter most for frontier AI models

4

. Engineering samples are already operating in the lab at target clock speed and power, running machine learning workloads such as GPT-5.3-Codex-Spark

4

. Based on available imagery, the chip features a massive compute chiplet measuring approximately 840 mm² surrounded by six HBM modules, approaching the reticle size of EUV lithography systems

4

.

Source: VentureBeat

Source: VentureBeat

Reducing Dependence on Nvidia Through Vertical Integration

The Jalapeño chip represents a strategic move by OpenAI to reduce its dependence on Nvidia GPUs, which are in limited supply amid a global compute crunch

1

. OpenAI president Greg Brockman explained the company's approach, stating they have been looking for specific workloads that are underserved and asking how they can build something to accelerate what's possible

2

. The company hopes to ultimately own the full stack behind its models and products, from chip architecture and kernels to memory systems, networking, scheduling, deployment systems, and product experience

2

. This vertical integration strategy aims to make models faster, more reliable, and more affordable for users

2

.

Source: Decrypt

Source: Decrypt

Purpose-Built for Inference, Not Training

Unlike general-purpose AI accelerators, the Jalapeño chip is specifically designed for inference—the process of running pre-built AI models in response to user commands—rather than training

2

. OpenAI emphasizes the chip's low operating cost when running real-time coding models, though more performance-intensive tasks like pre-training will likely still rely on Nvidia hardware

2

. The architecture addresses practical bottlenecks that matter for inference at scale, including costly data movement, balance between compute and memory resources, networking efficiency, and overall behavior

4

. The design aims to wed high throughput with low latency, which will be particularly useful for reasoning and agentic AI workloads

4

.

Deployment Timeline and Competitive Landscape

Both OpenAI and Broadcom claim that Jalapeño chips will be deployed in data centers by the end of 2026

1

. The chip is positioned as the first step in a multi-generation compute platform

3

. OpenAI joins other tech giants like Microsoft, Meta, Amazon, and Google who have launched custom-designed AI chips to power their servers for either training or inference

3

. While Broadcom has been a successful chipmaker for compute infrastructure, it has seen substantial movement recently building new business around providing custom silicon to hyperscalers and teams building frontier models during the AI boom

1

. The move toward custom silicon reflects the industry's efforts to squeeze out more compute capacity amid limited data center capacity

1

.

AI-Assisted Design and Economic Implications

In a notable twist, OpenAI's own AI models assisted in the development of the Jalapeño chip, with the company using its models to accelerate parts of the design and optimization process

5

. This marks a significant milestone where AI is now helping design the chips it will run on. Optimizing inference costs could prove crucial for AI economics going forward, potentially improving OpenAI's bottom line

2

. However, questions remain about funding this capital-intensive initiative, given that OpenAI reportedly ran an operating loss of over $20 billion last year according to leaked financials

5

. A detailed technical report on Jalapeño's performance will be presented in the coming months

1

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved