Intel Crescent Island targets AI inference with cheaper memory, air-cooling to challenge Nvidia

4 Sources

Share

Intel unveils Crescent Island, an AI chip designed for inference workloads that uses cost-effective LPDDR5X memory instead of expensive HBM. The air-cooled GPU supports up to 480 GB of memory and ships later this year as Intel pivots from its failed Gaudi training chips to focus on the growing inference market.

Intel Crescent Island Shifts Focus to AI Inference Market

Intel is preparing to ship an AI chip by the end of this year that takes a fundamentally different approach from its rivals, targeting the inference market rather than competing head-on with Nvidia's dominance in training workloads

1

. The Intel Crescent Island GPU, built on the new Xe3P architecture, represents the company's strategic pivot after its earlier Gaudi series saw poor sales and its planned successor was cancelled last year

3

. Kevork Kechichian, who leads Intel's data center group, told the Financial Times that the company is "starting with the basics" as it rebuilds its position in AI, explicitly avoiding the training market based on past experience

1

.

Source: Wccftech

Source: Wccftech

Cost-Effective AI Solution Through LPDDR5X Memory Innovation

Intel's upcoming AI chip distinguishes itself by using LPDDR5X memory instead of the expensive High Bandwidth Memory (HBM) found in chips like Nvidia's Blackwell and AMD's offerings

1

. At Computex 2026, Intel revealed that while the reference design includes 160 GB of LPDDR5X, the data center GPU can scale up to 480 GB of memory, giving partners flexibility to build accelerators with massive capacity

2

. This approach gives Intel a significant advantage: Crescent Island offers up to 480 GB compared to AMD Instinct MI450X's 432 GB of HBM4 and Nvidia Vera Rubin's 288 GB of HBM4

4

. The wide-and-slow approach potentially uses a 640-bit bus connecting 20 LPDDR5X devices, achieving 684 GB/s of memory bandwidth with 10.7 Gbps LPDDR5X modules

2

.

Source: Tom's Hardware

Source: Tom's Hardware

Air-Cooling and Energy-Efficient Design for Practical Deployment

The Crescent Island AI chip is designed as an air-cooled PCI Express add-in card with a 350W power target, placing its thermal requirements close to products like Nvidia's RTX Pro 5000 Blackwell card

2

. This air-cooling capability means it can drop into traditional 4U or 5U GPU servers without requiring complex and costly liquid-cooling infrastructure that Nvidia and AMD solutions demand

1

. Intel states the Xe3P microarchitecture is optimized for performance-per-watt, and the use of LPDDR5X memory cuts down power significantly

4

. Eight accelerators with a full 480 GB of RAM each would produce a dense server with 3.8 TB of local GPU memory, allowing massive models or swarms of smaller AI agents to reside within one box .

Nvidia and AMD Competitor Positioning for Agentic AI

Intel describes the Xe3P architecture as "built for agentic AI," supporting a broad range of data types from FP4 for high-performance AI inference up to FP64 for scientific computing applications

2

. The GPU focuses solely on GPGPU workloads, removing traditional graphics or 3D support to free more die area for additional AI compute

4

. This positions Intel as a Nvidia and AMD competitor specifically in the inference segment, where the semiconductor market is expected to grow substantially as companies deploy on-premise inferencing solutions. Nvidia is also targeting inference through its partnership with Groq, blending a language accelerator with its Rubin platform

3

.

Manufacturing Strategy and Market Timing Under New Leadership

The effort marks Intel's first major push into AI infrastructure under CEO Lip-Bu Tan, who took over last year after Pat Gelsinger was ousted amid concerns about his turnaround strategy

1

. Kechichian indicated Intel hopes to build the new chip in-house, stating "for all data center products we are moving aggressively into our own foundry," which would make it cheaper than rivals who rely on TSMC

1

. Intel subsequently launched advanced PC and server chips built in its own factories this year after previously relying on Taiwan Semiconductor Manufacturing Company. The chip will start shipping in limited quantities to customers by the end of this year following an 18-month development process

1

. Intel is targeting customer sampling for second-half 2026

4

.

Source: PC Gamer

Source: PC Gamer

Software Ecosystem and Global Market Considerations

Intel will support Crescent Island through its oneAPI software stack, which the company describes as "open, upstreamed, and Day 0 ready"

2

. While oneAPI is far less widely adopted than CUDA or ROCm, Intel is already evaluating its open and unified software stack for heterogeneous AI systems with its Arc Pro B-series lineup

4

. Kechichian said Intel is assessing whether a version could be sold in China in compliance with US export controls, noting "there are tiers of [the chip] that might be OK there" as Nvidia and AMD's AI chip sales to China have been blocked by trade tensions

1

. Going with LPDDR5X doesn't put pressure on valuable advanced packaging capacity or compete with higher-end accelerators for scarce HBM, potentially making it easier for Intel to produce these accelerators economically and in volume

2

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved