Intel packs 36,864 cores into 100 kW rack as agentic AI shifts spotlight back to CPUs

Intel announced rack-scale reference designs at Computex 2026 featuring up to 36,864 E-cores in a single 100 kW rack, developed with Foxconn to handle agentic AI workloads at scale. The move signals a strategic shift: CPU compute density is becoming critical for AI agents that coordinate tasks, manage workflows, and communicate across systems.

Intel Xeon 6+ Targets Growing Demand for Agentic AI Infrastructure

Intel revealed rack-scale reference designs at Computex 2026, partnering with Foxconn and other infrastructure providers to raise CPU compute density for agentic AI deployments. The blueprints support up to 128 processors per rack, either Intel's 128-core Granite Rapids Xeon 6 or its 288-core Clearwater Forest Xeon 6+, for a total of 16,384 P-cores or 36,864 E-cores, alongside up to 384 TB of DDR5 memory in a 100 kW rack power envelope [1]. Intel CEO Lip-Bu Tan said during his keynote that customers are demanding system-level thinking to serve real agentic workloads at scale, a strategic pivot as the industry moves beyond GPU-dominated training toward inference-heavy deployments [1].
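
The rack totals quoted above follow from simple multiplication, and a short sketch makes the arithmetic easy to check. The socket count, core counts, memory, and power figures come from the article; the per-socket and per-core values are derived here and assume the maximum memory and power figures apply to a fully populated 128-socket rack, which the article does not state explicitly. The function name `rack_summary` is hypothetical.

```python
# Figures quoted in the article.
SOCKETS_PER_RACK = 128
RACK_POWER_W = 100_000      # 100 kW rack power envelope
RACK_MEMORY_TB = 384        # "up to" 384 TB of DDR5

CORES_PER_SOCKET = {
    "Xeon 6 (Granite Rapids, P-cores)": 128,
    "Xeon 6+ (Clearwater Forest, E-cores)": 288,
}


def rack_summary(cores_per_socket: int) -> dict:
    """Derive rack-level totals from per-socket core counts.

    Assumption: the maximum memory and the full power envelope both
    apply to a rack populated with all 128 sockets.
    """
    total_cores = SOCKETS_PER_RACK * cores_per_socket
    return {
        "total_cores": total_cores,
        "memory_tb_per_socket": RACK_MEMORY_TB / SOCKETS_PER_RACK,
        # Assumption: 1 TB is treated as 1024 GB for this estimate.
        "memory_gb_per_core": RACK_MEMORY_TB * 1024 / total_cores,
        # Rack power also feeds memory, networking, and cooling, so this
        # is an upper bound on the per-core budget, not a measured draw.
        "rack_watts_per_core": RACK_POWER_W / total_cores,
    }


for name, cores in CORES_PER_SOCKET.items():
    summary = rack_summary(cores)
    print(name, summary)
```

Under these assumptions, a full rack works out to 3 TB of memory per socket, and the E-core configuration leaves a rack-level budget of under 3 W per core.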

Source: The Register

Why Agentic AI Puts CPUs Back in the Spotlight

While AI models predominantly run on GPUs and other AI accelerators, the agent harnesses that connect them to tools, terminal shells, code interpreters, and APIs still run on CPUs [1]. Agentic AI represents a fundamental shift from passive question-answering systems to autonomous agents that plan actions, make decisions, and complete multi-step tasks automatically. Instead of simply providing flight information, an agentic AI assistant searches flights, compares prices, checks calendars, books tickets, reserves hotels, and sends reminders with minimal user input [2]. These systems require constant coordination of tasks, memory management, workflow handling, and software communication, making CPU capabilities essential for managing operations behind the scenes [2].
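
To illustrate why the harness is CPU work, here is a minimal sketch of an agent loop built around the flight-booking example. Everything in it is hypothetical: the tool functions, the flight codes, and `fake_model`, which stands in for an accelerator-hosted model by replaying a fixed script. It is not any vendor's harness, only an outline of the general pattern in which one step per iteration goes to the model and the rest (dispatch, tool execution, state tracking) runs on the CPU.

```python
from typing import Callable


# Hypothetical tools: stand-ins for the APIs a real harness would call.
def search_flights(destination: str) -> list:
    return [{"flight": "XX100", "price": 420}, {"flight": "XX200", "price": 380}]


def check_calendar(date: str) -> bool:
    return True  # pretend the user is free on that date


def book_ticket(flight: str) -> str:
    return f"booked {flight}"


TOOLS: dict[str, Callable] = {
    "search_flights": search_flights,
    "check_calendar": check_calendar,
    "book_ticket": book_ticket,
}


def fake_model(history: list) -> dict:
    """Stand-in for the model: replays a fixed script of tool calls."""
    script = [
        {"tool": "search_flights", "args": {"destination": "TPE"}},
        {"tool": "check_calendar", "args": {"date": "2026-06-02"}},
        {"tool": "book_ticket", "args": {"flight": "XX200"}},
        {"tool": None, "answer": "Trip booked on XX200."},
    ]
    return script[len(history)]


def run_agent(max_steps: int = 8) -> tuple:
    """Run the loop until the model returns a final answer."""
    history = []
    for _ in range(max_steps):
        action = fake_model(history)  # the only step that would reach an accelerator
        if action["tool"] is None:
            return action["answer"], history
        result = TOOLS[action["tool"]](**action["args"])  # CPU-side tool execution
        history.append({"action": action, "result": result})
    raise RuntimeError("agent did not finish within max_steps")


answer, steps = run_agent()
print(answer, f"({len(steps)} tool calls)")
```

In this toy run, three of the four loop iterations end in tool execution on the CPU; a production harness would add sandboxing, retries, and concurrency across many agents, all of which are also host-side work.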

Source: Digit

Rack-Scale Reference Designs Address Latency and Density Requirements

Tan revealed two rack-scale reference designs during the keynote: one optimized for latency-sensitive agentic workloads and another engineered for maximum density [1]. The announcement comes just months after Nvidia unveiled a similar rack-scale CPU platform packing 256 of its 88-core Vera CPUs. Arm, meanwhile, is developing a pair of reference designs for agentic workloads based on its new AGI CPUs: a 36 kW air-cooled system with 8,160 cores and a 200 kW liquid-cooled rack with 45,696 cores [1]. Tan expects systems based on these blueprints to become broadly available from ODM and OEM partners, with newly launched inference cloud provider Vector Core Compute among the first to deploy the platform and Together.AI serving as the first commercial customer [1].
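
The competing designs can be put side by side on a cores-per-kilowatt basis. The sketch below uses only the core counts and power figures quoted in the article; the article gives no power figure for the Nvidia platform, so that entry is left without a ratio. Core counts are not directly comparable across architectures, so this is a rough density comparison, not a performance ranking. The `RackDesign` class is a hypothetical helper.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class RackDesign:
    name: str
    cores: int
    power_kw: Optional[float]  # None where the article gives no power figure

    def cores_per_kw(self) -> Optional[float]:
        if self.power_kw is None:
            return None
        return self.cores / self.power_kw


DESIGNS = [
    RackDesign("Intel Xeon 6+ (E-cores)", 128 * 288, 100.0),
    RackDesign("Nvidia Vera", 256 * 88, None),
    RackDesign("Arm air-cooled", 8_160, 36.0),
    RackDesign("Arm liquid-cooled", 45_696, 200.0),
]

for design in DESIGNS:
    ratio = design.cores_per_kw()
    ratio_text = f"{ratio:.1f} cores/kW" if ratio is not None else "power not stated"
    print(f"{design.name:28s} {design.cores:>6,d} cores  {ratio_text}")
```

On these figures, Intel's E-core rack comes to roughly 369 cores per kilowatt, against about 227 to 228 for the two Arm designs, while the Nvidia platform totals 22,528 cores.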

AI Infrastructure Shifts Toward CPU-GPU Balance

Intel argues that CPUs handle critical functions like scheduling, memory allocation, task coordination, concurrency, and data movement between components, positioning them as the "manager" that keeps entire systems running smoothly even as GPU processing remains vital [2]. The company referenced analyst Ben Bajarin, who suggests AI infrastructure could shift from one CPU for every four GPUs toward something closer to one CPU per GPU in the future [2]. The approach builds on Intel's earlier disaggregated AI blueprint co-developed with SambaNova, which assigns compute-heavy prefill operations to Nvidia GPUs and bandwidth-intensive decode operations to SambaNova's AI accelerators, boosting per-user token output by 2-3x [1].

Cloud-Native AI Workloads Drive Xeon 6+ Architecture

Built using Intel's 18A process technology, the Intel Xeon 6+ processors target cloud-native AI workloads, networking, and large-scale inference systems with emphasis on efficiency and density [2]. A single liquid-cooled rack powered by Xeon 6+ processors delivers up to 36,864 cores inside 32U of compute space, enabling extremely high "agent density" for AI infrastructure while drawing around 100 kW of rack power [2]. As millions of users begin deploying AI agents that continuously perform tasks, coordinate applications, process workflows, and manage requests in real time, data centers face sharply higher orchestration and communication demands between systems, making these high-density CPU configurations increasingly relevant for handling inference workloads at scale [2].
