2 Sources
[1]
Intel and pals cram 36,864 CPU cores into a 100kW rack while chasing the agentic AI dragon
COMPUTEX 2026 Intel is working with Foxconn and other infrastructure providers to develop rack-scale reference designs based on the chipmaker's Xeon processors. Announced during Intel's Computex keynote on Tuesday, these blueprints aim to provide greater CPU compute densities for running AI agents at scale. While AI models predominantly run on GPUs and other AI accelerators, the agent harnesses, like OpenClaw, which are used to connect them to tools, terminal shells, code interpreters, and other APIs, still run on CPUs. "Our customers are asking us to think at the system level to help them serve real agentic workloads at scale," Intel CEO Lip Bu Tan said. On stage, Tan revealed two examples of these blueprints. One is aimed at latency-sensitive agentic workloads and another designed for maximum density. Both designs support up to 128 of either Intel's 128-core Granite Rapids Xeon 6 or 288-core Clearwater Forest Xeon 6+ processors, totaling between 16,384 P-cores and 36,864 E-cores, alongside up to 384 TB of DDR5 in a 100kW power envelope. The reference designs come just months after Nvidia announced a similar rack-scale CPU platform packing 256 of its 88-core Vera CPUs. Arm is also working on a pair of rack-scale reference designs for agentic workloads based on its new AGI CPUs: a 36 kW air-cooled system with 8,160 cores and a 200 kW liquid cooled rack with 45,696 cores. Tan expects systems based on these reference designs to be broadly available from its ODM and OEM partners. Alongside agentic AI workloads, the company also revealed that newly launched inference cloud provider Vector Core Compute will be among the first to deploy the platform, and that Together.AI is its first commercial customer. The approach is based on Intel's earlier disaggregated AI blueprint it co-developed with partner SambaNova. The architecture desegregates compute heavy prefill operations to Nvidia GPUs while using SambaNova's AI accelerators for bandwidth-intensive decode operations to boost per-user token output by between 2-3x. If that sounds familiar it's not dissimilar to what Nvidia is doing with Groq's LPUs or what AWS is doing with Trainium and Cerebra's waferscale AI accelerators.®
[2]
Computex 2026: Intel announces Xeon 6+ processors, says AI will make CPUs important again
It's that time of the year again. Computex 2026 is happening in Taipei, Taiwan and almost every other brand is once again talking about AI. We got AI-powered laptops, home appliances, PC chips, and much more. Now when we say AI, chances are you automatically thought of a GPU. But Intel has a different take. The brand believes that AI will put the spotlight firmly on CPUs. In a press release, Intel said, "With the emergence of agentic AI, the growing demand for AI inference is changing the balance of power in the data center, returning the CPU to a position of prominence." Which, in simpler words, means that the next phase of AI may rely heavily on CPUs once again. And the reason behind this could be agentic AI. But before we delve deep into this, let us take a moment and talk about Agentic AI and what it really is. Also read: ROG turns 20: Asus unveils Strix Scar 18, Xbox Ally X20 bundle and more at Computex 2026 What is agentic AI? Agentic AI is simply an AI tool that can do tasks on its own instead of just replying to your questions. For instance, in the case of ChatGPT, you ask it something and it gives you an answer. That's it. But, agentic AI goes a step further. In this case, instead of waiting for commands one by one, it can plan actions, make decisions, and complete multi-step tasks automatically. For example, if you ask a normal AI assistant about some cheap flights to Tokyo, it will give you some links and call it a day. But an agentic AI assistant will search flights, compare prices, check your calendar, book the tickets, reserve a hotel, and remind you before the trip. And it will do all of that with minimal input from you. Agentic AI systems need to constantly coordinate tasks, manage memory, handle workflows, and communicate with other software. And according to companies like Intel, CPUs become more important for managing all that behind the scenes. Why Intel thinks CPUs matter again Intel CEO Lip-Bu Tan, during his keynote, said that with the rise of 'inference, agentic, and physical AI, Intel is poised to bring the world new innovations from the chip to systems level that promise to transform industry and society for the better'. The Intel CEO also added that they are proud to join all their partners in building 'great products that will delight customers and bring the power of AI to more people'. Now for the last few years, GPUs have dominated almost every AI conversation. Whether it is ChatGPT, image generators, or advanced enterprise AI systems, GPUs are usually the hardware doing the heavy lifting. But Intel believes the industry is slowly moving into a different phase now. The company says AI is shifting from simply training models to actually deploying and running them at scale. This process is called inference. In simple words, training is when you teach an AI model. Inference is when people actually start using it in the real world. And according to Intel, inference workloads are growing very quickly because of agentic AI systems. Think about it this way. If millions of people start using AI agents that continuously perform tasks, coordinate apps, process workflows, and manage requests in real time, data centres will suddenly have to deal with a lot more orchestration and communication between systems. That is where CPUs come in. Intel argues that CPUs are extremely important for handling things like scheduling, memory allocation, task coordination, concurrency, and moving data between components. GPUs are still incredibly important for AI processing itself, but CPUs become the "manager" that keeps the whole system running smoothly. Interestingly, Intel even referenced analyst Ben Bajarin during its announcement. According to him, AI infrastructure could slowly move from a one-CPU-to-four-GPU setup toward something much closer to one CPU per GPU in the future. Now no, this does not mean GPUs are suddenly becoming irrelevant. NVIDIA is still dominating the AI space right now. But Intel clearly believes CPUs are about to become much more important than people think. Intel's Xeon 6+ processors To support this AI push, Intel announced its new Xeon 6+ processors at Computex 2026. These are next-generation server CPUs built using Intel's 18A process technology, which is a major step for the company itself. Intel says the chips are specifically designed for cloud-native AI workloads, networking, and large-scale inference systems. But what the company repeatedly highlighted was efficiency and density. According to Intel, a single liquid-cooled rack powered by Xeon 6+ processors can deliver up to 36,864 cores inside 32U of compute space. Intel says this allows extremely high "agent density" for AI infrastructure while operating around 100-kilowatt rack power. Now admittedly, most consumers will never directly interact with hardware like this. But these systems quietly power a huge chunk of the internet. Everything from AI chatbots and cloud apps to streaming services and enterprise platforms relies heavily on data centre infrastructure. And power efficiency is becoming a massive issue in this space. Modern AI data centres consume enormous amounts of electricity. As AI workloads continue growing, companies are desperately trying to find ways to deliver more performance without massively increasing energy consumption. Intel appears to be positioning Xeon 6+ as a solution for exactly that problem.
Share
Copy Link
Intel announced rack-scale reference designs at Computex 2026 featuring up to 36,864 E-cores in a single 100kW rack, developed with Foxconn to handle agentic AI workloads at scale. The move signals a strategic shift as CPU compute densities become critical for AI agents that coordinate tasks, manage workflows, and communicate across systems.
Intel revealed ambitious rack-scale reference designs at Computex 2026, partnering with Foxconn and other infrastructure providers to deliver unprecedented CPU compute densities for agentic AI deployments. The blueprints support up to 128 of either Intel's 128-core Granite Rapids Intel Xeon 6 or 288-core Clearwater Forest Intel Xeon 6+ processors, delivering between 16,384 P-cores and 36,864 E-cores alongside up to 384 TB of DDR5 memory in a 100kW rack power envelope
1
. Intel CEO Lip Bu Tan emphasized during his keynote that customers are demanding system-level thinking to serve real agentic workloads at scale, marking a strategic pivot as the industry moves beyond GPU-dominated training toward inference-heavy deployments1
.
Source: The Register
While AI models predominantly run on GPUs and other AI accelerators, the agent harnesses that connect them to tools, terminal shells, code interpreters, and APIs still run on CPUs
1
. Agentic AI represents a fundamental shift from passive question-answering systems to autonomous agents that plan actions, make decisions, and complete multi-step tasks automatically. Instead of simply providing flight information, an agentic AI assistant searches flights, compares prices, checks calendars, books tickets, reserves hotels, and sends reminders with minimal user input2
. These systems require constant coordination of tasks, memory management, workflow handling, and software communication, making CPU capabilities essential for managing operations behind the scenes2
.
Source: Digit
Tan revealed two distinct rack-scale reference designs during the keynote: one optimized for latency-sensitive agentic workloads and another engineered for maximum density
1
. The announcement comes just months after Nvidia unveiled a similar rack-scale CPU platform packing 256 of its 88-core Vera CPUs, while Arm is developing a pair of reference designs for agentic workloads based on its new AGI CPUs: a 36 kW air-cooled system with 8,160 cores and a 200 kW liquid-cooled rack with 45,696 cores1
. Tan expects systems based on these blueprints to become broadly available from ODM and OEM partners, with newly launched inference cloud provider Vector Core Compute among the first to deploy the platform and Together.AI serving as the first commercial customer1
.Related Stories
Intel argues that CPUs handle critical functions like scheduling, memory allocation, task coordination, concurrency, and data movement between components, positioning them as the "manager" that keeps entire systems running smoothly even as GPU processing remains vital
2
. The company referenced analyst Ben Bajarin, who suggests AI infrastructure could shift from a one-CPU-to-four-GPU setup toward something closer to one CPU per GPU in the future2
. The approach builds on Intel's earlier disaggregated AI blueprint co-developed with SambaNova, which separates compute-heavy prefill operations to Nvidia GPUs while using SambaNova's AI accelerators for bandwidth-intensive decode operations to boost per-user token output by 2-3x1
.Built using Intel's 18A process technology, the Intel Xeon 6 processors target cloud-native AI workloads, networking, and large-scale inference systems with emphasis on efficiency and density
2
. A single liquid-cooled rack powered by Xeon 6+ processors delivers up to 36,864 cores inside 32U of compute space, enabling extremely high "agent density" for AI infrastructure while operating around 100-kilowatt rack power2
. As millions of users begin deploying AI agents that continuously perform tasks, coordinate applications, process workflows, and manage requests in real time, data centers face dramatically increased orchestration and communication demands between systems, making these high-density CPU configurations increasingly relevant for handling inference workloads at scale2
.Summarized by
Navi
[1]
04 Mar 2026•Technology
26 Mar 2026•Technology

25 Sept 2024

1
Policy and Regulation

2
Technology

3
Technology

1
Pope Leo XIV releases major AI encyclical calling for 'disarmament' of artificial intelligence

2
Apple's Siri overhaul for iOS 27 brings Gemini integration and standalone app to compete with ChatGPT

3
Nvidia unveils RTX Spark chip to chase $200B CPU market with AI agent PCs from Microsoft, Dell, and HP
