2 Sources
[1]
AMD Highlights the Critical Role of CPUs in End to End Agentic AI Workflows
Reasoning with inference: To provide the intelligence agents need to get work done, they rely on inference. Large language models predominantly run on GPUs with a host CPU keeping the GPUs fully utilized. To keep accelerators busy, host-node CPUs often benefit from strong per-core performance, high-frequencies and the right balance of cores (sometimes fewer are needed than you might think), memory bandwidth, I/O and networking. The correct mix in the host-node CPU can keep GPU clusters fed with instructions so each cluster delivers as many tokens as possible. The AMD EPYC™ 9575F processor delivers on this high single-core performance with 64 cores capable of running at up to 5Ghz. "Venice" will further extend EPYC CPUs' high-frequency offerings. In conversations with enterprise customers, a couple of patterns stand out. First, many standardize their CPU infrastructure purchases around legacy specifications, such as using 16- and 32-core CPUs. Agentic workflows need higher core counts for some agentic stages, higher frequencies for others - and customers need the flexibility to configure for both. The mindset should shift from a single CPU standard to a portfolio matched to the agentic workflow. Second, there's a multiplier effect on enterprise applications and inference servers that comes as agents become greater users of existing IT infrastructure. Once you give employees the ability to build and deploy their own agents, agentic adoption grows rapidly. IT planning teams should ask what happens to their infrastructure - examples include databases; platforms for enterprise resource planning and customer relationship management, business intelligence, identity management; and inference servers - when agents dramatically increase usage. The Question for CIOs Agentic AI is changing how enterprises size their infrastructure. IT leaders who treat it as a monolithic problem - one GPU strategy or a one-size-fits-all CPU - will likely hit challenges. But as agents proliferate, those who plan for a diverse end-to-end workflow with different compute needs at each stage can scale more efficiently. The question worth asking isn't how many CPUs or GPUs your business needs for agentic AI. It's whether you're matching infrastructure to the way agentic AI works with its many stages across workloads. If you map those stages early and choose the right compute profile for each, your business will be well positioned for speed and efficiency as they scale.
[2]
Agentic AI Isn't One Workload. It's an End-to-End Workflow.
Much of the AI infrastructure conversation starts with an AI model running on GPUs. But in practice, AI infrastructure demands are increasingly determined by the workflow around the model. Agentic AI systems do not simply answer a prompt. They interpret intent, retrieve context, plan next steps, call tools, apply policy, run sandbox code, execute transactions, observe outcomes and return a result. Each step is a different workload, with all adding up to a varied workflow. Some demand high core density. Some benefit from high frequency and predictable latency. Others depend on memory capacity, I/O, data locality, power efficiency or the ability to host many concurrent services. As agentic AI becomes more pervasive, infrastructure teams need more than a single compute profile. CIOs and enterprise decision-makers need a portfolio of CPUs matched to the full agentic workflow. The AMD EPYC™ server CPU portfolio is ideally positioned to play those roles - not as a single CPU with a one-size-fits-all answer, but each as a unique part for agentic AI's many workloads. (Learn more about CPU importance in agentic AI in my earlier blog, Agentic AI Changes the CPU/GPU Equation.) Inside the Agentic AI Workflow When an agent takes on a task, it breaks the goal into steps and works through them, often looping back multiple times before finishing. In a typical sequence, the request hits a gateway where policies are enforced. A planning layer - often running smaller AI models - determines how to route the task. The agent then queries databases, invokes a GPU cluster for deeper reasoning, executes tools based on that reasoning, verifies the output and decides whether to loop again or exit. This explains why agentic AI should be viewed as an end-to-end workflow, not as a single workload. The right infrastructure strategy starts by mapping each workflow and then assigning the right CPU resources to it. AMD focuses on every step along the workflow: EPYC CPUs for high-frequency and high-density compute, AMD Instinct™ accelerators for AI inference and training, and Pensando™ networking to help move data predictably. Where Latency Matters, Where Throughput Wins - and Where You Need Both Each stage of the workflow has different needs, which is why we built the AMD EPYC portfolio around a mix of profiles. * Agentic orchestration, sandbox execution, tool calls: When you need many agents simultaneously running sandbox code (e.g., Python), calling APIs or querying databases, core density can matter more than clock speed. Our 5th Gen AMD EPYC™ server CPUs offer up to 192 cores and 384 threads with simultaneous multithreading. Later this year, our next-generation EPYC processors, codenamed "Venice," will push that to 256 cores and 512 threads. * Tool execution on enterprise applications: The ability to call the tools or enterprise applications makes agents useful. CPUs with a broad set of core counts combined with high performance handle the volume and variety of incoming requests. The AMD EPYC™ 9005 family of processors delivers on this balance with 8 to 192 cores and up to 640GB/s of memory bandwidth, with "Venice" extending core/thread count by 1.3x and memory bandwidth by 2.5x. * Reasoning with inference: To provide the intelligence agents need to get work done, they rely on inference. Large language models predominantly run on GPUs with a host CPU keeping the GPUs fully utilized. To keep accelerators busy, host-node CPUs often benefit from strong per-core performance, high-frequencies and the right balance of cores (sometimes fewer are needed than you might think), memory bandwidth, I/O and networking. The correct mix in the host-node CPU can keep GPU clusters fed with instructions so each cluster delivers as many tokens as possible. The AMD EPYC™ 9575F processor delivers on this high single-core performance with 64 cores capable of running at up to 5Ghz. "Venice" will further extend EPYC CPUs' high-frequency offerings. The Legacy Challenge In conversations with enterprise customers, a couple of patterns stand out. First, many standardize their CPU infrastructure purchases around legacy specifications, such as using 16- and 32-core CPUs. Agentic workflows need higher core counts for some agentic stages, higher frequencies for others - and customers need the flexibility to configure for both. The mindset should shift from a single CPU standard to a portfolio matched to the agentic workflow. Second, there's a multiplier effect on enterprise applications and inference servers that comes as agents become greater users of existing IT infrastructure. Once you give employees the ability to build and deploy their own agents, agentic adoption grows rapidly. IT planning teams should ask what happens to their infrastructure - examples include databases; platforms for enterprise resource planning and customer relationship management, business intelligence, identity management; and inference servers - when agents dramatically increase usage. The Question for CIOs Agentic AI is changing how enterprises size their infrastructure. IT leaders who treat it as a monolithic problem - one GPU strategy or a one-size-fits-all CPU - will likely hit challenges. But as agents proliferate, those who plan for a diverse end-to-end workflow with different compute needs at each stage can scale more efficiently. The question worth asking isn't how many CPUs or GPUs your business needs for agentic AI. It's whether you're matching infrastructure to the way agentic AI works with its many stages across workloads. If you map those stages early and choose the right compute profile for each, your business will be well positioned for speed and efficiency as they scale.
Share
Copy Link
AMD is reframing the AI infrastructure conversation by emphasizing that agentic AI workflows require diverse CPU capabilities, not just GPU power. The company's EPYC processor portfolio addresses different workflow stages—from orchestration to inference—with varying core counts and frequencies. As agents proliferate in enterprises, IT teams face a multiplier effect on existing infrastructure.
AMD is challenging the conventional wisdom that AI infrastructure begins and ends with GPUs. Instead, the chipmaker argues that agentic AI workflows demand a portfolio approach to CPUs, with different processors optimized for distinct stages of how AI agents operate
1
2
. When an agent receives a task, it doesn't simply generate a response. It interprets intent, retrieves context, plans steps, calls tools, applies policy, runs sandbox code, executes transactions, observes outcomes, and returns results. Each step represents a different workload with unique computational needs, making end-to-end agentic AI workflows far more complex than traditional AI deployments.
Source: DT
The AMD EPYC CPU portfolio addresses three critical workflow stages with specialized processors. For orchestration and tool execution, where multiple agents simultaneously run sandbox code or query databases, core counts matter more than clock speed. The 5th Gen AMD EPYC server CPUs deliver up to 192 cores and 384 threads, while the upcoming "Venice" processors will push that to 256 cores and 512 threads
2
. For enterprise application integration, the AMD EPYC 9005 family provides 8 to 192 cores with up to 640GB/s of memory bandwidth, scaling to handle varied incoming requests. When agents need reasoning capabilities through LLMs running on GPUs, host-node CPUs require high single-core performance to keep accelerators fully utilized. The AMD EPYC 9575F processor delivers 64 cores capable of running at up to 5GHz, ensuring GPU clusters generate maximum tokens1
.AMD identifies two patterns creating enterprise adoption challenges for agentic AI. Many organizations standardize CPU infrastructure purchases around legacy CPU specifications such as 16- and 32-core processors. However, agentic workflows need higher core counts for some stages and higher frequencies for others, requiring flexibility rather than rigid standardization
2
. The mindset must shift from a single CPU standard to a portfolio matched to workflow requirements. Once employees gain the ability to build and deploy their own agents, agentic adoption accelerates rapidly, creating a multiplier effect on IT infrastructure that extends to databases, enterprise resource planning platforms, customer relationship management systems, business intelligence tools, identity management, and inference servers1
.Related Stories
IT leaders treating agentic AI as a monolithic problem with one GPU strategy or a one-size-fits-all CPU approach will likely encounter scaling challenges. AMD's position centers on matching infrastructure to how agentic AI actually works across its many stages and workloads. The company's strategy encompasses EPYC CPUs for high-frequency and high-density compute, AMD Instinct accelerators for AI inference and training, and Pensando networking to move data predictably
2
. As agents become greater users of existing IT infrastructure, planning teams need to anticipate what happens when usage dramatically increases. The question worth asking isn't how many CPUs or GPUs a business needs for agentic AI, but whether infrastructure matches the way agentic AI operates with different compute needs at each stage. Organizations that map these stages early and choose the right compute profile for each workflow component will be better positioned for speed and efficiency as they scale their agentic deployments.Summarized by
Navi
02 Jun 2026•Technology

26 Mar 2026•Technology

28 Apr 2026•Technology

1
Policy and Regulation

2
Policy and Regulation

3
Policy and Regulation
