Qualcomm HBC: 6x Bandwidth-Per-Watt vs HBM

Qualcomm Targets AI Infrastructure with Revolutionary Memory Architecture

Qualcomm announced its ambitious entry into the AI data center market at Investor Day 2026, unveiling High-Bandwidth Compute (HBC), an HBC near-memory AI architecture designed to address the AI memory wall bottleneck that has constrained performance across the industry 1

. The technology represents a distinct shift in the company's AI infrastructure solution strategy, moving beyond its mobile Snapdragon processors to challenge established players like Nvidia and AMD in datacenter deployments 2

Source: Tom's Hardware

How HBC Stacks Compute Beneath DRAM for Efficiency Gains

The core innovation behind Qualcomm's approach involves disaggregating AI accelerators from the system-on-chip and positioning them directly underneath LPDDR DRAM stacks. This configuration leverages Through-Silicon Vias (TSVs) to create ultra-short data paths between compute and memory 1

. Tony Pialis, Executive Vice President and General Manager of Data Center Business at Qualcomm, explained the advantage: "Imagine working in the same building that you live in so you only travel up and down. What does that mean for the highways and the roads that connect the suburbs to the city? Guess what? The roads are clear" 2

Source: Wccftech

This 3D integration architecture delivers significant improvements in bandwidth-per-watt—claiming 6x higher efficiency compared to HBM and over 200x capacity compared to on-chip SRAM 4

. By eliminating expensive silicon interposers required for HBM solutions and reducing power consumption through shorter data travel distances, Qualcomm aims to lower total cost of ownership for AI deployments 1

AI250 and AI300: Dragonfly AI Accelerator Rack Roadmap

Qualcomm's product roadmap begins with the AI200, launching later this year with LPDDR5X memory, offering 43 TB of RAM per rack and 414 TB/s of memory bandwidth 3

. The AI250, expected by mid-2027, will incorporate first-generation HBC technology, delivering 768 GB of memory capacity and up to 133 TB/s of effective memory bandwidth per card—representing an 18x bandwidth increase over the AI200 2

. At the rack level, this translates to a theoretical peak memory bandwidth of up to 7.4 PB/s 3

Source: TechRadar

Looking further ahead, the AI300 with second-generation HBC is planned for 2028, promising 54x the effective bandwidth of the AI200 and a 7x improvement in bandwidth-per-watt versus HBM 4

. These AI accelerators are heavily optimized for inference workloads, particularly addressing bandwidth bottlenecks during decode operations when model weights are streamed autoregressively from memory 2

Competing in the AI Data Center Market Against Nvidia and AMD

Qualcomm's push into the AI data center market comes as hyperscalers like Microsoft, Meta, Google, and Amazon increasingly deploy custom silicon alongside Nvidia and AMD solutions 3

. While HBM remains the preferred choice for training workloads—used in Nvidia's Blackwell and AMD's Instinct offerings—Qualcomm is positioning its LPDDR-based approach as superior for inference-centric deployments where memory capacity and power consumption matter most 3

However, questions remain about the practical implementation. Qualcomm has not disclosed what operations the HBC accelerator actually performs—whether it's a transformer-specific engine, tensor core array, or preprocessing logic 1

. The company's heavy reliance on "effective" bandwidth metrics has also raised eyebrows, with observers noting that achieving claimed performance levels would require extraordinarily wide memory buses 2

. As datacenter operators grapple with memory shortages and escalating power consumption, Qualcomm's ability to demonstrate real-world performance gains will determine whether it can capture meaningful market share from entrenched competitors.

Qualcomm unveils High-Bandwidth Compute architecture to break AI memory wall with 6x efficiency

Qualcomm Targets AI Infrastructure with Revolutionary Memory Architecture

How HBC Stacks Compute Beneath DRAM for Efficiency Gains

AI250 and AI300: Dragonfly AI Accelerator Rack Roadmap

Competing in the AI Data Center Market Against Nvidia and AMD

References

Qualcomm reveals HBC near-memory AI architecture, AI250 and AI350 accelerators -- touts 6x higher bandwidth-per-watt compared to HBM, 200x capacity compared to on-chip SRAM

Qualcomm's proposed solution to catch up in AI infra: Bury the compute under the DRAM

Qualcomm targets Nvidia, AMD, Huawei with Dragonfly AI accelerator rack loaded with 43TB of LPDDR5x, future generations set to smash 7PB/s bandwidth

Qualcomm's HBC Stacks Compute Beneath DRAM To Smash The AI Memory Wall, Claiming 6x The Bandwidth Per Watt Of HBM

Related Stories

Qualcomm Challenges Nvidia with New AI Chips for Data Centers

Qualcomm Announces Return to Data Center Market with AI-Focused CPUs Compatible with Nvidia Technology

D-Matrix Challenges HBM with 3DIMC: A New Memory Technology for AI Inference

Recent Highlights

OpenAI rogue agent compromised multiple services in unprecedented AI security breach

AI Kill Switch Act gives DHS power to shut down rogue AI systems after OpenAI security breach

Nvidia forms Open Secure AI Alliance with Microsoft, but OpenAI, Google and Anthropic sit out

Recent Highlights

Today's Top Stories

Mark Zuckerberg says US should not block Chinese AI models, warns against regulatory capture

Mark Zuckerberg attacks rivals over AI governance, pushes for broader access to superintelligence

Google DeepMind dismantles Nobel Prize-winning AlphaFold team in strategic shift to Gemini

Google Gemini on macOS Gets Voice Commands via Fn Key for Hands-Free Productivity