Qualcomm unveils High-Bandwidth Compute architecture to break AI memory wall with 6x efficiency

4 Sources

Share

Qualcomm introduced its High-Bandwidth Compute (HBC) architecture at Investor Day 2026, positioning itself as a serious contender in the AI data center market. The technology stacks compute beneath DRAM to tackle the memory wall bottleneck, claiming 6x higher bandwidth-per-watt compared to HBM and 200x capacity versus on-chip SRAM. The first HBC-powered AI250 accelerators are expected by mid-2027.

Qualcomm Targets AI Infrastructure with Revolutionary Memory Architecture

Qualcomm announced its ambitious entry into the AI data center market at Investor Day 2026, unveiling High-Bandwidth Compute (HBC), an HBC near-memory AI architecture designed to address the AI memory wall bottleneck that has constrained performance across the industry

1

. The technology represents a distinct shift in the company's AI infrastructure solution strategy, moving beyond its mobile Snapdragon processors to challenge established players like Nvidia and AMD in datacenter deployments

2

.

Source: Tom's Hardware

Source: Tom's Hardware

How HBC Stacks Compute Beneath DRAM for Efficiency Gains

The core innovation behind Qualcomm's approach involves disaggregating AI accelerators from the system-on-chip and positioning them directly underneath LPDDR DRAM stacks. This configuration leverages Through-Silicon Vias (TSVs) to create ultra-short data paths between compute and memory

1

. Tony Pialis, Executive Vice President and General Manager of Data Center Business at Qualcomm, explained the advantage: "Imagine working in the same building that you live in so you only travel up and down. What does that mean for the highways and the roads that connect the suburbs to the city? Guess what? The roads are clear"

2

.

Source: Wccftech

Source: Wccftech

This 3D integration architecture delivers significant improvements in bandwidth-per-watt—claiming 6x higher efficiency compared to HBM and over 200x capacity compared to on-chip SRAM

4

. By eliminating expensive silicon interposers required for HBM solutions and reducing power consumption through shorter data travel distances, Qualcomm aims to lower total cost of ownership for AI deployments

1

.

AI250 and AI300: Dragonfly AI Accelerator Rack Roadmap

Qualcomm's product roadmap begins with the AI200, launching later this year with LPDDR5X memory, offering 43 TB of RAM per rack and 414 TB/s of memory bandwidth

3

. The AI250, expected by mid-2027, will incorporate first-generation HBC technology, delivering 768 GB of memory capacity and up to 133 TB/s of effective memory bandwidth per card—representing an 18x bandwidth increase over the AI200

2

. At the rack level, this translates to a theoretical peak memory bandwidth of up to 7.4 PB/s

3

.

Source: TechRadar

Source: TechRadar

Looking further ahead, the AI300 with second-generation HBC is planned for 2028, promising 54x the effective bandwidth of the AI200 and a 7x improvement in bandwidth-per-watt versus HBM

4

. These AI accelerators are heavily optimized for inference workloads, particularly addressing bandwidth bottlenecks during decode operations when model weights are streamed autoregressively from memory

2

.

Competing in the AI Data Center Market Against Nvidia and AMD

Qualcomm's push into the AI data center market comes as hyperscalers like Microsoft, Meta, Google, and Amazon increasingly deploy custom silicon alongside Nvidia and AMD solutions

3

. While HBM remains the preferred choice for training workloads—used in Nvidia's Blackwell and AMD's Instinct offerings—Qualcomm is positioning its LPDDR-based approach as superior for inference-centric deployments where memory capacity and power consumption matter most

3

.

However, questions remain about the practical implementation. Qualcomm has not disclosed what operations the HBC accelerator actually performs—whether it's a transformer-specific engine, tensor core array, or preprocessing logic

1

. The company's heavy reliance on "effective" bandwidth metrics has also raised eyebrows, with observers noting that achieving claimed performance levels would require extraordinarily wide memory buses

2

. As datacenter operators grapple with memory shortages and escalating power consumption, Qualcomm's ability to demonstrate real-world performance gains will determine whether it can capture meaningful market share from entrenched competitors.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved