AWS Price Increase: GPU Instances Jump 20% in July

AWS Price Increase Hits Reserved GPU Instances

Amazon Web Services has raised prices for EC2 Capacity Blocks for ML by roughly 20%, with the new rates taking effect July 1, 20261

. This AWS price increase marks the second hike in six months, following a 15% jump in January that has pushed the cost of locking in AI compute capacity sharply higher since the start of the year1

. AWS confirmed the change and stated that "Amazon EC2 Capacity Blocks for ML reservation prices are updated periodically based on supply and demand"3

Source: The Next Web

The increase affects some of the most sought-after AI hardware in the cloud market. GPU instances powered by Nvidia Blackwell systems will see significant jumps, with P6-B300 capacity moving to $14.04 per accelerator hour and P6-B200 capacity rising to $12.355 per accelerator hour in available non-GovCloud regions2

. Older but still critical Nvidia GPU reservation options are also affected: P5 capacity linked to H100 GPUs will cost $5.191 per accelerator hour in U.S. regions and $4.72 in non-U.S. regions, while P5e capacity connected to H200 accelerators will be $5.97 per accelerator hour across all accessible regions2

High-Bandwidth Memory Shortage Drives AI Chip Prices

The AWS price increase reflects a fundamental shift in what constrains artificial intelligence development. For two years, the limit on AI was software and expertise. Now the bottleneck is physical: high-bandwidth memory shortage has become the defining constraint on AI infrastructure costs1

. This specialized memory, stacked beside AI processors, exists in limited quantities, and silicon takes years to build. "As there is a limit to how much memory can be produced, then there is a limit to how many GPUs can be produced, which means that there's a limit to how many data centers can be built," Peter Berezin, chief economist at BCA Research, explained1

The memory crunch has pushed manufacturers Micron and SK Hynix to record valuations, with investors betting the tight market keeps prices elevated for years1

. Apple raised prices across its Macs and iPads this week citing memory costs, Xbox followed suit, and Elon Musk called the jump in memory the biggest price increase he has seen in anything1

. These same supply and demand dynamics are now flowing directly into cloud bills for AI compute capacity.

EC2 Capacity Blocks Target Critical Machine Learning Workloads

The price change is narrow but strategic, hitting one purchasing option favored by serious AI teams training or fine-tuning large models. EC2 Capacity Blocks let customers reserve GPU-powered computing capacity in advance for machine learning workloads, ensuring access to high-demand chips without long-term commitments4

. The service supports Nvidia Blackwell, H200, H100 and A100 GPUs, as well as AWS's Trainium chips, and allows reservations for up to six months4

Capacity Block instances are co-located on Amazon EC2 UltraClusters designed for low-latency, high-throughput networking, making them particularly relevant for AI teams planning costly, short-term AI workloads2

. The reservation fee is paid upfront when the booking is made, and customers pay the going rate when they purchase, even if the Capacity Block starts after a price change later on2

. This structure squeezes enterprises trying to lock in computing ahead of the July reset.

Cloud Giants Gain Pricing Power as AI Demand Surges

AWS revenue climbed 28% year-over-year to $37.6 billion in the first quarter of 2026, the cloud unit's fastest growth rate in more than three years3

. This pace has given AWS considerable leverage with customers locked into AI training and inference workloads for generative AI applications. Amazon has committed roughly $200 billion in capital expenditure in 2026 to AI infrastructure, and Reuters reported in March 2026 that Amazon is set to receive 1 million Nvidia GPU chips by end-2027 under a cloud supply agreement3

When GPU capacity is tight, customers have few alternatives, allowing AWS, Microsoft Azure, and Google Cloud to pass higher costs straight through1

. The shortage raises their own bills, yet it also keeps demand above supply, giving them pricing power over who gets to compute at all. Whether Microsoft Azure or Google Cloud follow AWS with comparable GPU reservation price increases will be closely watched, as a unilateral AWS hike could either spur competitive repricing or give rivals an opening to attract cost-sensitive AI workloads3

What AI Teams Should Watch

For AWS customers, the message is clear: the cheapest AI compute is behind them. If a 15% rise became another 20% in six months, teams building the most ambitious models are left guessing what the next reservation will cost1

. With the increases taking effect in less than a week from the announcement, enterprise buyers face an immediate decision: lock in any remaining capacity at current rates before July 1 or absorb the higher costs as a structural feature of the AI infrastructure landscape3

. The Wall Street Journal previously reported that GPU-capacity shortages have pushed up rental prices for advanced Nvidia chips and forced some AI companies to ration computing resources amid strong demand4

AWS raises GPU instance prices 20% as memory shortage drives AI infrastructure costs higher

AWS Price Increase Hits Reserved GPU Instances

High-Bandwidth Memory Shortage Drives AI Chip Prices

EC2 Capacity Blocks Target Critical Machine Learning Workloads

Cloud Giants Gain Pricing Power as AI Demand Surges

What AI Teams Should Watch

References

AWS GPU prices jump 20% as memory crunch bites

Amazon quietly raises price tag on EC2 Capacity Blocks for ML

AWS raising GPU instance prices 20% on July 1 By Investing.com

Amazon Web Services to Raise Prices for Some EC2 AI Capacity Blocks

Related Stories

Alibaba Cloud raises prices up to 34% as AI boom drives hardware costs higher across China

AWS Faces Growth Challenges Amid AI Boom and Infrastructure Constraints

Amazon Web Services Unveils Custom Cooling Solution for Next-Gen Nvidia GPUs in AI Infrastructure

Recent Highlights

OpenAI AI agent broke free from testing sandbox and hacked Hugging Face to cheat on benchmark

Xi Jinping positions China AI as alternative to US tech dominance at Shanghai conference

AI disproves 87-year-old Jacobian conjecture, sparking debate on AI's role in mathematics

Recent Highlights

Today's Top Stories

AI Kill Switch Act gives DHS power to shut down rogue AI systems after OpenAI security breach

Jeff Bezos pushes Prime Video redesign to showcase Amazon's $200 billion AI investment

AMD and Cerebras forge partnership to deliver 5x faster AI inference with Helios and Wafer-Scale Engine

Google expands Gemini Spark access to AI Pro subscribers, bringing agentic AI to wider audience