AWS raises GPU instance prices 20% on July 1 amid surging AI computing demand

2 Sources

Share

Amazon's AWS will increase prices for EC2 Capacity Block reservations for machine-learning GPU instances by approximately 20% starting July 1, 2026. The price increase affects Nvidia-powered instance families including P6-B300, P6-B200, and P5, driven by tight supply and soaring enterprise demand for AI infrastructure. The move could reshape competitive dynamics with Microsoft Azure and Google Cloud.

AWS Implements Major Price Increase for GPU Instances

Amazon's AWS announced a significant price increase for its EC2 Capacity Block reservations, raising rates for machine-learning GPU instances by approximately 20% effective July 1, 2026. The decision reflects what AWS describes as periodic pricing updates based on supply and demand dynamics in the AI infrastructure market

1

. The new hourly rates per accelerator will affect AWS's most powerful Nvidia-powered instance families, with the P6-B300 billed at $14.04, the P6-B200 at $12.355, and P5 instances in US regions at $5.191

1

. Other affected tiers include P5e at $5.97, P5en at $6.865 for US regions, and P4de at $2.214 for US deployments, while all other EC2 prices remain unchanged

1

.

Surging Demand Drives AI Computing Capacity Constraints

The price increase lands amid sustained enterprise appetite for GPU compute resources. AWS revenue climbed 28% year-over-year to $37.6 billion in the first quarter of 2026, marking the cloud unit's fastest growth rate in more than three years

1

. This accelerated growth has given AWS considerable pricing leverage with customers locked into AI model training and inference workloads. Amazon has committed roughly $200 billion in capital expenditure in 2026 to AI infrastructure, and Reuters reported in March 2026 that Amazon is set to receive 1 million Nvidia GPU chips by end-2027 under a cloud supply agreement

1

. The Wall Street Journal previously reported that GPU-capacity shortages have pushed up rental prices for advanced Nvidia chips and forced some AI companies to ration computing resources amid strong demand

2

.

What EC2 Capacity Block Reservations Offer Enterprises

Capacity Blocks for ML are a reserved-capacity product that lets enterprises secure scarce GPU instances on a future date for time-bound workloads, typically large-scale generative AI boom projects. The service supports Nvidia Blackwell, H200, H100 and A100 GPUs, as well as AWS's Trainium chips, and allows reservations for up to six months

2

. Because the product is reservation-based, customers have been willing to pay a premium over spot-market rates for the guarantee of availability

1

. AWS's Capacity Blocks require customers to pay an upfront reservation fee, with operating-system charges billed separately while instances are running .

Competitive Implications for Microsoft Azure and Google Cloud

Whether Microsoft Azure or Google Cloud follow AWS with comparable GPU reservation price increases will be closely watched. Azure is AWS's nearest rival in enterprise cloud infrastructure, and a unilateral AWS hike could either spur competitive repricing or give Azure and Google Cloud an opening to attract cost-sensitive AI workloads

1

. Rising reservation costs could prompt some AWS customers to evaluate alternatives, including Nvidia-powered offerings on rival clouds or Google Cloud's TPU-based instances, which Alphabet has been actively marketing as a cost-competitive option

1

. With the increases taking effect in less than a week, enterprise buyers face an immediate decision: lock in any remaining capacity at current rates before July 1 or absorb the higher costs as a structural feature of the AI infrastructure landscape

1

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved