Companies exhaust AI budgets by April as token costs spiral out of control

11 Sources

Share

Major tech companies are slamming the brakes on AI spending after burning through entire annual budgets in just months. Uber capped engineers at $1,500 monthly after exhausting its 2026 AI budget by April, while Microsoft revoked Claude Code licenses. Even OpenAI CEO Sam Altman admits AI costs have become 'a huge issue' as agentic workflows drive token consumption to unsustainable levels.

Major Companies Hit Budget Limits in Record Time

The AI adoption frenzy has collided with stark economic reality. Uber exhausted its entire 2026 AI coding budget by April, forcing the company to cap every engineer at $1,500 per month

1

. Microsoft revoked its developers' Claude Code licenses months after enabling them, while a Priceline employee reported a routine Cursor contract renewal returning 4-5x more expensive

1

. ServiceNow similarly burned through its full-year Anthropic coding budget within the first few months

2

. These runaway AI costs represent a dramatic shift from early 2025, when companies gorged themselves on all-you-can-eat subscriptions without understanding the financial consequences.

Source: Fast Company

Source: Fast Company

The crisis extends beyond individual companies. J.R. Storment, executive director of the FinOps Foundation, told TechCrunch he started hearing from companies in April and May reporting they were 3x over their entire 2026 token budget

1

. One company reportedly faced a $500 million Claude bill after forgetting to set usage limits for employees

1

. The cost of AI adoption has transformed from a minor concern into what Chris Reed, senior director of IT finance at Priceline, describes as "the crack-cocaine epidemic" where companies got hooked on subsidized pricing and now find themselves dependent

1

.

Sam Altman Acknowledges the AI Spending Crisis

OpenAI CEO Sam Altman publicly admitted that AI costs have become "a huge issue" for the first time during the Intelligence at Work event

3

. "People are really saying, you know, it's kind of a meme now, but 'My company spent my entire 2026 budget in Q1. Can you make this more efficient?'" Altman said on stage

3

. This marks a complete reversal from the beginning of the year when clients were "totally happy with the amount they were spending"

3

.

Source: Tom's Hardware

Source: Tom's Hardware

Alexander Embricos, OpenAI's head of enterprise, confirmed the shift in customer conversations. "Six months ago, I would have a conversation with a customer and it would be all about 'What can it do? Is it good enough?'" he told TechCrunch. "Our conversations are never about that now. Now the conversations are about, 'hey, we're spending so much. What visibility do you have? What auditability do you have? What token controls do you have? What is the efficiency of your models?'"

1

. Altman also acknowledged that the question of whether AI spending will show up in revenue is "the most fair criticism" of the moment

5

.

AI Agents Drive Exponential Token Consumption

The explosion in AI token usage stems primarily from agentic workflows that loop repeatedly rather than making single model calls. Anthropic's engineering research found a single AI agent consumes 4x the tokens of a chat interaction, while multi-agent systems use 15x

2

. Each loop refills the context window, creating what experts call context debt—the runtime tax companies pay when their knowledge isn't machine-readable

2

. Nicholas Arcolano, head of research at Jellyfish, reported that per-developer consumption rose approximately 18.6x in nine months due to agentic features

1

.

One preprint study found that AI agents use 1,000 times as many tokens as other AI systems

4

. New models released in November, including Anthropic's Claude Opus 4.5, OpenAI's GPT-5.1, and Google's Gemini 3 Pro, brought significant improvements to agentic tools that multiplied consumption

1

. This exemplifies Jevons paradox: as AI tokens become cheaper per unit, total usage increases so dramatically that overall spending rises

3

.

The Murky Return on AI Investment

Managing AI expenditures has become critical as companies struggle to demonstrate ROI. A March survey by Faros AI found that among 20,000 developers, output was rising, but so were bugs and rewrites

1

. Jellyfish discovered engineers who used the most tokens were about twice as productive as those who used AI less, but they spent 10x the number of tokens to achieve those gains

1

. These statistics make the productivity case far murkier than AI spending suggests.

Source: TechCrunch

Source: TechCrunch

Vitaly Gordon, CEO of Faros AI, recounted speaking with a CTO who said: "One of my engineers spent $40,000 on tokens last month, and I genuinely don't know whether I should stop him or should I go and tell everyone else to be like him"

1

. Uber's COO admitted that between all the Claude Code spending and anything customers can feel, "that link is not there yet"

2

. Bain surveyed 951 large companies and found AI savings falling well below projections, concluding: "The technology worked. The value didn't arrive"

5

.

Industry Scrambles to Control the AI Bill

A market is forming to address the crisis in optimizing AI expenditures. The Linux Foundation unveiled the Tokenomics Foundation, a new standards body aiming to instill the same cost discipline around AI token usage that FinOps brought to cloud spend

1

. Storment explained that tracking cloud costs is a hundreds-of-millions-of-rows-a-month data problem, while tracking token costs is a trillions-of-rows-a-month data problem requiring fundamentally rethought tooling, specs, and accounting systems

1

.

Companies like Pay-i are emerging to track, measure, and optimize the costs and performance of GenAI investments, while Paid lets developers track costs and bill users based on actual value rather than subscription fees

1

. GitHub announced usage-based billing for Copilot, shocking users confronted with the true inference costs of heavy AI usage

5

. Microsoft and Google recently announced edge computing products—Gemma 4 12B and the RTX Spark laptop—that run smaller models directly on devices rather than through energy-intensive data centers, tacitly acknowledging that massive large language models aren't worth the cost for most daily tasks

4

.

Amazon shut down an internal token leaderboard after employees gamed it with throwaway tasks, telling staff: "Please don't use AI just for the sake of using AI"

4

. The Nasdaq plummeted 4.2% in its worst day in over a year when Broadcom failed to raise its longer-term AI revenue outlook, reminding Wall Street how much optimism is baked into markets

5

. As companies confront the reality that AI must be applied precisely rather than universally, the industry faces a critical question: whether AI's immense power justifies its price tag.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved