Developers refuse to work without AI, but research reveals it may be making them worse

2 Sources

Share

AI research lab METR discovered developers won't participate in studies without AI coding tools, even for limited tasks. But mounting evidence from Amazon, Uber, and academic researchers suggests AI reliance may be creating more problems than it solves, with companies spending 44% of tokens on AI-generated bug fixes.

Developers Won't Work Without AI Anymore

In February 2026, AI research lab METR attempted to update groundbreaking research from 2025 on developer productivity with AI coding tools. The original study had revealed a paradox: while developers reported feeling more productive with AI, the data showed AI actually slowed them down because they spent extra time finding and fixing code errors, steering the AI, and waiting for it to complete tasks

1

. When METR tried to replicate the experiment, they hit an unexpected wall. Coders refusing to work without AI made the study impossible, as developers wouldn't participate even for a limited number of tasks in a research setting

2

.

Instead, METR published a survey in May allowing technical employees to self-report their AI reliance and perceived gains. Not surprisingly, developers claimed AI made them twice as valuable to their organizations

1

. But recent evidence from major tech companies and independent researchers suggests this perception may be dangerously disconnected from reality.

AI Spending and Productivity Gains Don't Match Up

The trend of tokenmaxxing—using token consumption as a proxy for productivity—has dominated 2026, but it may already be collapsing under scrutiny. Amazon shut down its internal token-tracking leaderboard called Kirorank after employees gamed the system by using AI agents excessively and running up costs without demonstrating actual productivity gains, the Financial Times reported this week

1

. Uber blew through its entire 2026 AI budget within the first four months of the year, The Information reported. COO Andrew Macdonald admitted on a podcast that such spending hadn't led to a measurable increase in projects or developer productivity

2

.

These examples from two of the world's most technically sophisticated companies reveal a troubling pattern: massive AI spending doesn't automatically translate to better outcomes in software development. Salesforce projects $300 million in Anthropic token spending this year, with CEO Marc Benioff calling for an "intermediary layer" to route tokens intelligently between frontier and cheaper models—an implicit admission that not every token produces value

2

.

AI Making Developers Worse Through Maintenance Costs

The negative consequences of AI dependence extend beyond immediate spending concerns. Programmer and author James Shore argued in a viral blog post that faster code generation creates a dangerous trap. "You write code twice as quick now? Better hope you've halved your maintenance costs," he wrote. "Otherwise, you're screwed. You're trading a temporary speed boost for permanent indenture"

1

.

Data supports this warning about AI's impact on coding quality. Aiswarya Sankar, founder and CEO of reliability engineering startup Entelligence AI, claims companies spend 44% of their tokens on bug fixes that their AI-generated code created. Code reviewing tool company CodeRabbit analyzed open source pull requests and found that AI produced 1.7 times more problems than human code

1

. While these statistics come from companies selling AI code review tools, independent researchers at Singapore Management University reached the same conclusion in an April report, warning that "AI-generated code can introduce long-term maintenance costs into real software projects"

2

.

What Engineering Leaders Should Watch

The dependency has outpaced the evidence. Cognition founder Scott Wu, maker of AI coding agent Devin, admits his tool's skill level sits between a junior and mid-level programmer depending on the task—not a hand-off-and-forget solution

2

. The SMU researchers recommend treating AI output the way you would code from a junior developer: implement strong quality assurance systems designed for AI, carefully review everything, and keep humans responsible for software architecture and security design

1

.

The question engineering leaders must address is whether the productivity gains from AI coding tools are real or merely perceived. If developers refuse to work without AI but the tools generate more bugs than they prevent, the net effect could be negative. The AI coding market is growing faster than the evidence that it works, creating a critical need for quality assurance infrastructure and review processes to ensure faster code production doesn't become faster technical debt production

2

.

Today's Top Stories

TheOutpost.ai

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Instagram logo
LinkedIn logo
Youtube logo
© 2026 TheOutpost.AI All rights reserved