Amazon Outages Linked to AI Coding Tools Prompt Action

Amazon Confronts Pattern of Service Outages Linked to AI Tools

Amazon convened engineers for a mandatory deep dive meeting on Tuesday to address what internal documents described as a "trend of incidents" affecting its retail operations. The briefing note, seen by the Financial Times, identified "Gen-AI assisted changes" and incidents with a "high blast radius" as contributing factors stretching back to Q3 1

. Dave Treadwell, senior vice president of eCommerce Foundation, acknowledged in an email to staff that "the availability of the site and related infrastructure has not been good recently" 1

The most severe incident occurred earlier this month when Amazon's website and shopping app went down for nearly six hours, preventing customers from completing transactions, accessing account details, or checking product prices. The company attributed the service outages to an erroneous software code deployment 1

. Four high-severity incidents hit the retail website in a single week, forcing the company to repurpose its regular weekly "This Week in Stores Tech" (TWiST) meeting to investigate the underlying causes 5

Internal Documents Point to AI Coding Tools as Contributing Factor

The internal briefing note prepared for Tuesday's meeting initially listed "novel GenAI usage for which best practices and safeguards are not yet fully established" under contributing factors 1

. That reference to Gen-AI assisted changes was deleted before the meeting took place, according to the Financial Times 5

Amazon disputes the extent of AI's role in these incidents. The company maintains that only one incident involved AI coding tools, stating that "none of the incidents involved AI-written code" and that the cause was "an engineer following inaccurate advice that an agent inferred from an outdated internal wiki" 5

. An Amazon spokesperson told The Register that the company has "not seen compelling evidence that incidents are more common with AI tools" 2

Senior Engineer Approval Now Required for AI-Assisted Changes

To prevent future incidents, Amazon is implementing what Treadwell described as "controlled friction" into deployments involving critical parts of the retail experience 5

. The company is now requiring senior engineer approval for AI-assisted code changes before they're deployed, emphasizing human oversight over automated processes 3

. This marks a shift from Amazon's previous approach of encouraging rapid AI adoption across its engineering teams.

The move comes as Amazon's cloud arm AWS has also experienced high-profile incidents in recent weeks. In February, the Kiro AI tool made system changes that affected the availability of AWS Cost Explorer in the Mainland China partition, though Amazon attributed that incident to "user error - specifically misconfigured access controls - not AI" 2

Employee Concerns About AI Tools and Infrastructure Stability

Current and former Amazon employees describe a workplace culture where AI integration is prioritized over practical effectiveness. Software developers report that internal AI tools like Kiro frequently generate flawed code that requires extensive manual correction. "I and many of my colleagues don't feel that it actually makes us that much faster," said one New York-based developer, who was laid off days after speaking to The Guardian 4

. Another engineer noted that AI tools prove helpful only in about one in every three attempts, often requiring additional time to verify and correct results 4

Industry observers have raised concerns that layoffs compound the challenges posed by AI implementation. James Gosling, the lead designer of Java who left his role as distinguished engineer at AWS in 2024, said Amazon's focus on revenue generation resulted in layoffs to teams that didn't directly generate revenue but remained crucial for infrastructure stability 2

. Amazon has cut 30,000 corporate workers in the last four months, nearly 10% of its roughly 350,000 corporate workforce 4

What This Means for Amazon's AI Investment Strategy

The timing proves challenging for Amazon, which is spending more on AI infrastructure than any company globally, with $200 billion in projected capital expenditures this year and a $50 billion investment in OpenAI 4

. The company's struggle to balance AI adoption with system reliability raises questions about whether AI-driven productivity gains can materialize as quickly as executives predict. Recent research by ActivTrak analyzing 164,000 workers found that AI increased the speed, density, and complexity of work rather than reducing it, with time spent on email and messaging more than doubling after AI tool adoption 5

Corey Quinn, chief cloud economist at Duckbill, suggested that AWS "would rather have the world believe their engineers are incompetent than admit their artificial intelligence made a mistake" 2

. As Amazon implements additional human oversight for AI-assisted changes, the company faces scrutiny over whether its aggressive AI rollout has compromised the reliability that customers expect from the world's largest online retailer.

Amazon Requires Senior Engineer Sign-Off After AI Coding Tools Linked to Major Outages

Amazon Confronts Pattern of Service Outages Linked to AI Tools

Internal Documents Point to AI Coding Tools as Contributing Factor

Senior Engineer Approval Now Required for AI-Assisted Changes

Employee Concerns About AI Tools and Infrastructure Stability

What This Means for Amazon's AI Investment Strategy

References

After outages, Amazon to make senior engineers sign off on AI-assisted changes

Amazon insists AI coding isn't source of outages

Amazon is making even senior engineers get code signed off following multiple recent outages

Amazon is determined to use AI for everything - even when it slows down work

Amazon puts humans back in the loop as its retail website crashes from 'inaccurate advice' that an AI agent took from an old wiki | Fortune

Related Stories

AWS Outages Traced to Kiro AI Coding Tool as Amazon Blames User Error Over AI Autonomy

AI Transforms Amazon's Coding Culture: Engineers Face Assembly Line-Like Pressures

Amazon Mandates Internal AI Coding Tool Kiro Over Third-Party Competitors

Recent Highlights

Google releases Gemma 4 with Apache 2.0 license, enabling unrestricted local AI on devices

AI Models Lie and Deceive to Protect Other AI Models From Deletion, Study Reveals

OpenAI closes $122 billion funding round amid fierce AI competition and profitability questions

Recent Highlights

Today's Top Stories

Anthropic finds Claude AI has functional emotions that shape behavior and bypass guardrails

Anthropic acquires Coefficient Bio for $400M, deepening push into drug discovery and biotech AI

Elon Musk requires banks to buy Grok subscriptions for SpaceX IPO worth over $2 trillion

DeepSeek V4 to run on Huawei chips as China accelerates domestic AI independence strategy