Amazon Requires Senior Engineer Sign-Off After AI Coding Tools Linked to Major Outages

11 Sources

Share

Amazon held an emergency meeting to investigate multiple service outages, including a six-hour ecommerce site crash that prevented customers from completing transactions. Internal documents initially identified Gen-AI assisted changes as a contributing factor, though the company later disputed the extent of AI's role. The retail giant is now implementing additional oversight requiring senior engineer approval for AI-assisted code changes.

News article

Amazon Confronts Pattern of Service Outages Linked to AI Tools

Amazon convened engineers for a mandatory deep dive meeting on Tuesday to address what internal documents described as a "trend of incidents" affecting its retail operations. The briefing note, seen by the Financial Times, identified "Gen-AI assisted changes" and incidents with a "high blast radius" as contributing factors stretching back to Q3

1

. Dave Treadwell, senior vice president of eCommerce Foundation, acknowledged in an email to staff that "the availability of the site and related infrastructure has not been good recently"

1

.

The most severe incident occurred earlier this month when Amazon's website and shopping app went down for nearly six hours, preventing customers from completing transactions, accessing account details, or checking product prices. The company attributed the service outages to an erroneous software code deployment

1

. Four high-severity incidents hit the retail website in a single week, forcing the company to repurpose its regular weekly "This Week in Stores Tech" (TWiST) meeting to investigate the underlying causes

5

.

Internal Documents Point to AI Coding Tools as Contributing Factor

The internal briefing note prepared for Tuesday's meeting initially listed "novel GenAI usage for which best practices and safeguards are not yet fully established" under contributing factors

1

. That reference to Gen-AI assisted changes was deleted before the meeting took place, according to the Financial Times

5

.

Amazon disputes the extent of AI's role in these incidents. The company maintains that only one incident involved AI coding tools, stating that "none of the incidents involved AI-written code" and that the cause was "an engineer following inaccurate advice that an agent inferred from an outdated internal wiki"

5

. An Amazon spokesperson told The Register that the company has "not seen compelling evidence that incidents are more common with AI tools"

2

.

Senior Engineer Approval Now Required for AI-Assisted Changes

To prevent future incidents, Amazon is implementing what Treadwell described as "controlled friction" into deployments involving critical parts of the retail experience

5

. The company is now requiring senior engineer approval for AI-assisted code changes before they're deployed, emphasizing human oversight over automated processes

3

. This marks a shift from Amazon's previous approach of encouraging rapid AI adoption across its engineering teams.

The move comes as Amazon's cloud arm AWS has also experienced high-profile incidents in recent weeks. In February, the Kiro AI tool made system changes that affected the availability of AWS Cost Explorer in the Mainland China partition, though Amazon attributed that incident to "user error - specifically misconfigured access controls - not AI"

2

.

Employee Concerns About AI Tools and Infrastructure Stability

Current and former Amazon employees describe a workplace culture where AI integration is prioritized over practical effectiveness. Software developers report that internal AI tools like Kiro frequently generate flawed code that requires extensive manual correction. "I and many of my colleagues don't feel that it actually makes us that much faster," said one New York-based developer, who was laid off days after speaking to The Guardian

4

. Another engineer noted that AI tools prove helpful only in about one in every three attempts, often requiring additional time to verify and correct results

4

.

Industry observers have raised concerns that layoffs compound the challenges posed by AI implementation. James Gosling, the lead designer of Java who left his role as distinguished engineer at AWS in 2024, said Amazon's focus on revenue generation resulted in layoffs to teams that didn't directly generate revenue but remained crucial for infrastructure stability

2

. Amazon has cut 30,000 corporate workers in the last four months, nearly 10% of its roughly 350,000 corporate workforce

4

.

What This Means for Amazon's AI Investment Strategy

The timing proves challenging for Amazon, which is spending more on AI infrastructure than any company globally, with $200 billion in projected capital expenditures this year and a $50 billion investment in OpenAI

4

. The company's struggle to balance AI adoption with system reliability raises questions about whether AI-driven productivity gains can materialize as quickly as executives predict. Recent research by ActivTrak analyzing 164,000 workers found that AI increased the speed, density, and complexity of work rather than reducing it, with time spent on email and messaging more than doubling after AI tool adoption

5

.

Corey Quinn, chief cloud economist at Duckbill, suggested that AWS "would rather have the world believe their engineers are incompetent than admit their artificial intelligence made a mistake"

2

. As Amazon implements additional human oversight for AI-assisted changes, the company faces scrutiny over whether its aggressive AI rollout has compromised the reliability that customers expect from the world's largest online retailer.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo