Anthropic warns AI may soon build itself, calls for global pause on frontier development

Reviewed byNidhi Govil

35 Sources

Share

Anthropic is urging the world's leading AI companies to consider pausing frontier AI development as its Claude chatbot now writes over 80% of the code merged into its systems. The company warns that AI systems may be approaching recursive self-improvement, where they can design their own successors with minimal human input, potentially increasing the risk of humans losing control of the technology.

Anthropic Calls for Global Slowdown of AI Development

Anthropic, the maker of Claude AI, has issued a stark warning that artificial intelligence systems may be approaching a critical threshold where they can design and build their own successors with little human input. In a blog post titled "When AI Builds Itself," published on June 4, the company stated that "it would be good for the world to have the option to slow or temporarily pause frontier AI development to enable societal structures and alignment research to keep up with the advance of the technology"

1

. This call comes just days after Anthropic confidentially filed for an initial public offering and reached a valuation close to $1 trillion

3

.

Source: Market Screener

Source: Market Screener

The company revealed that Claude now writes more than 80% of the code merged into its production codebase as of last month, up from low single digits before Claude Code reached research preview in February 2025

2

. Engineers at Anthropic are now merging eight times as much code per quarter as they did from 2021 to 2025, demonstrating how rapidly the human role in AI development is shrinking

2

.

The Risk of Recursive Self-Improvement

Recursive self-improvement represents a potential inflection point where AI systems become capable enough to design their next generation autonomously. Anthropic warns this "might increase the risks of humans losing control over AI systems"

4

. The company outlined three scenarios for how the next few years could unfold, with the most severe warning reserved for situations where models become capable of fully improving themselves. In that case, the pace of progress would be determined almost entirely by available compute, with humans pushed toward oversight and verification roles

2

.

Source: CXOToday

Source: CXOToday

The firm described alignment issues as particularly concerning in this future scenario. Misalignment that's rare and survivable today could compound generation over generation, "growing more frequent but less understood until we lose control of them"

2

. This concern about AI alignment research keeping pace with technological advances sits at the heart of Anthropic's proposal.

Evidence of Accelerating AI Capabilities

Anthropic backed its warnings with internal data showing dramatic improvements in Claude's capabilities. On the hardest, least-specified coding tasks, Claude succeeded 76% of the time in May 2026, a rise of 50 percentage points in just six months

2

. A recurring internal test asking each new model to optimize training code saw results climb from roughly triple the original speed with Claude Opus 4 in May 2025 to about 52 times faster with the unreleased Mythos Preview model in April

2

.

The length of human tasks that models can reliably complete on their own had been doubling every seven months as measured in March 2025, but now that timeline has accelerated to every four months

3

. While Claude 3 Opus released in March 2024 could reliably complete tasks taking humans four minutes, Claude Opus 4.6 can handle tasks that take humans 12 hours

3

.

Proposal for Global Coordination Mechanism

Anthropic is proposing what it calls a "global coordination mechanism" to slow or pause AI development, pointing to arms-control agreements on intermediate-range nuclear missiles as a loose model

1

. The company emphasized that "a meaningful slowdown or pause would require multiple well-resourced labs at or near the frontier, in multiple countries, agreeing to stop under the same conditions"

5

. Mechanisms must be in place to verify that all parties have actually stopped or slowed development, preventing bad actors from using the pause to jump ahead in secret

4

.

Source: PYMNTS

Source: PYMNTS

The Anthropic Institute, the company's research division established in March, will spend the coming months convening governments, researchers, and rival AI companies to explore whether a coordinated slowdown could function in practice

1

. The institute plans to conduct research on building the systems that a credible slowdown or pause would require

5

.

Skepticism and Criticism

The proposal has drawn significant skepticism from researchers and industry observers. Noah Giansiracusa, associate professor of mathematics at Bentley University, stated bluntly: "It's literally impossible. Zero chance there will be a slowdown"

1

. Mark Riedl, a professor at Georgia Institute of Technology, suggested that "the big AI companies are all jumping on the 'recursive self-improvement' hype train"

1

.

Critics point to the timing of Anthropic's warnings as suspicious. The proposal comes just days after the company filed for an IPO and follows the limited release of its Mythos model, which Anthropic claimed was too good at finding software vulnerabilities to release publicly

1

. To skeptics, such pronouncements read as business strategy designed to draw regulatory scrutiny to competitors while Anthropic continues racing toward the frontier

1

.

Broader Context of AI Risks and Safety

The warning comes amid growing concerns about AI cybersecurity and AI safety. Researchers at the University of Toronto recently demonstrated how AI tools could create adaptive AI worms that spread from device to device, taking over computing networks

4

. President Donald Trump also signed an executive order directing the Treasury Department to establish an "AI cybersecurity clearinghouse" to coordinate with the AI industry

3

.

Anthropic acknowledged that unknown bottlenecks could emerge to slow progress, and that current models consistently underperform in "taste" or selecting the next step when left unprompted by humans

3

. The company stated it's "genuinely unclear" whether today's training methods and architectures could unlock the capacity for fully autonomous AI development

3

. However, if current trends hold, the alternative scenario is that progress becomes determined entirely by the availability of compute, with humans playing a substantially diminished role

3

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved