Anthropic's Jack Clark warns AI needs a 'brake pedal' as systems edge toward self-improvement

3 Sources

Share

Anthropic co-founder Jack Clark has issued a stark warning that the AI industry needs the ability to slow down development before systems begin improving themselves without human oversight. With Claude already writing 80% of its own code, Clark calls for new regulations to ensure human control over increasingly autonomous AI systems.

News article

Anthropic Co-Founder Calls for AI Regulation and Control Mechanisms

Anthropic co-founder Jack Clark has raised urgent concerns about the trajectory of AI development, warning that the technology is rapidly approaching a critical threshold where it could evolve without meaningful human oversight. In an interview with BBC Newsnight, Clark emphasized that while the AI industry has accelerated at breakneck speed, it lacks fundamental safety mechanisms. "You want the option to be able to take your foot off the gas and put your foot on the brake," he explained. "Right now, it's like the AI industry has a gas pedal, but it doesn't have a brake pedal."

1

Clark stressed that government policy must ensure people maintain control over AI systems as they become more powerful and exert broader impacts on society. He drew parallels to the oil industry's regulatory evolution at the turn of the last century, suggesting that "society's response was to come up with a sensible policy and regulatory framework that gave people confidence in oil and the benefits that oil could provide to the world."

1

Claude Chatbot Already Writing 80% of Its Own Code

The warning comes as Anthropic revealed striking evidence of AI's growing autonomy. The company's popular Claude chatbot currently operates on code that is 80% self-written by the system itself. Clark indicated that reaching 100% autonomy is possible within two years, which "would have huge implications."

1

This development underscores how quickly autonomous AI systems are advancing toward self-sufficiency in their own development processes.

According to Anthropic's recent blog post, code correction rates by their staff have been falling steadily over the past year, indicating fewer errors in what Claude produces. The system can now run its own research experiments when given open-ended questions and develop solutions without human input. "The evidence suggests that the human role is narrowing at each step in the AI development process," the company noted.

2

Recursive Self-Improvement Poses Control Challenges

The concept of recursive self-improvement represents one of the most significant concerns for maintaining human control over AI. In this scenario, AI agents—autonomous workers built by chatbots—could become capable enough to build and train models themselves, meaning "Claude could be continuously improved by Claude," as Anthropic explained.

2

While recursive self-improvement could benefit fields like science and healthcare, Anthropic warns it might increase "the risks of humans losing control over AI systems." The company's blog post emphasized that "if systems are capable of fully building their own successors, the ways we secure them, monitor them and shape their behaviour all grow much more important."

2

Anthropic stated that recursive self-improvement "is not inevitable" but warned it "could come sooner than most institutions are prepared for."

3

The company believes "it would be good for the world to have the option to slow or temporarily pause frontier AI development to enable societal structures and alignment research to keep up with the advance of the technology."

3

Economic Disruption and Job Displacement Concerns

Beyond technical control issues, Clark highlighted AI's potential for economic disruption and job displacement. He specifically pointed to AI agents that conduct routine tasks somewhat autonomously as a threat to certain employment sectors. Major tech companies have conducted mass layoffs over the past year, often citing the growing ability of AI tools to do the work of hundreds or even thousands of software engineers.

1

However, Clark suggested that creativity may provide a competitive advantage over AI technology. "There are open questions about whether AI systems can be truly creative... there is not really evidence for that yet," he noted. At Anthropic, the company is "now limited more by the ability to generate good ideas than the ability to do the engineering to turn those ideas into reality."

1

Regulatory Framework Remains Voluntary Despite Warnings

Despite these concerns, meaningful AI regulation remains elusive. Anthropic welcomed an executive order on AI from US President Donald Trump that was relatively hands-off in its directives toward companies. It did not require AI companies to submit to safety testing by the government, leaving AI safety as a voluntary effort. Major AI companies pursuing advances in the technology, including Anthropic, OpenAI and Google, have not indicated they will pause their own research.

1

Anthropic acknowledged that a real slowdown would require "multiple well-resourced labs at or near the frontier, in multiple countries, agreeing to stop under the same conditions."

2

The company said its institute will conduct research to build a system to verify whether developers have actually stopped or slowed the move toward recursive AI.

As Anthropic prepares to debut on the public stock market with a valuation estimated by private investors at nearly $1tn, Clark insisted the company's motivation for discussing AI's growing capabilities isn't to enhance its reputation with customers. "I am worried for my kids if we as a society don't have a serious conversation about what the implications of AI's continued advances mean," he told Newsnight. "There are potentially great benefits. There are also risks."

1

The challenge facing policymakers is creating effective AI regulation that balances innovation with societal changes and AI risks before autonomous systems advance beyond meaningful human oversight.

Today's Top Stories