9 Sources
[1]
Anthropic Drops Hallmark Safety Pledge in Race With AI Peers
Anthropic, which for years billed itself as a safer alternative to artificial intelligence rivals, has loosened its commitment to maintaining its guardrails, one of the most dramatic policy shifts in the AI industry yet as startups once focused on helping humanity turn their attention to profit and success. The company in 2023 said in its Responsible Scaling Policy that it would delay AI development that might be dangerous. In a Tuesday blog post, Anthropic said it was updating its rules to say it would no longer do so if it believes it lacks a significant lead over a competitor. Bloomberg News Tech & National Security Reporter Katrina Manson joins Bloomberg Businessweek Daily to discuss. She speaks with Carol Massar and Tim Stenovec.
[2]
AI labs ease safety rules amid race pressure
Why it matters: Even Google DeepMind CEO Demis Hassabis has warned that "race conditions" -- pressure to outpace rivals or rival nations -- can drive reckless decisions as the world nears superhuman AI.

Zoom in: Anthropic, long viewed as the most safety-focused major AI lab, last week revised a key safeguard -- narrowing the conditions under which it would delay developing or releasing a model that could pose catastrophic risk.
* "We will delay AI development and deployment as needed to achieve this, until and unless we no longer believe we have a significant lead," the revised policy says.

Zoom out: Anthropic's recalibration comes amid a dispute with the Trump administration.
* The company refused to allow its models to be used for autonomous weapons or domestic surveillance. The Defense Department responded by cutting use of Claude and labeling the firm a supply chain risk.

Between the lines: That highlights another problem with competition. Even if one company refuses on safety grounds, another is likely to step in.
* Hours later, OpenAI announced a deal to provide models for classified networks.
* OpenAI says it shares many of Anthropic's safety concerns. Critics note its agreement leaves broad room for military use, including surveillance of U.S. citizens.

The big picture: Competitive pressure -- among AI labs and between nations, particularly the U.S. and China -- is intensifying.
* History shows such pressure can push companies toward decisions they might otherwise avoid, as executives fear restraint means losing the lead.
* Hassabis has repeatedly argued that the closer the world gets to superhuman AI, the more essential global cooperation becomes.
* "It's going to require everybody to come together -- hopefully, in time," he said in early 2025.

Yes, but: The trajectory appears to be moving in the opposite direction.
* Capabilities are advancing, while global AI summits increasingly focus on commercialization over guardrails.
What they're saying: Future of Life Institute founder Max Tegmark, who has long warned of the risks of leaving AI unregulated, says the AI companies have for years stalled any broad and meaningful regulation of potential harms.
* "It's their fault that we have the race condition in the first place," Tegmark told Axios.
* He argues that if companies had pushed to turn their voluntary commitments into law, the race dynamic might not have escalated.
* "All of [the AI labs] succumb to the incentives," he said. "It's just maybe Anthropic is the most striking one because they were the ones who always talk such a big game about safety."

The other side: Choosing competitive power over safety may not be a winning strategy with consumers.
* Anthropic has surged to the top of Apple's App Store download charts in the days since it stood up to the Pentagon.

What we're watching: Even Tegmark says he is seeing some signs that regulation may soon be possible.
* Mounting evidence of children and teen chatbot risks has sparked rare bipartisan concern -- what he calls a "Bernie to Bannon" coalition.

What's next: Laws mandating that companies test models before they release them, at least to make sure they aren't pushing people toward self-harm.
[3]
Anthropic's AI safety policy just changed for this reason
When Anthropic launched years ago, the company wanted an industry-wide "race to the top" in artificial intelligence, instead of a race to the bottom in pursuit of customers and market dominance that would inadvertently lead to catastrophic safety risks. So Anthropic adopted safety principles and policies that it hoped its competitors would also implement. In some instances, companies, including Google and OpenAI, did, according to Anthropic. Still, those hopes didn't "pan out," according to a blog post the company published Tuesday. The post announced that Anthropic, the maker of the AI chatbot Claude, is altering key safety practices to meet what it views as present-day challenges. Specifically, Anthropic will no longer automatically pause model development if it could be considered dangerous; instead, it will consider its competitors' actions and whether they release models with similar capabilities. Previously, Anthropic committed to safeguards that would reduce its models' absolute risk, regardless of whether other AI developers did the same. "The policy environment has shifted toward prioritizing AI competitiveness and economic growth, while safety-oriented discussions have yet to gain meaningful traction at the federal level," the company wrote. "We remain convinced that effective government engagement on AI safety is both necessary and achievable, and we aim to continue advancing a conversation grounded in evidence, national security interests, economic competitiveness, and public trust. But this is proving to be a long-term project -- not something that is happening organically as AI becomes more capable or crosses certain thresholds." Though Anthropic said it aims to continue leading on safety, its latest decision reflects the breakneck speed at which competitors are releasing new models. Anthropic has also been under intense pressure this week from the U.S. Defense Department, which is pressing the company to allow the military to use its AI tools for any purpose, including mass surveillance or the deployment of autonomous weapons without human oversight. Anthropic has yet to relent on those points in contract negotiations with the Defense Department, reportedly stirring the ire of Defense Secretary Pete Hegseth, who threatened to sever the company's relationship with the military, Axios reports. Anthropic has participated in an AI pilot program for military-related imagery analysis, along with Google, OpenAI, and xAI, according to the New York Times. Though Claude has been the only chatbot working on the government's classified systems, a Pentagon official said Anthropic could be replaced by another firm.
[4]
Anthropic drops its signature safety promise and rewrites AI guardrails
The policy overhaul highlights the growing tension between caution and competition
* Anthropic has removed its pledge not to train or release AI models without guaranteed safety mitigations in advance
* The company will now rely on transparency reports and safety roadmaps instead of strict preconditions
* Critics argue the shift shows the limits of voluntary AI safety commitments without binding regulation

Anthropic has formally abandoned the central promise not to train or release frontier AI systems unless it can guarantee adequate safety in advance. The company behind Claude confirmed the decision in an interview with Time, marking the end of a policy that had once set it apart among AI developers. The newly revised Responsible Scaling Policy focuses more on ensuring the company stays competitive as the AI marketplace heats up. For years, Anthropic framed that pledge as evidence that it would resist the commercial pressures pushing competitors to ship ever more powerful systems. The policy effectively barred it from advancing beyond certain levels unless predefined safety measures were already in place. Now, Anthropic is using a more flexible framework rather than categorical pauses. The company insists the change is pragmatic rather than ideological. Executives argue that unilateral restraint no longer makes sense in a market defined by rapid iteration and geopolitical urgency. But the shift feels like a turning point in how the AI industry thinks about self-regulation. Under the new Responsible Scaling Policy, Anthropic pledges to publish detailed "Frontier Safety Roadmaps" outlining its planned safety milestones, along with regular "Risk Reports" that assess model capabilities and potential threats. The company also says it will match or exceed competitors' safety efforts and delay development if it both believes it leads the field and identifies significant catastrophic risk.
What it will no longer do is promise to halt training until all mitigations are guaranteed in advance. Everyday users might not notice any changes as they interact with Claude or other AI tools. Yet the guardrails that govern how those systems are trained influence everything from accuracy to fraudulent misuse. When the company, once defined by its strict preconditions, decides those conditions are no longer workable, it signals a broader recalibration within the industry.

Claude control

When Anthropic introduced its original policy in 2023, some executives hoped it might inspire rivals or even inform eventual regulation. That regulatory momentum never fully materialized. Federal AI legislation remains stalled, and the broader political climate has tilted away from developing any framework. Companies are left to choose between voluntary restraint and competitive survival. Anthropic is growing rapidly, with both revenue and its portfolio surpassing rivals like OpenAI and Google, even poking fun at ChatGPT getting ads in a Super Bowl advertisement. But the company clearly saw the safety redline as an impediment to that growth. Anthropic maintains that its revised framework preserves meaningful safeguards. The new Roadmaps are intended to create internal pressure to prioritize mitigation research. The forthcoming Risk Reports aim to provide a clearer public accounting of how model capabilities might lead to misuse. "The new policy still includes some guardrails, but the core promise, that Anthropic would not release models unless it could guarantee adequate safety mitigations in advance, is gone," said Nik Kairinos, CEO and co-founder of RAIDS AI, an organization focused on independent monitoring and risk detection in AI. "This is precisely why continuous, independent monitoring of AI systems matters. Voluntary commitments can be rewritten. Regulation, backed by real-time oversight, cannot."
Kairinos also noted the irony in Anthropic's $20 million contribution a couple of weeks ago to Public First Action, a group supporting congressional candidates pledging to push for AI safety regulation. That contribution, he suggested, underscores the complexity of the current moment. Companies may advocate for stronger regulation while simultaneously recalibrating their own internal constraints. The broader question facing the industry is whether voluntary norms can meaningfully shape the trajectory of transformative technologies. Anthropic once attempted to anchor itself as a model of restraint. Its revised policy now requires it to account for competition. That does not mean safety has been abandoned, but it does mean the order of operations has shifted. The average person may not read Responsible Scaling Policies or Risk Reports, but they live with the downstream effects of those decisions. Anthropic argues that meaningful safety research requires staying at the frontier, not stepping back from it. Whether that philosophy proves reassuring or unsettling depends largely on one's view of how fast AI should move and how much risk society is willing to tolerate in exchange for progress.
[5]
Anthropic Drops Its Huge Safety Pledge That Was Supposedly the Whole Point of the Company
In 2021, a splinter group of former OpenAI employees founded a new startup, Anthropic, to pursue building AI models with a renewed focus on safety, after feeling that their employer had gone astray. OpenAI itself was originally founded on beneficent principles and a commitment to transparency, but then took billions of dollars in investment from Microsoft and made its tech closed-source, prompting the exodus. Now, Anthropic may be heading down the same path as its rivals. On Tuesday, it revealed a new version of its Responsible Scaling Policy that drops its core safety commitment first made in 2023: to stop training and refuse to deploy an AI system if it couldn't guarantee it had proper safety guardrails in place that met stringent internal standards. The new sentiment among the company's leadership is that this has become an unneeded chain around its foot. "We felt that it wouldn't actually help anyone for us to stop training AI models," Anthropic's chief science officer Jared Kaplan told TIME in an interview. "We didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments... if competitors are blazing ahead." The updated policy, it's fair to say, flagrantly contradicts the organization's entire raison d'etre. Anthropic has presented itself as the adult in the room in an industry dominated by outrageous boosterism and a flippant attitude towards ethics. Its carefully crafted safety-centric image is perhaps best exemplified by CEO Dario Amodei's mythologizing that in the summer of 2022, he made the call to abstain from releasing Anthropic's powerful AI model he knew would change the world because he was too worried about its risks; months later, OpenAI released ChatGPT, and stole all the headlines. So why the drastic reversal? Anthropic provides several reasons in its announcement. One of them is an "anti-regulatory political climate."
Amodei has long pushed for stronger AI regulations, an ambition that more or less went up in smoke once the Trump administration took charge. In particular, he criticized Trump's attempt to impose a sweeping ban on states' ability to pass their own AI regulation -- meaning that AI companies would only be beholden to much weaker federal laws -- earning Amodei frequent attacks by administration figures, who have accused him of fear-mongering. And so with no robust legal framework forthcoming, there was nothing binding its competitors to play by the same rules that Anthropic purports to follow. That, Anthropic argues, means that any safety research and measures it conducted would by default be outdated as the rest of the industry continued to build even more powerful models. "If one AI developer paused development to implement safety measures while others moved forward training and deploying AI systems without strong mitigations, that could result in a world that is less safe," it argued in its new policy. "The developers with the weakest protections would set the pace, and responsible developers would lose their ability to do safety research." Perhaps there's a kernel of truth to that logic. But it's a spurious justification for Anthropic to drop a central pillar of its safety act. Anthropic indeed cannot control what its competitors do, but is that reason to stop even pretending to lead by example? Regulatory climates, after all, can change. And calls for AI safety will not go away. Arguably, they'll only mount as the industry's contradicting promises become more obvious and the risks the technology poses become even more consequential. The timing of the policy change can't be overlooked, either. Ethical as it may purport to be, Anthropic enjoys a $200 million contract with the Pentagon it signed last summer to deploy Claude across the military.
But that critical money faucet is now in jeopardy, as Trump officials reportedly threatened to cut off Anthropic over the company's insistence that its tech shouldn't be used for mass surveillance and autonomous weaponry. Defense secretary Pete Hegseth met with Amodei on Tuesday, and gave the CEO an ultimatum, Axios reported: lighten Anthropic's AI safeguards to make them more amenable to the military, or the Pentagon will either cut off the company and declare it a "supply chain risk," or invoke the Defense Production Act to force Anthropic to share its AI technology. But if the new policy is a capitulation by Anthropic, Kaplan, the chief science officer and co-founder, doesn't see it that way. "I don't think we're making any kind of U-turn," Kaplan told Time.
[6]
Anthropic, OpenAI Dial Back Safety Language as AI Race Accelerates - Decrypt
Experts say the shift reflects political, economic, and intellectual changes. Anthropic has dropped a central safety pledge from its Responsible Scaling Policy, according to a report by TIME. The changes loosen a commitment that once barred the Claude AI developer from training advanced AI systems without guaranteed safeguards in place. The move reshapes how the company positions itself in the AI race against rivals OpenAI, Google, and xAI. Anthropic has long cast itself as one of the industry's most safety-focused labs, but under the revised policy, Anthropic no longer promises to halt training if risk mitigations are not fully in place. "We felt that it wouldn't actually help anyone for us to stop training AI models," Anthropic's chief science officer, Jared Kaplan, told TIME. "We didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments ... if competitors are blazing ahead." The change comes as Anthropic finds itself embroiled in a public dispute with U.S. Defense Secretary Pete Hegseth over refusing to grant the Pentagon full access to Claude, making it the only major AI lab among Google, xAI, Meta, and OpenAI to take that stance. Edward Geist, a senior policy researcher at the RAND Corporation, said the earlier "AI safety" framing emerged from a specific intellectual community that predated today's large language models. "As of a few years ago, there was the field of AI safety," Geist told Decrypt. "AI safety was associated with a particular set of views that came out of the community of people who cared about powerful AI before we had these LLMs." Geist said early AI safety advocates were working from a very different vision of what advanced artificial intelligence would look like. "They ended up conceptualizing the problem in a way that, in some respects, was envisioning something qualitatively different from these current LLMs, for better or worse," Geist said. 
Geist said the language change also sends a signal to investors and policymakers. "Part of it is signaling to various constituencies that a lot of these companies want to give the impression that they are not holding back in the economic competition because of concerns about 'AI safety,'" he said, adding that the terminology itself is changing to fit the times. Anthropic is not alone in revising its safety language. A recent report by the nonprofit news organization The Conversation noted how OpenAI also changed its mission statement in its 2024 IRS filing, removing the word "safely." The company's earlier statement pledged to build general-purpose AI that "safely benefits humanity, unconstrained by a need to generate financial return." The updated version now states its goal is "to ensure that artificial general intelligence benefits all of humanity." "The problem with the term AI security is that no one seems to know what that means exactly," Geist said. "Then again, the AI safety term was also contested." Anthropic's new policy emphasizes transparency measures such as publishing "frontier safety roadmaps" and regular "risk reports," and says it will delay development if it believes there is a significant risk of catastrophe. Anthropic and OpenAI's policy shifts come as the companies look to strengthen their commercial position. Earlier this month, Anthropic said it raised $30 billion at a valuation of about $380 billion. At the same time, OpenAI is finalizing a funding round backed by Amazon, Microsoft, and Nvidia that could reach $100 billion. Anthropic and OpenAI, along with Google and xAI, have been awarded lucrative government contracts with the U.S. Department of Defense. For Anthropic, however, the contract appears in doubt as the Pentagon weighs whether to cut ties to the AI firm over access complaints.
As capital pours into the sector and geopolitical competition intensifies, Hamza Chaudhry, AI and National Security Lead at the Future of Life Institute, said the policy change reflects shifting political dynamics rather than a bid for Pentagon business. "If that were the case, they would have just backed down from what the Pentagon said a week ago," Chaudhry told Decrypt. "Dario [Amodei] wouldn't have shown up to meet." Instead, Chaudhry said the rewrite reflects a turning point in how AI companies talk about risk as political pressure and competitive stakes rise. "Anthropic is now saying, 'Look, we can't keep saying safety, we can't unconditionally pause, and we're going to push for much lighter-touch regulation,'" he said.
[7]
Anthropic ditches its defining safety promise to pause dangerous AI development because it's basically pointless when everybody else is 'blazing ahead'
There's apparently no point being the only careful AI company. Given the way the AI industry is going these days, the following news probably isn't a huge surprise. But it's unnerving all the same. Announced in a new blog post, Anthropic, arguably the sole remaining example among the major AI players that really bigs up its safety responsibilities, has ditched its core commitment to "pause" development of more powerful AI models if suitable safety safeguards aren't ready. In previous versions of what Anthropic calls its Responsible Scaling Policy (RSP), the organisation said that if its AI systems approached certain dangerous capability thresholds -- particularly around catastrophic misuse -- it would halt further scaling or deployment until adequate safety measures were in place. To quote Anthropic's original commitment, "the ASL (AI Safety Levels) system implicitly requires us to temporarily pause training of more powerful models if our AI scaling outstrips our ability to comply with the necessary safety procedures." But that commitment is now gone from Anthropic's newly updated RSP. In Version 3.0 of the RSP, Anthropic has dumped explicit references to "pausing" of development in favour of softer language focused on "responsible development," "risk management," and "iterative deployment." Instead of promising to stop training models that cross specific danger thresholds, the company now says it will implement safeguards, publish safety evaluations, and release Frontier Safety Framework updates explaining how risks are being handled. So, why is this happening? Partly, Anthropic seems to be saying, because it's futile being the only AI outfit explicitly committed to safety. 
"If one AI developer paused development to implement safety measures while others moved forward training and deploying AI systems without strong mitigations, that could result in a world that is less safe -- the developers with the weakest protections would set the pace, and responsible developers would lose their ability to do safety research and advance the public benefit," Anthropic's full policy document says. "We didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments...if competitors are blazing ahead," Anthropic's chief science officer Jared Kaplan also told Time magazine. Inevitably, Anthropic is pitching the changes to its Responsible Scaling Policy as a net positive for safety. Long story short, it says the new policy adds a commitment to produce ongoing, publicly shareable roadmaps and risk reports that are intended to show how Anthropic is thinking about and managing safety issues as models become more capable. "This third revision amplifies what worked about the previous RSP, commits us to more transparency about our plans and our risk considerations, and separates out our recommendations for the industry at large from what we can achieve as an individual company," the new policy doc says. Which is nice. But it still seems a long way short of its old commitment to essentially down tools if the bots threatened to get out of control. Oh well.
[8]
Anthropic drops its industry-leading safety pledge -- what changed and why it matters
Summary
* Anthropic is softening its safety pledge.
* The company's former hard stop on development is replaced with a promise of increased transparency.
* The move risks lowering industry safety standards.

Anthropic, makers of the AI agent Claude (one of our favorite productivity tools), has walked back one of its key differentiators -- its strict safety pledge. In a blog post, the company outlined its Responsible Scaling Policy (RSP) and the changes included in version 3.0.

What was Anthropic's old safety policy?
The company set the industry standard for safety guardrails. To understand the changes Anthropic is making, you need to understand the original RSP. In 2023, Anthropic made a commitment to stop training AI models if their capabilities outpaced the company's ability to prove they were safe. Safety red flags that would trip this switch included:
* Models that could assist in the creation or deployment of chemical, biological, or nuclear weapons.
* Models that could improve themselves to an excessive degree.
* Models that could assist in cyberattacks.
* Models that could act in certain ways without human input, such as "escaping" their environments to avoid shutdown.
The RSP placed a hard stop on these models -- Anthropic would stop development even if it meant being outpaced by competitors. It was a bold stand in an industry where everyone seemed to be racing ahead at breakneck speed.

Anthropic's new RSP
Version 3.0 significantly softens the rules. Anthropic still has a Responsible Scaling Policy, but with version 3.0, the company will only pause development if it believes it already has a significant lead on competitors.
The binding promise to stop is replaced by a promise to be transparent about whether the company is meeting its safety goals and to match or exceed the safety of competitors. In other words, the safety pledge is essentially gone.

Why these changes?
Anthropic says that the original RSP did not have the effect it had hoped. The goal of the RSP was to have Anthropic set the safety example that other companies would then follow. Unfortunately, competitors didn't really catch the hint. The company feels that by limiting itself, it is essentially letting competitors that are less concerned with safety lead the market and set the pace of development.

What does this mean for the industry?
A big step backwards. Unfortunately, these changes might set a bad precedent in the AI space. Anthropic was the gold standard for safety practices, and with these changes, the ceiling has been lowered significantly. It could send the message to competitors that safety takes a back seat to innovation. While it might help Claude catch up to ChatGPT, it still seems like a move in the wrong direction.

At the end of the day, though, if the fears of AI doomers are going to be avoided, a single company's safety pledge isn't going to cut it -- the whole industry needs to come together and draw a line in the sand. At this point, it seems increasingly unlikely that this will happen.
[9]
Anthropic Overhauls Core Tenet Of Its Safety Policy | PYMNTS.com
The AI startup announced Tuesday (Feb. 24) that it had overhauled its Responsible Scaling Policy (RSP), its framework for managing catastrophic risks posed by advanced AI systems. The decision was first reported by TIME, which said the changes include ending a pledge not to release AI models if the company can't promise proper risk mitigation measures beforehand. "We felt that it wouldn't actually help anyone for us to stop training AI models," Anthropic's Chief Science Officer Jared Kaplan told the magazine in an exclusive interview. "We didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments ... if competitors are blazing ahead." Anthropic said that while the RSP has been successful in some areas, there are other places where its "theory of change" surrounding the policy has not come to pass. "Despite rapid advances in AI capabilities over the past three years, government action on AI safety has moved slowly," the startup said. "The policy environment has shifted toward prioritizing AI competitiveness and economic growth, while safety-oriented discussions have yet to gain meaningful traction at the federal level." Anthropic added that it still believes that government engagement on AI safety is necessary and achievable, though it is "proving to be a long-term project -- not something that is happening organically as AI becomes more capable or crosses certain thresholds." The company has also clashed with the Defense Department recently over the use of its Claude tools, with Anthropic telling the Pentagon it could not use the technology for autonomous weapons or domestic spying. According to multiple published reports, the government has given Anthropic until Friday (Feb. 27) to relax its policies or face the risk of losing its contract. An Anthropic spokesperson told the Wall Street Journal the new safety policy is not related to the negotiations with the Pentagon. In other Anthropic news, PYMNTS earlier this week examined how the company's technological updates might impact COBOL, the decades-old programming language underpinning all sorts of business transactions. (For example, it powers around 95% of all ATM transactions in the U.S.) Anthropic says its Claude Code tool can now automate the analysis, dependency mapping and documentation work that has traditionally made COBOL modernization so expensive. In response, IBM shares plunged, on track for their largest one-month decline in nearly 60 years. "The selloff reflects a deeper concern: if generative AI can meaningfully reduce the cost and time required to understand and rewrite legacy code, it could weaken one of the strongest sources of lock-in in enterprise IT," PYMNTS wrote.
Anthropic, once the industry's safety-focused alternative, has abandoned its 2023 commitment to pause AI development for catastrophic risks. The company will now consider competitor actions before delaying releases, marking a shift from absolute safety standards to competitive strategy. The change comes as the Pentagon threatens to cut its $200 million contract over Anthropic's refusal to allow autonomous weapons use.
Anthropic has formally revised its Responsible Scaling Policy, removing the pledge that defined its identity since 2023: the promise to pause model development if adequate safety mitigations couldn't be guaranteed in advance [1]. The company, which positioned itself as a safer alternative in the AI race, now states it will only delay development "until and unless we no longer believe we have a significant lead" over competitors [2]. This represents one of the most dramatic policy shifts in the industry, as startups once focused on helping humanity pivot toward profitability and competitive survival.
Source: PYMNTS
Founded in 2021 by former OpenAI employees seeking to prioritize AI safety, Anthropic hoped to inspire an industry-wide "race to the top" rather than a race to the bottom [3]. Chief Science Officer Jared Kaplan told TIME that "we didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments... if competitors are blazing ahead" [5]. The company's original vision has not panned out as hoped, with the policy environment now prioritizing AI competitiveness and economic growth over safety-oriented discussions at the federal level [3].

The revision reflects mounting competitive pressures among AI labs and between nations, particularly the U.S. and China. Google DeepMind CEO Demis Hassabis has warned that "race conditions" can drive reckless decisions as the world approaches superhuman AI [2]. Max Tegmark, founder of the Future of Life Institute, argues that AI companies have stalled meaningful regulation for years. "It's their fault that we have the race condition in the first place," Tegmark told Axios, noting that Anthropic is "the most striking one because they were the ones who always talk such a big game about safety" [2].
Source: Axios
The new framework emphasizes transparency through "Frontier Safety Roadmaps" and regular "Risk Reports" rather than categorical pauses [4]. Anthropic maintains it will match or exceed competitors' safety efforts, but the core promise that distinguished Claude from rivals is gone. "The new policy still includes some guardrails, but the core promise, that Anthropic would not release models unless it could guarantee adequate safety mitigations in advance, is gone," said Nik Kairinos, CEO of RAIDS AI [4].

The timing coincides with escalating tensions between Anthropic and the Pentagon over a $200 million military contract signed last summer [5]. Defense Secretary Pete Hegseth reportedly gave Dario Amodei an ultimatum: lighten AI guardrails to make them more amenable to military use, or face being cut off and labeled a "supply chain risk" [5]. Anthropic has refused to allow its models to be used for autonomous weapons or domestic surveillance, prompting the Defense Department to cut its use of Claude [2].
Source: Futurism
This highlights a critical problem with voluntary AI safety commitments: even if one company refuses on safety grounds, another steps in. Hours after Anthropic's stance became public, OpenAI announced a deal to provide models for classified networks [2]. Critics note that OpenAI's agreement leaves broad room for military use, including surveillance of U.S. citizens, despite the company sharing many of Anthropic's stated safety concerns.
The shift exposes the fundamental limits of self-regulation in the absence of binding AI rules. When Anthropic introduced its original policy in 2023, executives hoped it might inspire rivals or inform eventual legislation [4]. That regulatory momentum never materialized, and the current political climate has tilted away from developing frameworks. Companies now choose between voluntary restraint and competitive survival in an environment where capabilities advance while global AI summits increasingly focus on commercialization over guardrails [2].

Kairinos noted the irony that Anthropic recently donated $20 million to Public First Action, supporting congressional candidates who pledge to push for AI safety regulation, while simultaneously recalibrating its own internal constraints [4]. Tegmark suggests that mounting evidence of chatbot risks to children and teens has sparked rare bipartisan concern, potentially opening pathways for laws mandating that companies test frontier AI models before release [2].

Interestingly, Anthropic's Claude app surged to the top of Apple's App Store download charts in the days following its standoff with the Pentagon, suggesting consumers may value safety commitments over competitive positioning [2]. Yet everyday users might not notice immediate changes as they interact with Claude or other AI tools, even though the guardrails governing how systems are trained influence everything from accuracy to fraudulent misuse [4].
Anthropic argues that meaningful safety research requires staying at the frontier rather than stepping back, and that unilateral restraint could produce a less safe world in which "developers with the weakest protections would set the pace" [5]. Kaplan insists, "I don't think we're making any kind of U-turn," though critics view the policy change as flagrantly contradicting the organization's entire reason for existence [5]. As Hassabis has argued, the closer the world gets to superhuman AI, the more essential global cooperation becomes; yet the trajectory appears to be moving in the opposite direction [2].
Summarized by Navi
03 Mar 2026 • Policy and Regulation
