Anthropic Mythos evolves faster than expected, now creates working exploits from vulnerabilities

Reviewed byNidhi Govil

11 Sources

Share

The UK AI Security Institute reports Anthropic Mythos is advancing faster than anticipated, with capability doubling times shrinking from 8 months to around 4 months. The model now completes previously unsolved cybersecurity challenges and can create functional exploits from software vulnerabilities, raising concerns about AI-driven cyberattacks targeting critical infrastructure within months.

Anthropic Mythos Outpaces Predictions in Cybersecurity Testing

Anthropic Mythos is evolving at a pace that has caught even specialized AI safety researchers off guard. The UK AI Security Institute (AISI) reported on Wednesday that a newer version of the model has already surpassed both its earlier performance and OpenAI GPT-5.5, just one month after the initial release

1

. The updated Mythos Preview checkpoint completed both of AISI's cyber ranges, solving "The Last Ones" in 6 of 10 attempts and the previously unsolved "Cooling Tower" challenge in 3 of 10 attempts, marking the first time any model completed the second cyber range

1

.

Source: ZDNet

Source: ZDNet

This rapid advancement in AI cybersecurity capabilities demonstrates that improvements aren't restricted to individual model releases but can happen within versions of a single model. AISI's time window benchmark for cybersecurity, which estimates how much work an AI can do compared to a human, shows the human-comparable task time is growing at an accelerating rate

4

. In February 2026, AISI internally estimated that the length of cyber tasks AI models could complete had doubled every 4.7 months since late 2024, already an acceleration from their November 2025 estimate of 8 months

1

.

AI Models Creating Exploits, Not Just Finding Software Vulnerabilities

The more pressing concern centers on whether AI models for cybersecurity can transform discovered flaws into functional exploits that work in real-world scenarios. New research from UC Berkeley, Max Planck Institute for Security and Privacy, UC Santa Barbara, Arizona State University, Anthropic, OpenAI, and Google provides a definitive answer through ExploitGym, a benchmark evaluating autonomous exploit development capabilities of AI agents .

Source: Axios

Source: Axios

ExploitGym consists of 898 real software vulnerabilities found in applications, Google's V8 JavaScript engine, and the Linux kernel. Mythos Preview successfully exploited 157 test instances while GPT-5.5 managed 120 within the allotted two-hour window

2

. Even with standard security defenses like ASLR or the V8 sandbox activated, a meaningful number of exploits still worked. More strikingly, AI models creating exploits sometimes discovered and weaponized entirely different software vulnerabilities than the ones they were initially pointed at

2

.

Three-to-Five-Month Window Before AI-Driven Cyberattacks Become Norm

Palo Alto Networks has issued a stark warning about the timeline organizations face. "We now estimate a narrow three-to-five-month window for organizations to outpace the adversary before AI-driven cyberattacks start to become the new norm," according to a blog post on Wednesday

3

. This impending vulnerability deluge demands urgency from cybersecurity teams as they brace for attacks capable of exploiting previously unknown zero-day exploits.

The concerns have escalated to the highest levels, leading to White House officials meeting with bank leaders and technology giants

3

. Anthropic is scheduled to brief the Financial Stability Board (FSB), a global watchdog working with finance ministry officials and central bankers across the G20, on critical vulnerabilities Mythos has exposed "in every major operating system and web browser"

5

. Andrew Bailey, governor of the Bank of England, invited Anthropic to present these findings amid growing concerns that AI discovering vulnerabilities could threaten the stability of the global banking system

5

.

Source: TechRadar

Source: TechRadar

Defensive Applications Show Promise but Raise Questions

Anthropic has provided Mythos to around 40 companies through Project Glasswing to enable offensive and defensive cybersecurity measures. Mozilla found and patched 423 Firefox security bugs in a single month after deploying the model on the web browser, including some that had persisted in the code for over 15 years

5

. However, many more companies have requested access, but a Trump administration request has prevented Anthropic from distributing the software further

5

.

While AISI's tests capped tasks at 2.5 million tokens to enable better performance comparisons over time, this inherently "understates what frontier models can do"

1

. In cyber range experiments using up to 100 million tokens, performance would likely continue improving beyond that budget, especially for recent models which disproportionately benefit from higher token limits

1

. This means AI safety assessments may not fully capture the capabilities these models possess when operating without constraints.

The race is now on to patch AI-discovered vulnerabilities as quickly as possible before adversaries and state-sponsored threat actors develop their own capabilities. While AI models such as Mythos are not yet widely part of the threat actors' toolkit, Google recently observed attackers using an AI model to discover a zero-day exploit chain for the first time

5

. What remains unclear is whether the current acceleration trend will hold or whether these findings indicate a lasting increase in AI security capabilities.🟡 compliments=🟡The selected images provide a comprehensive visual narrative for the story. "ar-138607" effectively captures the theme of Anthropic Mythos's rapid evolution and accelerated capabilities in the cybersecurity domain. "ar-138516" visually represents the concept of AI models creating exploits, going beyond mere vulnerability discovery, which is a central point of the article. Finally, "ar-138729" underscores the urgency of the three-to-five-month window before AI-driven cyberattacks become the norm, by symbolizing the increasing interaction between AI and critical systems. Together, these images enhance the story's impact by illustrating the core themes of rapid AI advancement, exploit creation, and the impending cyber threat landscape.🟡 bar_chart_description=🟡No bar chart was used.🟡 funnel_chart_description=🟡No funnel chart was used.

Today's Top Stories

TheOutpost.ai

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Instagram logo
LinkedIn logo
Youtube logo
© 2026 TheOutpost.AI All rights reserved