Anthropic Mythos evolves faster, creates exploits

Anthropic Mythos Outpaces Predictions in Cybersecurity Testing

Anthropic Mythos is evolving at a pace that has caught even specialized AI safety researchers off guard. The UK AI Security Institute (AISI) reported on Wednesday that a newer version of the model has already surpassed both its earlier performance and OpenAI GPT-5.5, just one month after the initial release 1

. The updated Mythos Preview checkpoint completed both of AISI's cyber ranges, solving "The Last Ones" in 6 of 10 attempts and the previously unsolved "Cooling Tower" challenge in 3 of 10 attempts, marking the first time any model completed the second cyber range 1

Source: ZDNet

This rapid advancement in AI cybersecurity capabilities demonstrates that improvements aren't restricted to individual model releases but can happen within versions of a single model. AISI's time window benchmark for cybersecurity, which estimates how much work an AI can do compared to a human, shows the human-comparable task time is growing at an accelerating rate 4

. In February 2026, AISI internally estimated that the length of cyber tasks AI models could complete had doubled every 4.7 months since late 2024, already an acceleration from their November 2025 estimate of 8 months 1

AI Models Creating Exploits, Not Just Finding Software Vulnerabilities

The more pressing concern centers on whether AI models for cybersecurity can transform discovered flaws into functional exploits that work in real-world scenarios. New research from UC Berkeley, Max Planck Institute for Security and Privacy, UC Santa Barbara, Arizona State University, Anthropic, OpenAI, and Google provides a definitive answer through ExploitGym, a benchmark evaluating autonomous exploit development capabilities of AI agents .

Source: Axios

ExploitGym consists of 898 real software vulnerabilities found in applications, Google's V8 JavaScript engine, and the Linux kernel. Mythos Preview successfully exploited 157 test instances while GPT-5.5 managed 120 within the allotted two-hour window 2

. Even with standard security defenses like ASLR or the V8 sandbox activated, a meaningful number of exploits still worked. More strikingly, AI models creating exploits sometimes discovered and weaponized entirely different software vulnerabilities than the ones they were initially pointed at 2

Three-to-Five-Month Window Before AI-Driven Cyberattacks Become Norm

Palo Alto Networks has issued a stark warning about the timeline organizations face. "We now estimate a narrow three-to-five-month window for organizations to outpace the adversary before AI-driven cyberattacks start to become the new norm," according to a blog post on Wednesday 3

. This impending vulnerability deluge demands urgency from cybersecurity teams as they brace for attacks capable of exploiting previously unknown zero-day exploits.

The concerns have escalated to the highest levels, leading to White House officials meeting with bank leaders and technology giants 3

. Anthropic is scheduled to brief the Financial Stability Board (FSB), a global watchdog working with finance ministry officials and central bankers across the G20, on critical vulnerabilities Mythos has exposed "in every major operating system and web browser" 5

. Andrew Bailey, governor of the Bank of England, invited Anthropic to present these findings amid growing concerns that AI discovering vulnerabilities could threaten the stability of the global banking system 5

Source: TechRadar

Defensive Applications Show Promise but Raise Questions

Anthropic has provided Mythos to around 40 companies through Project Glasswing to enable offensive and defensive cybersecurity measures. Mozilla found and patched 423 Firefox security bugs in a single month after deploying the model on the web browser, including some that had persisted in the code for over 15 years 5

. However, many more companies have requested access, but a Trump administration request has prevented Anthropic from distributing the software further 5

While AISI's tests capped tasks at 2.5 million tokens to enable better performance comparisons over time, this inherently "understates what frontier models can do" 1

. In cyber range experiments using up to 100 million tokens, performance would likely continue improving beyond that budget, especially for recent models which disproportionately benefit from higher token limits 1

. This means AI safety assessments may not fully capture the capabilities these models possess when operating without constraints.

The race is now on to patch AI-discovered vulnerabilities as quickly as possible before adversaries and state-sponsored threat actors develop their own capabilities. While AI models such as Mythos are not yet widely part of the threat actors' toolkit, Google recently observed attackers using an AI model to discover a zero-day exploit chain for the first time 5

. What remains unclear is whether the current acceleration trend will hold or whether these findings indicate a lasting increase in AI security capabilities.🟡 compliments=🟡The selected images provide a comprehensive visual narrative for the story. "ar-138607" effectively captures the theme of Anthropic Mythos's rapid evolution and accelerated capabilities in the cybersecurity domain. "ar-138516" visually represents the concept of AI models creating exploits, going beyond mere vulnerability discovery, which is a central point of the article. Finally, "ar-138729" underscores the urgency of the three-to-five-month window before AI-driven cyberattacks become the norm, by symbolizing the increasing interaction between AI and critical systems. Together, these images enhance the story's impact by illustrating the core themes of rapid AI advancement, exploit creation, and the impending cyber threat landscape.🟡 bar_chart_description=🟡No bar chart was used.🟡 funnel_chart_description=🟡No funnel chart was used.

Anthropic Mythos evolves faster than expected, now creates working exploits from vulnerabilities

Anthropic Mythos Outpaces Predictions in Cybersecurity Testing

AI Models Creating Exploits, Not Just Finding Software Vulnerabilities

Three-to-Five-Month Window Before AI-Driven Cyberattacks Become Norm

Defensive Applications Show Promise but Raise Questions

References

Anthropic's Mythos is evolving faster than expected, reports AI safety agency

AI agents show they can create exploits, not just find vulns

AI-driven cyberattacks will start to be the 'new norm' in months, Palo Alto warns

AI models are getting better at replacing cybersecurity pros on certain tasks

Anthropic to present exposed Mythos flaws to global watchdog - claims critical vulnerabilities found 'in every major operating system and web browser'

Related Stories

Security researchers use Anthropic Mythos to expose macOS vulnerabilities on Apple M5 chips

OpenAI launches GPT-5.4-Cyber model days after Anthropic's Mythos sparks global security concerns

Anthropic's Mythos AI model triggers global alarm over cybersecurity and banking threats

Recent Highlights

Nvidia RTX Spark chips power new AI laptops with up to 128GB memory and local agent capabilities

Florida sues OpenAI and Sam Altman over ChatGPT safety, alleging AI harms linked to violence

Trump signs AI executive order seeking voluntary 30-day review after industry pushback

Recent Highlights

Today's Top Stories

Google launches Gemma 4 12B, bringing multimodal AI agents to consumer laptops with 16GB RAM

AI outperforms law professors 75% of the time in Stanford Law School study on legal reasoning

British lawmaker Jess Asato sues Elon Musk's xAI over Grok AI deepfakes in landmark case

Apple ditches Vision Pro successors, pivots to smart glasses with mass-market appeal