GPT-5.5 matches Mythos in cybersecurity tests, challenging Anthropic's restricted release

Reviewed byNidhi Govil

15 Sources

Share

OpenAI's publicly released GPT-5.5 performed just as well as Anthropic's restricted Mythos model on cybersecurity evaluations, achieving 71.4% success on expert-level tasks. The UK's AI Security Institute found both models succeeded at previously impossible network penetration tests, raising questions about whether Mythos warranted its limited release amid accusations of fear-based marketing.

GPT-5.5 Achieves Similar Cybersecurity Performance as Mythos

New research from the UK's AI Security Institute (AISI) reveals that OpenAI's GPT-5.5, which launched publicly last week, reached "a similar level of performance on our cyber evaluations" as Anthropic's heavily restricted Mythos Preview model

1

. On expert-level Capture the Flag challenges designed to test capabilities in reverse engineering, web exploitation, and cryptography, GPT-5.5 passed an average of 71.4 percent of tasks, slightly higher than the 68.6 percent achieved by Mythos Preview, though within the margin of error

1

.

Source: Gadgets 360

Source: Gadgets 360

The AISI has tested frontier AI models through 95 different challenges since 2023, and both GPT-5.5 and Mythos demonstrated unprecedented success on "The Last Ones" simulation, which mimics a 32-step data extraction attack on a corporate network. GPT-5.5 succeeded in 3 of 10 attempts compared to Mythos Preview's 2 of 10—no previous model had ever succeeded even once

1

. In one particularly difficult task involving building a disassembler to decode a Rust binary, GPT-5.5 solved the challenge in 10 minutes and 22 seconds with no human assistance at a cost of $1.73 in API calls

1

.

Questions About Anthropic's Restricted Release Strategy

The comparable performance raises significant questions about whether Mythos warranted its restricted release through Project Glasswing, which granted exclusive access to select tech giants

3

. Anthropic claimed in April that Mythos represented such a severe cybersecurity threat that immediate public release would be too risky

3

. However, AISI's findings suggest Mythos was likely not "a breakthrough specific to one model" but rather "a byproduct of more general improvements in long-horizon autonomy, reasoning, and coding"

1

.

Source: The Register

Source: The Register

OpenAI CEO Sam Altman criticized what he calls fear-based marketing in promoting limited AI model releases. While acknowledging Mythos is "a great model for cybersecurity," Altman told the Core Memory podcast that "it is clearly incredible marketing to say, 'We have built a bomb. We are about to drop it on your head. We will sell you a bomb shelter for $100 million'"

1

. He predicted there will be "a lot more rhetoric about models that are too dangerous to release," while acknowledging that truly dangerous models will require controlled release programs

1

.

Reality Check on Finding Software Vulnerabilities

Early reports from Mythos preview users including AWS and Mozilla indicate that while the model excels at finding software vulnerabilities quickly and requires less hands-on guidance from security engineers, it hasn't eclipsed elite human security researchers

4

. Mozilla CTO Bobby Holley revealed that Mythos found 271 vulnerabilities in Firefox 150, noting "So far we've found no category or complexity of vulnerability that humans can find that this model can't," but added crucially: "We also haven't seen any bugs that couldn't have been found by an elite human researcher"

4

.

Source: PYMNTS

Source: PYMNTS

Cybersecurity researchers emphasize that Mythos didn't discover fundamentally new classes of software vulnerabilities. The vulnerabilities are generally variations of well-known and well-understood security flaws

3

. What concerned researchers was the intense scale and speed with which Mythos could find and exploit those vulnerabilities, not a fundamental change in the nature of the cybersecurity threat

3

.

OpenAI Follows Controlled Release Approach

Despite Altman's criticism, OpenAI is adopting similar controlled release programs for its own cybersecurity-focused models. In February, OpenAI rolled out its Trusted Access for Cyber pilot program, allowing security researchers and enterprises to verify their identities and register interest in studying frontier models for "legitimate defensive work"

1

. Last month, the company used this list to control the limited launch of GPT-5.4-Cyber, a model variant "purposely fine-tuned for additional cyber capabilities and with fewer capability restrictions"

1

.

On Thursday, Altman announced that GPT-5.5-Cyber would similarly be limited "to critical cyber defenders in the next few days," though details about who qualifies as "critical cyber defenders" and the model's specific capabilities remain unclear

2

. This staggered rollout represents a growing trend in the AI industry of companies branding their top models too dangerous for public release due to potential misuse by criminals

2

.

Security Concerns and Geopolitical Implications

Anthropic CEO Dario Amodei warned that AI has created a narrow window for tech firms, governments, and banks to fix tens of thousands of software vulnerabilities found by Mythos. Since AI models from geopolitical adversaries like China are "maybe six to 12 months" behind Anthropic's product, there is "roughly that amount of time" to address these issues before potential misuse by criminals becomes a more serious concern

5

. Amodei noted that the scale of potential cyber exploits has ballooned with each generation of Claude, with an earlier model finding roughly 20 vulnerabilities in Firefox while Mythos found nearly 300, bringing the total count across all software into the tens of thousands

5

.

Today's Top Stories

TheOutpost.ai

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Instagram logo
LinkedIn logo
Youtube logo
© 2026 TheOutpost.AI All rights reserved