GPT-5.5 Matches Mythos Cybersecurity Performance

GPT-5.5 Achieves Similar Cybersecurity Performance as Mythos

New research from the UK's AI Security Institute (AISI) reveals that OpenAI's GPT-5.5, which launched publicly last week, reached "a similar level of performance on our cyber evaluations" as Anthropic's heavily restricted Mythos Preview model 1

. On expert-level Capture the Flag challenges designed to test capabilities in reverse engineering, web exploitation, and cryptography, GPT-5.5 passed an average of 71.4 percent of tasks, slightly higher than the 68.6 percent achieved by Mythos Preview, though within the margin of error 1

Source: Gadgets 360

The AISI has tested frontier AI models through 95 different challenges since 2023, and both GPT-5.5 and Mythos demonstrated unprecedented success on "The Last Ones" simulation, which mimics a 32-step data extraction attack on a corporate network. GPT-5.5 succeeded in 3 of 10 attempts compared to Mythos Preview's 2 of 10—no previous model had ever succeeded even once 1

. In one particularly difficult task involving building a disassembler to decode a Rust binary, GPT-5.5 solved the challenge in 10 minutes and 22 seconds with no human assistance at a cost of $1.73 in API calls 1

Questions About Anthropic's Restricted Release Strategy

The comparable performance raises significant questions about whether Mythos warranted its restricted release through Project Glasswing, which granted exclusive access to select tech giants 3

. Anthropic claimed in April that Mythos represented such a severe cybersecurity threat that immediate public release would be too risky 3

. However, AISI's findings suggest Mythos was likely not "a breakthrough specific to one model" but rather "a byproduct of more general improvements in long-horizon autonomy, reasoning, and coding" 1

Source: The Register

OpenAI CEO Sam Altman criticized what he calls fear-based marketing in promoting limited AI model releases. While acknowledging Mythos is "a great model for cybersecurity," Altman told the Core Memory podcast that "it is clearly incredible marketing to say, 'We have built a bomb. We are about to drop it on your head. We will sell you a bomb shelter for $100 million'" 1

. He predicted there will be "a lot more rhetoric about models that are too dangerous to release," while acknowledging that truly dangerous models will require controlled release programs 1

Reality Check on Finding Software Vulnerabilities

Early reports from Mythos preview users including AWS and Mozilla indicate that while the model excels at finding software vulnerabilities quickly and requires less hands-on guidance from security engineers, it hasn't eclipsed elite human security researchers 4

. Mozilla CTO Bobby Holley revealed that Mythos found 271 vulnerabilities in Firefox 150, noting "So far we've found no category or complexity of vulnerability that humans can find that this model can't," but added crucially: "We also haven't seen any bugs that couldn't have been found by an elite human researcher" 4

Source: PYMNTS

Cybersecurity researchers emphasize that Mythos didn't discover fundamentally new classes of software vulnerabilities. The vulnerabilities are generally variations of well-known and well-understood security flaws 3

. What concerned researchers was the intense scale and speed with which Mythos could find and exploit those vulnerabilities, not a fundamental change in the nature of the cybersecurity threat 3

OpenAI Follows Controlled Release Approach

Despite Altman's criticism, OpenAI is adopting similar controlled release programs for its own cybersecurity-focused models. In February, OpenAI rolled out its Trusted Access for Cyber pilot program, allowing security researchers and enterprises to verify their identities and register interest in studying frontier models for "legitimate defensive work" 1

. Last month, the company used this list to control the limited launch of GPT-5.4-Cyber, a model variant "purposely fine-tuned for additional cyber capabilities and with fewer capability restrictions" 1

On Thursday, Altman announced that GPT-5.5-Cyber would similarly be limited "to critical cyber defenders in the next few days," though details about who qualifies as "critical cyber defenders" and the model's specific capabilities remain unclear 2

. This staggered rollout represents a growing trend in the AI industry of companies branding their top models too dangerous for public release due to potential misuse by criminals 2

Security Concerns and Geopolitical Implications

Anthropic CEO Dario Amodei warned that AI has created a narrow window for tech firms, governments, and banks to fix tens of thousands of software vulnerabilities found by Mythos. Since AI models from geopolitical adversaries like China are "maybe six to 12 months" behind Anthropic's product, there is "roughly that amount of time" to address these issues before potential misuse by criminals becomes a more serious concern 5

. Amodei noted that the scale of potential cyber exploits has ballooned with each generation of Claude, with an earlier model finding roughly 20 vulnerabilities in Firefox while Mythos found nearly 300, bringing the total count across all software into the tens of thousands 5

GPT-5.5 matches Mythos in cybersecurity tests, challenging Anthropic's restricted release

GPT-5.5 Achieves Similar Cybersecurity Performance as Mythos

Questions About Anthropic's Restricted Release Strategy

Reality Check on Finding Software Vulnerabilities

OpenAI Follows Controlled Release Approach

Security Concerns and Geopolitical Implications

References

Amid Mythos' hyped cybersecurity prowess, researchers find GPT-5.5 is just as good

OpenAI's new security model is for 'critical cyber defenders' only

Mythos AI is a cybersecurity threat, but it doesn't rewrite the rules of the game

Anthropic Mythos shaping up as nothingburger

Anthropic CEO warns of cyber 'moment of danger' as AI exposes thousands of vulnerabilities

Related Stories

Anthropic's Mythos AI faces unauthorized access as security breach exposes controlled rollout flaws

Security researchers use Anthropic Mythos to expose macOS vulnerabilities on Apple M5 chips

OpenAI launches GPT-5.4-Cyber model days after Anthropic's Mythos sparks global security concerns

Recent Highlights

Google bets on AI agents with Gemini 3.5 Flash, Spark, and Omni at I/O 2026

AI passes the Turing Test as GPT-4.5 appears more human than actual people in landmark study

OpenAI cracks 80-year-old Erdős problem, stunning mathematicians with AI's biggest math breakthrough

Recent Highlights

Today's Top Stories

Pope Leo XIV calls to disarm AI, warns tech elite threatens humanity in landmark encyclical

Huawei unveils Tau Scaling Law to bypass US chip sanctions, targets 1.4nm chips by 2031

Anthropic's Chris Olah says AI development cannot be left to tech companies alone at Vatican

99% of Executives Expect AI Layoffs Within Two Years as Worker Anxiety Surges