OpenAI warns upcoming AI models will likely pose high cybersecurity risk with exploit capabilities

Reviewed byNidhi Govil

3 Sources

Share

OpenAI has issued a warning that its next-generation AI models are expected to reach high cybersecurity risk levels under its Preparedness Framework. The company revealed dramatic capability increases, with recent models scoring 76% on penetration testing exercises compared to 27% months earlier. OpenAI is now preparing safeguards and investing in defensive AI tools to help security teams audit code and patch vulnerabilities at scale.

OpenAI Flags Rising Cybersecurity Risk in Next-Generation AI Models

OpenAI has issued a stark warning that its upcoming AI models will likely pose a "high" cybersecurity risk, marking a significant escalation in the dual-use capabilities of advanced AI models

1

2

. Under the company's Preparedness Framework, this rating means future models could develop working zero-day exploits or assist with sophisticated cyberattacks targeting enterprise intrusions and industrial systems

3

. The "high" designation sits just below the "critical" threshold at which models would be deemed unsafe for public release

2

.

Source: ET

Source: ET

Dramatic Capability Leap Shows Accelerating Trajectory

The warning comes amid evidence of rapid capability growth in recent releases. GPT-5 scored just 27% on a capture-the-flag cybersecurity exercise in August, but GPT-5.1-Codex-Max achieved a striking 76% success rate in the same test just months later

2

. This nearly threefold improvement demonstrates how quickly these systems are advancing. According to OpenAI's Fouad Matin, the key forcing function behind this escalation is the models' growing autonomous capabilities—specifically their ability to work for extended periods without human intervention, enabling brute-force attacks that require sustained effort over time

2

.

New Cybersecurity Measures Target Defense-First Approach

In response to these advanced cyber threats, OpenAI says it is preparing safeguards as if every new model could reach the high-risk threshold, ensuring progress is paired with strong risk controls

1

. The company is expanding investments in models designed to support defensive workflows, from code auditing to vulnerability patching at scale

1

3

. OpenAI emphasizes giving defenders an edge in a landscape where security teams are "outnumbered and under-resourced"

1

.

Source: Axios

Source: Axios

Because offensive and defensive cyber tasks rely on the same knowledge base, OpenAI is adopting a defense-in-depth strategy rather than depending on any single safeguard

1

. The company is implementing a mix of access controls, infrastructure hardening, egress controls, and monitoring to counter potential misuse

3

. The focus is on shaping "how capabilities are accessed, guided, and applied" to ensure AI strengthens cybersecurity rather than lowering barriers to attacks

1

.

What This Means for the Threat Landscape

The implications extend beyond OpenAI alone, as leading models across the industry are getting better at finding security vulnerabilities

2

. The growing capabilities could significantly expand the number of people able to carry out cyberattacks, democratizing access to sophisticated techniques previously limited to skilled threat actors

2

. However, Matin noted that brute-force attacks relying on extended autonomous operation are more easily defended against, stating "in any defended environment this would be caught pretty easily"

2

.

This warning follows a similar alert OpenAI issued regarding bioweapons risk in June, before releasing ChatGPT Agent in July—which indeed rated "high" on its risk levels

2

. The company has not specified exactly when to expect the first models rated high for cybersecurity risk or which model types could pose such threats

2

. As AI capabilities continue their upward trajectory, the race between offensive potential and defensive AI capabilities will define the security posture of organizations worldwide.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo