OpenAI warns upcoming AI models will likely pose high cybersecurity risk with zero-day exploits

Reviewed by Nidhi Govil


OpenAI has issued a stark warning that its upcoming AI models are expected to reach high cybersecurity risk levels, potentially capable of developing zero-day exploits and assisting sophisticated enterprise intrusions. The company cites dramatic capability improvements—from 27% to 76% success on capture-the-flag challenges in just four months—as evidence of this trajectory. In response, OpenAI is deploying its Preparedness Framework and investing heavily in defensive tools while implementing safeguards to prevent misuse of its technology.

OpenAI Sounds Alarm on Rising Cyber Capabilities

OpenAI has issued a warning that the cybersecurity risk posed by its advancing AI models is climbing to what it classifies as high levels. The company stated in a recent announcement that upcoming AI models will likely reach capabilities sufficient to develop working zero-day exploits against well-defended systems or to meaningfully assist with complex, stealthy intrusion operations aimed at real-world effects.[1][3] This escalation reflects the dual-use risk inherent in advanced AI models, which can serve defensive and offensive purposes in equal measure.[5]

The concern centers on weaponized artificial intelligence that could automate brute-force attacks, generate malware, craft phishing content, and refine existing code to make cyberattack chains more efficient.[1] According to OpenAI's Fouad Matin, the forcing function driving this risk is models' ability to work autonomously for extended periods, which enables these kinds of persistent attacks.[3]

Source: Axios

Dramatic Capability Surge in Recent Months

The evidence supporting OpenAI's warning comes from measurable performance improvements. In capture-the-flag challenges, traditionally used to test cybersecurity capabilities in controlled environments, GPT-5 scored just 27% in August 2025; by November 2025, GPT-5.1-Codex-Max achieved a 76% success rate, a substantial leap in just four months.[1][3][5] This trajectory suggests that sophisticated cyberattacks could become accessible to a broader range of threat actors, significantly expanding the pool of individuals capable of executing complex operations.[3]

OpenAI expects this upward trend to continue, stating that it is planning and evaluating as though each new model could reach high levels of cybersecurity capability as measured by the OpenAI Preparedness Framework.[2][4] High is the second-highest risk classification, sitting just below critical, the threshold at which models are deemed unsafe for public release.[3]

Deploying the Preparedness Framework

To address these mounting concerns, OpenAI is relying on its Preparedness Framework, last updated in April 2025, which outlines the company's approach to balancing innovation with risk mitigation.[1] The framework establishes measurable thresholds that indicate when AI models could cause severe harm across three priority categories: cybersecurity, chemical and biological threats, and persuasion capabilities.[1] OpenAI has committed not to deploy highly capable models until sufficient safeguards are in place to minimize the associated risks.[1]

Source: ET

Because offensive and defensive cyber tasks rely on the same underlying knowledge, OpenAI is adopting a defense-in-depth approach rather than depending on any single safeguard to prevent misuse of its technology.[2] The company is training models to detect and refuse malicious requests, though this presents challenges since threat actors can masquerade as defenders to generate output later used for criminal activity.[1]
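To make the defense-in-depth idea concrete, here is a minimal, purely illustrative sketch of layering independent safeguards so that no single check is a point of failure. The check names, signals, and thresholds are hypothetical assumptions for illustration, not OpenAI's actual implementation.

```python
from dataclasses import dataclass
from enum import Enum

class Verdict(Enum):
    ALLOW = "allow"
    REFUSE = "refuse"
    REVIEW = "human_review"

@dataclass
class Request:
    text: str
    account_verified: bool  # e.g., enrolled in a trusted-defender program (assumed signal)

def intent_score(req: Request) -> float:
    """Stand-in for a model-based classifier scoring offensive-cyber intent in [0, 1]."""
    suspicious = ("zero-day", "exploit chain", "bypass edr")
    return 0.9 if any(s in req.text.lower() for s in suspicious) else 0.1

def policy_rules(req: Request) -> bool:
    """Hypothetical hard rule: unverified accounts get no intrusion-tooling help."""
    return req.account_verified or "intrusion" not in req.text.lower()

def screen(req: Request) -> Verdict:
    # Layer 1: hard policy rules fail closed.
    if not policy_rules(req):
        return Verdict.REFUSE
    # Layer 2: model-based intent score; ambiguous cases go to human review,
    # since attackers can masquerade as defenders.
    score = intent_score(req)
    if score > 0.8:
        return Verdict.REFUSE
    if score > 0.4:
        return Verdict.REVIEW
    return Verdict.ALLOW

print(screen(Request("help me write a zero-day exploit chain", False)))  # Verdict.REFUSE
```

The point of the layering is that a request must clear every independent check; defeating the classifier alone (say, by posing as a defender) still leaves the policy layer and human review in its path.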

Strengthening Defensive Cybersecurity Systems

While the risks are significant, OpenAI emphasizes that these same autonomous capabilities can enhance defensive AI capabilities for security professionals. The company is investing heavily in strengthening models for defensive cybersecurity tasks and creating tools that enable defenders to perform workflows such as code auditing and vulnerability patching at scale

2

4

5

.
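As a loose illustration of the "code auditing at scale" workflow described above, the sketch below feeds source files to a model through the OpenAI Python SDK and prints its findings. The model name, prompt, and file layout are assumptions for illustration; this is not Aardvark or any published OpenAI tool.

```python
from pathlib import Path
from openai import OpenAI  # pip install openai

client = OpenAI()  # reads OPENAI_API_KEY from the environment

AUDIT_PROMPT = (
    "You are a defensive security reviewer. List likely vulnerabilities "
    "in the following code, with line references and suggested fixes."
)

def audit_file(path: Path) -> str:
    """Ask the model to review one source file; returns its findings as text."""
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed model choice; swap in any capable model
        messages=[
            {"role": "system", "content": AUDIT_PROMPT},
            {"role": "user", "content": path.read_text(errors="replace")},
        ],
    )
    return resp.choices[0].message.content

# Scan every Python file under src/ (hypothetical layout) and print findings.
for f in sorted(Path("src").rglob("*.py")):
    print(f"== {f} ==\n{audit_file(f)}\n")
```

The "at scale" part is simply that the loop is cheap to parallelize across a whole codebase, which is what makes model-driven auditing attractive to under-resourced defenders.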

OpenAI plans to introduce a program providing cyberdefense workers with access to enhanced capabilities in its models.[5] The company is also testing Aardvark, an agentic security researcher, and establishing the Frontier Risk Council, an advisory group bringing together security practitioners and OpenAI teams.[5] The goal is for AI models and products to bring significant advantages to defenders, who are often outnumbered and under-resourced.[1][2]

Source: Digit

Multi-Layered Defense Strategy

OpenAI's risk mitigation strategy combines multiple layers of protection. The company is implementing access controls, infrastructure hardening, egress controls, and system-wide monitoring to detect potentially malicious cyber activity.[4][5] When activity appears unsafe, the company may block output, route prompts to safer or less capable models, or escalate for enforcement.[1]
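The block / route / escalate decision described above maps naturally onto a small dispatch function. The sketch below is a hypothetical rendering of that logic under assumed risk thresholds and signals, not OpenAI's actual enforcement pipeline.

```python
from enum import Enum

class Action(Enum):
    SERVE = "serve_normally"
    ROUTE_SAFER = "route_to_safer_model"
    BLOCK = "block_output"
    ESCALATE = "escalate_for_enforcement"

def dispatch(risk_score: float, repeat_offender: bool) -> Action:
    """Map a monitored request's risk signals to one of the documented outcomes.

    risk_score: assumed [0, 1] output of upstream system-wide monitoring.
    repeat_offender: assumed flag from threat-intelligence history.
    """
    if risk_score >= 0.9 and repeat_offender:
        return Action.ESCALATE      # hand off for enforcement review
    if risk_score >= 0.9:
        return Action.BLOCK         # refuse to return the output
    if risk_score >= 0.5:
        return Action.ROUTE_SAFER   # answer via a more conservative model
    return Action.SERVE

assert dispatch(0.95, True) is Action.ESCALATE
assert dispatch(0.60, False) is Action.ROUTE_SAFER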

The organization is working with red-team providers to evaluate and improve its safety measures, using offensive testing to uncover defensive weaknesses for remediation.[1] Dedicated threat intelligence and insider risk programs have been launched as part of this comprehensive approach.[1] OpenAI acknowledges this is ongoing work and expects to keep evolving these programs as it learns what most effectively advances real-world security.[5]
