2 Sources
2 Sources
[1]
OpenAI Buying AI Security Startup Promptfoo to Safeguard AI Agents
OpenAI has agreed to buy Promptfoo, a startup that enables large businesses to find and fix security issues in artificial intelligence models during their development, part of a push to help corporate customers reduce possible risks from deploying AI software. The ChatGPT maker is set to announce Monday that it will integrate Promptfoo's technology into OpenAI Frontier, a recently launched platform that allows organizations to build and manage AI agents. Frontier is meant to help ensure each agent -- which OpenAI refers to as "AI coworkers" -- has appropriate guardrails and data access. Terms of the acquisition weren't disclosed. OpenAI and its rivals are racing to develop more advanced AI agents that can field complex tasks on a user's behalf, with limited need for human intervention. At the same time, the AI developer is working to convince a broader mix of businesses to pay for the technology by ensuring these products are efficient and safe. "OpenAI acquiring Promptfoo signals a clear commitment to making enterprise AI not just powerful, but safe and reliable at scale," said Ganesh Bell, managing director at Insight Partners, which led an $18.4 million funding round in the security startup last year. Two-year-old Promptfoo makes open-source tools for testing the security of AI systems as well as for helping companies try to attack their own products in order to find vulnerabilities, a process known as red-teaming. The San Francisco-based startup counts roughly a quarter of Fortune 500 firms as customers. Promptfoo is one of a host of startups working to develop cybersecurity products that use AI to guard against hackers even as bad actors turn to similar technology to probe for ways into critical networks. OpenAI, too, has moved to imbue its AI products and agents with security features. Last week, the company introduced an AI agent meant to help security teams find and patch vulnerabilities in large databases, similar to a tool from rival Anthropic PBC. As part of the new acquisition, OpenAI's Frontier will get automated security testing and red-teaming features. The product will also get capabilities to help organizations monitor changes and track testing in order to keep up with risk and compliance needs. OpenAI said it will keep building out Promptfoo's open-source work while adding the latter's technology to Frontier.
[2]
The open-source AI red-teaming tool used by Fortune 500 companies is now part of OpenAI
The acquisition of Promptfoo, which counts more than 125,000 developers and 30-plus Fortune 500 companies among its users, is OpenAI's most direct move yet into AI application security. Its technology will go into Frontier, the company's enterprise agent platform launched just a month ago. When Ian Webster was leading the LLM engineering team at Discord, shipping AI products to 200 million users, he noticed something the security industry had not yet caught up with: the tools his team relied on to keep those products safe were built for a different era. Traditional vulnerability scanners could not reason about prompt injection. Static analysis had nothing to say about a model that promised a user something it had no authority to deliver. The testing infrastructure for AI applications, he concluded, simply did not exist. So he built it himself, nights and weekends, as an open-source project. That project became Promptfoo. On Monday, OpenAI announced it is acquiring the company. The deal, terms of which were not disclosed, will see Promptfoo's technology integrated into OpenAI Frontier, the enterprise agent management platform that OpenAI launched in early February. In a post on X, OpenAI said the acquisition would "strengthen agentic security testing and evaluation capabilities" within Frontier, and pledged that Promptfoo would remain open source under its current licence, with continued support for existing customers. Promptfoo, which Webster co-founded with Michael D'Angelo - a former VP of engineering and head of AI at identity verification firm Smile Identity - launched commercially in 2024 with $5 million in seed funding from Andreessen Horowitz. The seed round attracted backing from a notable roster of angels, including Shopify CEO Tobi Lütke, Discord CTO Stanislav Vishnevskiy, and Okta co-founder Frederic Kerrest. By July 2025, the company had raised an $18.4 million Series A led by Insight Partners, with a16z again participating. Total funding ahead of the acquisition was approximately $23.4 million. At the time of the Series A, Promptfoo said it had more than 125,000 developers using its open-source framework and over 30 Fortune 500 companies running its enterprise platform in production. Customers span retail, telecoms, financial services, and media, sectors with acute exposure to the regulatory and reputational risks of AI failures. The product works by acting as an automated adversary. Rather than relying on manual penetration testing, Promptfoo's platform talks directly to a customer's AI application, through its chat interface or APIs, using specialised models and agents that behave like users, or specifically like attackers. When an attack succeeds, the platform records it, analyses why it worked, and iterates through an agentic reasoning loop to refine the test and expose deeper vulnerabilities. Risks the platform targets include prompt injection, data leakage, jailbreaks, and what Webster has called "application-level" failures: AI systems that promise users things they cannot deliver, or that reveal database contents to a customer service query, or that stray into political opinion in a homework tutor. It is precisely those application-level risks that make Promptfoo's acquisition a strategic fit for OpenAI's current direction. Frontier, which OpenAI has described as an attempt to create "AI coworkers" for the enterprise, is designed to give AI agents access to production systems, CRM platforms, data warehouses, internal ticketing tools, and to execute workflows with real-world consequences. Agents operating at that level of access create a correspondingly enlarged attack surface. Early customers named by OpenAI for Frontier include Uber, State Farm, Intuit, and Thermo Fisher Scientific: organisations for whom a misbehaving agent is not an inconvenience but a liability. OpenAI has been building out Frontier at speed. Since launching the platform on 5 February, the company has announced Frontier Alliances with Accenture, Boston Consulting Group, Capgemini, and McKinsey, enlisting the consulting firms to drive enterprise deployment. Separately, the company has been rolling out Codex Security, an AI-powered application security agent for software repositories, formerly known internally as Aardvark, which entered wider availability on the same day as the Promptfoo acquisition announcement. Promptfoo is not the only AI security product entering broader availability this month. Anthropic launched Claude Code Security in February, targeting similar vulnerability scanning use cases. The convergence suggests that as AI agents move into production at scale, the question of who secures them, and how, is fast becoming one of the defining commercial battlegrounds in enterprise AI. For Promptfoo's open-source community, OpenAI's commitment to keeping the project open source under its current licence will be the line to watch. The project has over 248 contributors, and its adoption by developers at companies across the AI industry - including, according to Promptfoo's own website, teams at Anthropic and Google - was built on the premise that the tool belonged to the developer community rather than to any one vendor. That promise now sits alongside a commercial integration into one of the most powerful enterprise AI platforms in the market.
Share
Share
Copy Link
OpenAI has acquired Promptfoo, a two-year-old AI security startup that helps Fortune 500 companies find and fix security vulnerabilities in AI models. The acquisition will integrate Promptfoo's open-source red-teaming technology into OpenAI Frontier, the enterprise agent management platform launched in February. The move signals OpenAI's commitment to making AI agents safe and reliable as they gain access to production systems with real-world consequences.
OpenAI announced Monday it has agreed to acquire Promptfoo, an AI security startup that enables large businesses to identify and fix security vulnerabilities in AI models during development
1
. The acquisition marks OpenAI's most direct move yet into AI application security, bringing aboard a company that counts more than 125,000 developers and over 30 Fortune 500 companies among its users2
. Terms of the deal were not disclosed, though Promptfoo had raised approximately $23.4 million in total funding, including an $18.4 million Series A led by Insight Partners in July 20252
.
Source: The Next Web
The ChatGPT maker will integrate Promptfoo's technology into OpenAI Frontier, the enterprise agent management platform launched just over a month ago in early February
1
. Frontier is designed to help organizations build and manage AI agents—what OpenAI calls "AI coworkers"—with appropriate guardrails and data access controls1
. Early customers include Uber, State Farm, Intuit, and Thermo Fisher Scientific, organizations for whom a misbehaving agent represents significant liability rather than mere inconvenience2
."OpenAI acquiring Promptfoo signals a clear commitment to making enterprise AI not just powerful, but safe and reliable at scale," said Ganesh Bell, managing director at Insight Partners
1
. The timing reflects a critical shift as AI agents move beyond experimental deployments into production systems with real-world consequences. Frontier gives these agents access to CRM platforms, data warehouses, internal ticketing tools, and the ability to execute workflows autonomously, creating a correspondingly enlarged attack surface that demands robust security testing2
.
Source: Bloomberg
OpenAI and its rivals are racing to develop more advanced AI agents that can handle complex tasks on a user's behalf with limited human intervention
1
. At the same time, the company is working to convince a broader mix of businesses to pay for the technology by ensuring these products are both efficient and safe1
. The acquisition addresses a gap that Promptfoo co-founder Ian Webster identified while leading the LLM engineering team at Discord: traditional vulnerability scanners cannot reason about prompt injection, and static analysis tools have nothing to say about models that promise users things they have no authority to deliver2
.Two-year-old Promptfoo makes open-source tools for testing AI security and helps companies attack their own products to find vulnerabilities through a process known as red-teaming
1
. The San Francisco-based startup counts roughly a quarter of Fortune 500 firms as customers across retail, telecoms, financial services, and media—sectors with acute exposure to regulatory and reputational risks of AI failures1
2
.The platform works by acting as an automated adversary, talking directly to a customer's AI application through its chat interface or APIs using specialized models and agents that behave like attackers . When an attack succeeds, the platform records it, analyzes why it worked, and iterates through an agentic reasoning loop to refine the test and expose deeper vulnerabilities
2
. Risks the platform targets include prompt injection, data leakage, jailbreaks, and what Webster has called "application-level" failures—AI systems that reveal database contents to customer service queries or stray into political opinion in homework tutors2
.Related Stories
As part of the deal, OpenAI Frontier will receive automated security testing and red-teaming features, along with capabilities to help organizations monitor changes and track testing to keep up with risk and compliance needs
1
. OpenAI pledged that Promptfoo would remain open-source under its current license, with continued support for existing customers2
. The company said it will keep building out Promptfoo's open-source work while adding the technology to Frontier1
.Since launching Frontier on February 5, OpenAI has announced Frontier Alliances with Accenture, Boston Consulting Group, Capgemini, and McKinsey, enlisting these consulting firms to drive enterprise deployment . Separately, the company has been rolling out Codex Security, an AI-powered application security agent for software repositories that entered wider availability on the same day as the Promptfoo acquisition announcement
2
.Promptfoo is one of several startups developing cybersecurity products that use AI to guard against hackers even as bad actors turn to similar technology to probe for ways into critical networks
1
. OpenAI has moved to imbue its AI products and agents with security features, including introducing an AI agent last week meant to help security teams find and patch vulnerabilities in large databases, similar to a tool from rival Anthropic1
. Anthropic launched Claude Code Security in February, targeting similar vulnerability scanning use cases2
.The convergence suggests that as AI agents move into production at scale, the question of who secures them and how is becoming one of the defining commercial battlegrounds in enterprise AI
2
. For organizations deploying AI agents with access to sensitive systems, the ability to safeguard AI agents against prompt injection, data leakage, and application-level risks will determine whether these tools deliver value or create new vectors for catastrophic failures. As Promptfoo's technology becomes embedded in Frontier, businesses will be watching whether OpenAI's commitment to open-source development holds and how effectively integrated security testing can reduce the regulatory and operational risks that have kept many enterprises cautious about AI adoption.Summarized by
Navi
05 Feb 2026•Technology

23 Feb 2026•Technology

11 Dec 2025•Policy and Regulation

1
Technology

2
Policy and Regulation

3
Policy and Regulation
