Critical Vulnerabilities in Nvidia's Triton Inference Server Expose AI Models to Potential Attacks


Security researchers uncover a chain of high-severity vulnerabilities in Nvidia's Triton Inference Server that could lead to remote code execution and AI model theft. Nvidia releases patches to address the issues.

Vulnerability Discovery in Nvidia's Triton Inference Server

Security researchers from Wiz have uncovered a chain of high-severity vulnerabilities in Nvidia's Triton Inference Server, an open-source platform designed for running AI models at scale. These flaws, if exploited, could potentially lead to remote code execution (RCE) and expose organizations to significant risks [1].

Source: Dataconomy

The Vulnerability Chain

The researchers identified three critical vulnerabilities in the Triton Inference Server's Python backend:

  1. CVE-2025-23320 (CVSS score: 7.5): A flaw that allows attackers to exceed the shared memory limit by sending a very large request, revealing the unique name of the backend's internal IPC shared memory region [2] (see the hardening sketch after this list).

  2. CVE-2025-23319 (CVSS score: 8.1): An out-of-bounds write vulnerability that can be exploited using the information leaked from CVE-2025-23320.

  3. CVE-2025-23334 (CVSS score: 5.9): An out-of-bounds read vulnerability that, when combined with the other flaws, completes the attack chain.
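
Because the chain starts with an oversized request, one generic hardening step (in addition to patching, never as a substitute for it) is to cap request body sizes at a gateway in front of the inference endpoint. The sketch below is a minimal, hypothetical WSGI-style guard; the 10 MB limit, the `size_limit_middleware` name, and the idea of fronting Triton with such a proxy are assumptions for illustration, not an Nvidia-documented mitigation.

```python
# Minimal sketch of a request-size guard for a gateway placed in front of an
# inference endpoint. Purely illustrative; the limit below is an assumption.
MAX_BODY_BYTES = 10 * 1024 * 1024  # assumed 10 MB cap for this example


def size_limit_middleware(app, max_bytes=MAX_BODY_BYTES):
    """Wrap a WSGI application and reject requests with oversized bodies."""
    def guarded(environ, start_response):
        try:
            declared = int(environ.get("CONTENT_LENGTH") or 0)
        except ValueError:
            declared = 0
        if declared > max_bytes:
            start_response("413 Payload Too Large",
                           [("Content-Type", "text/plain")])
            return [b"request body exceeds configured limit\n"]
        return app(environ, start_response)

    return guarded
```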

Potential Impact and Risks

If successfully exploited, these vulnerabilities could allow an unauthenticated attacker to gain complete control of the Triton Inference Server. The potential consequences include:

  1. Theft of valuable AI models
  2. Exposure of sensitive data
  3. Manipulation of AI model responses
  4. Establishment of a foothold for deeper network penetration [3]

Source: The Hacker News

Widespread Usage and Affected Systems

Triton Inference Server is used by numerous organizations for AI/ML workloads, including major companies such as Microsoft, Amazon, Oracle, Siemens, and American Express. A 2021 press release indicated that over 25,000 companies use Nvidia's AI stack [4].

The vulnerabilities affect both Windows and Linux systems running the Triton Inference Server [3].

Nvidia's Response and Mitigation

Nvidia has addressed these vulnerabilities in version 25.07 of the Triton Inference Server, released on August 4, 2025. The company strongly recommends that all users update to this latest version immediately [1].

Nir Ohfeld, Wiz's Head of Vulnerability Research, emphasized the importance of updating: "The single most important step is to update to the patched version of the Nvidia Triton Inference Server (version 25.07 or newer). This directly fixes the entire vulnerability chain." [5]
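
As a quick way to check which build a deployment is actually running, Triton's HTTP API exposes a server-metadata endpoint (GET /v2, on port 8000 by default). The sketch below, which assumes a locally reachable endpoint at a placeholder URL, simply prints the reported server name and version; note that this field reports the Triton core version rather than the 25.07 container tag, so operators would still need to map it against Nvidia's release notes before concluding a host is patched.

```python
# Minimal sketch: query a Triton deployment's server-metadata endpoint and
# print the reported version. The URL below is a placeholder for this example.
import json
import urllib.request

TRITON_URL = "http://localhost:8000/v2"  # assumed host/port; adjust to your deployment

with urllib.request.urlopen(TRITON_URL, timeout=5) as resp:
    meta = json.load(resp)

print(f"server:  {meta.get('name')}")
print(f"version: {meta.get('version')}")
```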

Broader Implications for AI Security

Source: TechRadar

This incident highlights the growing importance of security in AI infrastructure. As companies increasingly deploy AI and machine learning technologies, securing the underlying infrastructure becomes paramount. The discovery of these vulnerabilities underscores the need for a defense-in-depth approach, where security is considered at every layer of an application [1].

While there is currently no evidence of these vulnerabilities being exploited in the wild, the widespread use of Nvidia's Triton Inference Server in AI workloads makes it a potentially attractive target for attackers [5].
