Critical Vulnerabilities in Nvidia's Triton Inference Server Expose AI Models to Potential Attacks

Security researchers uncover a chain of high-severity vulnerabilities in Nvidia's Triton Inference Server that could lead to remote code execution and AI model theft. Nvidia releases patches to address the issues.

Vulnerability Discovery in Nvidia's Triton Inference Server

Security researchers from Wiz have uncovered a chain of high-severity vulnerabilities in Nvidia's Triton Inference Server, an open-source platform designed for running AI models at scale. If exploited, these flaws could lead to remote code execution (RCE) and expose organizations to significant risks 1.

The Vulnerability Chain

The researchers identified three vulnerabilities in the Triton Inference Server's Python backend that can be chained together:

  1. CVE-2025-23320 (CVSS score: 7.5): A flaw that allows attackers to exceed the shared memory limit by sending a very large request, revealing the unique name of the backend's internal IPC shared memory region 2.

  2. CVE-2025-23319 (CVSS score: 8.1): An out-of-bounds write vulnerability that can be exploited using the information leaked from CVE-2025-23320 2.

  3. CVE-2025-23334 (CVSS score: 5.9): An out-of-bounds read vulnerability that, when combined with the other flaws, completes the attack chain 2.
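To put the three scores in context, the short sketch below maps each one to a qualitative severity band. It assumes the scores follow the CVSS v3.x rating scale (the article does not state the CVSS version), and the function name `cvss_severity` is purely illustrative.

```python
def cvss_severity(score: float) -> str:
    """Map a CVSS base score to its qualitative rating.

    Assumption: the CVSS v3.x scale applies (None 0.0, Low 0.1-3.9,
    Medium 4.0-6.9, High 7.0-8.9, Critical 9.0-10.0).
    """
    if not 0.0 <= score <= 10.0:
        raise ValueError(f"CVSS score out of range: {score}")
    if score == 0.0:
        return "None"
    if score < 4.0:
        return "Low"
    if score < 7.0:
        return "Medium"
    if score < 9.0:
        return "High"
    return "Critical"


# Scores as reported in the article.
reported = {
    "CVE-2025-23320": 7.5,
    "CVE-2025-23319": 8.1,
    "CVE-2025-23334": 5.9,
}

for cve, score in reported.items():
    print(f"{cve}: {score} ({cvss_severity(score)})")
```

Under that assumption, two of the flaws rate as High and one as Medium, which is a reminder that the danger comes from chaining them rather than from any single score.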

Potential Impact and Risks

If successfully exploited, these vulnerabilities could allow an unauthenticated attacker to gain complete control of the Triton Inference Server. The potential consequences include:

  1. Theft of valuable AI models
  2. Exposure of sensitive data
  3. Manipulation of AI model responses
  4. Establishment of a foothold for deeper network penetration 3

Widespread Usage and Affected Systems

Triton Inference Server is used by numerous organizations for AI/ML workloads, including major companies such as Microsoft, Amazon, Oracle, Siemens, and American Express. A 2021 press release indicated that over 25,000 companies use Nvidia's AI stack 4.

The vulnerabilities affect both Windows and Linux systems running the Triton Inference Server 3.

Nvidia's Response and Mitigation

Nvidia has addressed these vulnerabilities in version 25.07 of the Triton Inference Server, released on August 4, 2025. The company strongly recommends that all users update to this version immediately 1.

Nir Ohfeld, Wiz's Head of Vulnerability Research, emphasized the importance of updating: "The single most important step is to update to the patched version of the Nvidia Triton Inference Server (version 25.07 or newer). This directly fixes the entire vulnerability chain." 5
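As a rough illustration of how a team might audit its deployments against that advice, the sketch below checks whether a container image tag refers to release 25.07 or newer. It assumes images are tagged in the `<yy.mm>-py3` style (for example `nvcr.io/nvidia/tritonserver:25.07-py3`). The helper names `parse_release` and `is_patched` are hypothetical and not part of any Nvidia tooling.

```python
import re

# First patched release according to the article: version 25.07.
PATCHED_RELEASE = (25, 7)


def parse_release(image_ref: str):
    """Return (year, month) from a tag like 'tritonserver:25.07-py3'.

    Assumption: the tag starts with a two-digit year and month separated
    by a dot. Returns None when the tag does not match (e.g. 'latest').
    """
    tag = image_ref.rsplit(":", 1)[-1]
    match = re.match(r"(\d{2})\.(\d{2})", tag)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def is_patched(image_ref: str) -> bool:
    """True only if the release can be parsed and is 25.07 or newer."""
    release = parse_release(image_ref)
    return release is not None and release >= PATCHED_RELEASE


if __name__ == "__main__":
    for ref in (
        "nvcr.io/nvidia/tritonserver:25.06-py3",
        "nvcr.io/nvidia/tritonserver:25.07-py3",
        "nvcr.io/nvidia/tritonserver:latest",
    ):
        status = "patched" if is_patched(ref) else "needs review"
        print(f"{ref}: {status}")
```

Tags that cannot be parsed are deliberately treated as unpatched, so an ambiguous reference such as `latest` gets flagged for manual review instead of being silently passed.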

Broader Implications for AI Security

This incident highlights the growing importance of security in AI infrastructure. As companies increasingly deploy AI and machine learning technologies, securing the underlying infrastructure becomes paramount. The discovery of these vulnerabilities underscores the need for a defense-in-depth approach, where security is considered at every layer of an application 1.

While there is currently no evidence of these vulnerabilities being exploited in the wild, the widespread use of Nvidia's Triton Inference Server in AI workloads makes it a potentially attractive target for attackers 5.
