NVIDIA Triton Inference Server Python Backend Shared Memory Limit Vulnerability Allowing Information Disclosure

Vulnerability

A vulnerability exists in the Python backend of NVIDIA Triton Inference Server for Windows and Linux. This issue allows an attacker to exceed the shared memory limit by sending an excessively large request. Exploiting this vulnerability could lead to unauthorized information disclosure.

Impact

Successful exploitation may result in the unintentional release of sensitive information.

Remediation

Users can update to NVIDIA Triton Inference Server version 25.07 or later. For guidance on secure deployment, refer to the NVIDIA Triton Inference Server Secure Deployment Considerations Guide.

Added: Aug 6, 2025, 1:21 PM
Updated: Aug 6, 2025, 1:21 PM

Vulnerability Rating

Custom Algorithm
spread
0.0
impact
2.5
exploitability
7.4
remediation
7.7
relevance
0.3
threat
0.0
urgency
2.9
incentive
5.8

Our algorithm analyzes dozens of metrics to generate these 8 key vulnerability categories, which are then combined to calculate the overall risk score.