NVIDIA Triton Inference Server Python Backend Shared Memory Limit Vulnerability Allowing Information Disclosure
Vulnerability
The Python backend of NVIDIA Triton Inference Server for Windows and Linux contains a vulnerability in which an attacker can cause the shared memory limit to be exceeded by sending a very large request. A successful exploit could lead to information disclosure.
Impact
Successful exploitation may expose sensitive information to an unauthorized party.
Remediation
Update to NVIDIA Triton Inference Server version 25.07 or later. For guidance on secure deployment, refer to the NVIDIA Triton Inference Server Secure Deployment Considerations Guide.
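As a rough aid for auditing deployments against the 25.07 threshold above, the following is a minimal sketch that checks whether a Triton release tag is at or past the fixed version. It assumes the deployment is identified by a container-style release tag of the form `YY.MM` with an optional suffix (for example `25.07-py3`); the helper names `parse_release` and `is_patched` are hypothetical and not part of any NVIDIA tooling.

```python
import re

# Minimum fixed release (25.07), taken from the remediation guidance above.
MIN_FIXED_RELEASE = (25, 7)


def parse_release(tag: str) -> tuple:
    """Extract (year, month) from a tag such as '25.07' or '25.07-py3'.

    Assumption: the tag starts with a two-digit year, a dot, and a
    two-digit month. Anything else (e.g. 'latest') is rejected.
    """
    match = re.match(r"^(\d{2})\.(\d{2})", tag)
    if match is None:
        raise ValueError(f"unrecognized release tag: {tag!r}")
    return int(match.group(1)), int(match.group(2))


def is_patched(tag: str) -> bool:
    """Return True if the release tag is 25.07 or later."""
    return parse_release(tag) >= MIN_FIXED_RELEASE
```

For instance, `is_patched("25.06-py3")` evaluates to `False` and `is_patched("25.07-py3")` to `True`. A floating tag such as `latest` raises `ValueError` rather than being assumed safe, since it does not identify a specific release.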
Added: Aug 6, 2025, 1:21 PM
Updated: Aug 6, 2025, 1:21 PM
Vulnerability Rating
Custom Algorithm
Spread: 0.0
Impact: 2.5
Exploitability: 7.4
Remediation: 7.7
Relevance: 0.3
Threat: 0.0
Urgency: 2.9
Incentive: 5.8
Our algorithm analyzes dozens of metrics to generate these 8 key vulnerability categories, which are then combined to calculate the overall risk score.
