NVIDIA Triton Inference Server Model Loading API Integer Overflow Vulnerability Leading to Denial-of-Service

Vulnerability

A vulnerability exists in the model loading API of NVIDIA Triton Inference Server, where a user could induce an integer overflow or wraparound error by uploading a model with an excessively large file size. This oversized file can overflow an internal variable, potentially leading to a denial-of-service condition.

Impact

Exploitation of this vulnerability can cause a denial-of-service, disrupting the availability of the Triton Inference Server.

Remediation

Users can upgrade to NVIDIA Triton Inference Server version 24.12 to address this vulnerability. The latest release can be downloaded from the Triton Inference Server Releases page on GitHub.

Added: Jun 9, 2025, 7:46 PM
Updated: Jun 9, 2025, 7:46 PM

Vulnerability Rating

Custom Algorithm
spread
0.0
impact
2.5
exploitability
4.8
remediation
7.7
relevance
0.0
threat
0.0
urgency
2.9
incentive
1.7

Our algorithm analyzes dozens of metrics to generate these 8 key vulnerability categories, which are then combined to calculate the overall risk score.