NVIDIA Triton Inference Server Denial-of-Service Vulnerability via Large Compressed Payload

Vulnerability

A denial-of-service vulnerability has been identified in NVIDIA Triton Inference Server, present in all versions prior to 26.01. The issue arises in the HTTP endpoint, where an attacker can cause a denial of service by sending a large compressed payload. Exploiting this vulnerability may lead to a significant disruption of service.

Impact

Exploitation of this vulnerability can cause a denial-of-service condition, leading to increased resource consumption and potential service unavailability.

Remediation

Users are advised to update to NVIDIA Triton Inference Server version 26.01 or later. The updated version can be downloaded from the NVIDIA GitHub repository.

Added: Mar 24, 2026, 9:56 PM
Updated: Mar 24, 2026, 9:56 PM

Vulnerability Rating

Custom Algorithm
spread
0.0
impact
2.5
exploitability
7.4
remediation
0.0
relevance
4.6
threat
0.0
urgency
2.9
incentive
4.2

Our algorithm analyzes dozens of metrics to generate these 8 key vulnerability categories, which are then combined to calculate the overall risk score.