Primary

Context

vLLM Hash Collision Vulnerability in Prefix Caching Allowing Cache Poisoning

Vulnerability

Patched

A vulnerability in vLLM, a high-throughput inference engine for large language models, arises from hash collisions in prefix caching. This issue, present in vLLM versions prior to 0.7.2, is exploited by using maliciously crafted prompts that take advantage of Python's built-in hash function. As of Python 3.12, the hash value for 'None' has become a predictable constant, increasing the risk of collisions. Exploiting this vulnerability could lead to unintended behavior by reusing cached responses generated from different content, potentially disrupting the accuracy of the model's output.

Impact

Exploitation of this vulnerability allows for prefix cache reuse based on predictable hash collisions, which can interfere with the accuracy of responses in a shared model inference environment.

Reproduction

The vulnerability can be reproduced by using vLLM versions prior to 0.7.2 with Python 3.12. Malicious prompts can be crafted to collide by taking advantage of the predictable hashing of 'None', leading to cache entries being mixed up during processing. This can cause responses to reflect cached data from different prompts, creating 'mixed summaries' or incorrect outputs.

Remediation

Users are advised to upgrade to vLLM version 0.7.2 or later, where this vulnerability has been fixed.

Added: Jun 9, 2025, 7:46 PM

Updated: Jun 9, 2025, 7:46 PM

Vulnerability Rating

Custom Algorithm

spread

2.6

impact

0.6

exploitability

5.2

remediation

7.7

relevance

0.0

threat

1.6

urgency

2.9

incentive

1.7

Our algorithm analyzes dozens of metrics to generate these 8 key vulnerability categories, which are then combined to calculate the overall risk score.

Vulnerability Rating

Custom Algorithm

spread

2.6

impact

0.6

exploitability

5.2

remediation

7.7

relevance

0.0

threat

1.6

urgency

2.9

incentive

1.7

Our algorithm analyzes dozens of metrics to generate these 8 key vulnerability categories, which are then combined to calculate the overall risk score.

vLLM Hash Collision Vulnerability in Prefix Caching Allowing Cache Poisoning

Vulnerability

Impact

Reproduction

Remediation

Affected Products

vLLM

CVSS Scores

References

Vulnerability Rating

Vulnerability Rating