Google Cloud Vertex AI Connection Desynchronization Vulnerability in HTTP Proxies

Vulnerability

A connection desynchronization vulnerability has been identified in Google Cloud Vertex AI, specifically within certain third-party models when using streaming requests. This issue arose from some internal HTTP proxies not properly managing requests with an 'Expect: 100-continue' header, leading to a misrouting of responses between recipients. As a result, a response intended for one request could be delivered as the response for a subsequent request. The vulnerability affected various models on the Vertex AI API, but Google models like Gemini were not impacted.

Impact

The vulnerability caused a misrouting of streaming responses, where responses intended for one request were incorrectly delivered to another, potentially leading to unintended data exposure or confusion in response handling.

Remediation

Google has implemented fixes to address the desynchronization issue caused by the 'Expect: 100-continue' header. These fixes were rolled out for different models on separate schedules, with all affected surfaces remediated by September 28, 2025. No action is required from users.

Added: Oct 22, 2025, 10:16 AM
Updated: Oct 22, 2025, 10:16 AM

Vulnerability Rating

Custom Algorithm
spread
0.0
impact
0.6
exploitability
7.4
remediation
0.0
relevance
0.8
threat
0.0
urgency
0.0
incentive
5.8

Our algorithm analyzes dozens of metrics to generate these 8 key vulnerability categories, which are then combined to calculate the overall risk score.