CVE-2026-34756

6.5 MEDIUM

Published: April 06, 2026 Modified: April 20, 2026

Description

vLLM is an inference and serving engine for large language models (LLMs). From 0.1.0 to before 0.19.0, a Denial of Service vulnerability exists in the vLLM OpenAI-compatible API server. Due to the lack of an upper bound validation on the n parameter in the ChatCompletionRequest and CompletionRequest Pydantic models, an unauthenticated attacker can send a single HTTP request with an astronomically large n value. This completely blocks the Python asyncio event loop and causes immediate Out-Of-Memory crashes by allocating millions of request object copies in the heap before the request even reaches the scheduling queue. This vulnerability is fixed in 0.19.0.

AI Explanation

Get an AI-powered plain-language explanation of this vulnerability and remediation steps.

Login to generate AI explanation

CVSS v3.x Details

0.0 Low Medium High Critical 10.0

Vector String

CVSS:3.1/AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H

References to Advisories, Solutions, and Tools

Patch Vendor Advisory Exploit Third Party Advisory

https://github.com/vllm-project/vllm/commit/b111f8a61f100fdca08706f41f29ef3548de7380

Source: security-advisories@github.com

Patch

https://github.com/vllm-project/vllm/pull/37952

Source: security-advisories@github.com

Issue Tracking Patch

https://github.com/vllm-project/vllm/security/advisories/GHSA-3mwp-wvh9-7528

Source: security-advisories@github.com

Patch Vendor Advisory

3 reference(s) from NVD

Quick Stats

CVSS v3 Score

6.5 / 10.0

EPSS (Exploit Probability)

0.1%

18th percentile

Exploitation Status

Not in CISA KEV

Weaknesses (CWE)

CWE-770

Affected Vendors

vllm

Related CVEs

CVE-2026-9278 N/A

CVE-2026-8935 N/A

CVE-2026-8386 N/A

CVE-2026-8385 N/A

CVE-2026-12223 5.5