Which versions of vLLM are affected by CVE-2026-56340?

Production AI inference systems built on vLLM 0.10.2-0.12.x face availability and integrity risks from an authentication-gated tensor validation bypass that can force service outages or memory…. A patch is available.

How to fix or mitigate CVE-2026-56340?

24 hours: Catalog all vLLM 0.10.2-0.12.x deployments in production and development; confirm API access logging is enabled; prepare rollback procedures. 7 days: Retrieve vendor security patch; execute testing in isolated staging environment to validate inference quality; schedule maintenance windows. 30 days: Execute production patch deployment across all inference clusters; verify multimodal feature functionality and performance baselines post-remediation.

vLLM CVE-2026-56340

Q: What is the CVSS score of CVE-2026-56340?

CVE-2026-56340 has a CVSS 4.0 base score of 8.7 (High). CVSS vector: CVSS:4.0/AV:N/AC:L/AT:N/PR:L/UI:N/VC:H/VI:H/VA:H/SC:N/SI:N/SA:N/E:X/CR:X/IR:X/AR:X/MAV:X/MAC:X/MAT:X/MPR:X/MUI:X/MVC:X/MVI:X/MVA:X/MSC:X/MSI:X/MSA:X/S:X/AU:X/R:X/V:X/RE:X/U:X. EPSS exploitation probability: 0.3%.

EUVD-2026-38129 HIGH

Improper Input Validation (CWE-20)

2026-06-20 VulnCheck

Denial Of Service Buffer Overflow Vllm

8.7

CVSS 4.0 · Vendor: VulnCheck

Severity by source

Vendor (VulnCheck) PRIMARY

8.7 HIGH

CVSS:4.0/AV:N/AC:L/AT:N/PR:L/UI:N/VC:H/VI:H/VA:H/SC:N/SI:N/SA:N/E:X/CR:X/IR:X/AR:X/MAV:X/MAC:X/MAT:X/MPR:X/MUI:X/MVC:X/MVI:X/MVA:X/MSC:X/MSI:X/MSA:X/S:X/AU:X/R:X/V:X/RE:X/U:X

vuln.today AI

7.6 HIGH

Network-reachable inference API needing a low-privilege caller (PR:L), no user interaction; demonstrated impact is crash/DoS (A:H) with only potential, unproven memory corruption, so C and I are L.

3.1 AV:N/AC:L/PR:L/UI:N/S:U/C:L/I:L/A:H

4.0 AV:N/AC:L/AT:N/PR:L/UI:N/VC:L/VI:L/VA:H/SC:N/SI:N/SA:N

Primary rating from Vendor (VulnCheck).

CVSS VectorVendor: VulnCheck

CVSS:4.0/AV:N/AC:L/AT:N/PR:L/UI:N/VC:H/VI:H/VA:H/SC:N/SI:N/SA:N/E:X/CR:X/IR:X/AR:X/MAV:X/MAC:X/MAT:X/MPR:X/MUI:X/MVC:X/MVI:X/MVA:X/MSC:X/MSI:X/MSA:X/S:X/AU:X/R:X/V:X/RE:X/U:X

Attack Vector

Network

Attack Complexity

Low

Privileges Required

Low

User Interaction

None

Scope

Lifecycle Timeline

Source Code Evidence Fetched

Jun 22, 2026 - 05:53 vuln.today

Analysis Generated

Jun 22, 2026 - 05:53 vuln.today

Patch available

Jun 20, 2026 - 20:01 EUVD

DescriptionCVE.org

vLLM versions >= 0.10.2 and < 0.13.0 are missing sparse tensor validation in multimodal embeddings processing. Because PyTorch disables sparse tensor invariant checks by default, an attacker can submit crafted embedding requests with malformed (negative or out-of-bounds) tensor indices, when the prompt-embeds feature is enabled, to trigger crashes or resource exhaustion (denial of service), with potential for out-of-bounds/write-what-where memory corruption. This continues CVE-2025-62164, whose prior fix only disabled the feature by default rather than addressing the root cause.

AnalysisAI

Denial of service and potential memory corruption in vLLM versions 0.10.2 through 0.12.x stems from missing sparse tensor validation in multimodal embeddings processing, allowing authenticated remote users to submit crafted prompt-embedding requests with malformed tensor indices. Because PyTorch disables sparse tensor invariant checks by default, attackers can crash the inference server or exhaust resources, with potential out-of-bounds or write-what-where memory corruption. …

Unlock full vulnerability intelligence

Risk assessment & exploitation conditions
Attack chain visualization
Remediation with exact patch versions
Threat intelligence from 22 sources
Personal watchlist & email alerts

Continue with Google Continue with GitHub

Free forever · No credit card required

Attack ChainAIDerived

Hypothetical attack flow derived from CVE metadata

Recon

Identify vLLM endpoint with prompt-embeds enabled

Delivery

Obtain low-privilege API credential

Exploit

Craft embedding tensor with malformed sparse indices

Install

Submit inference request to multimodal endpoint

Bypass disabled PyTorch invariant checks

Execute

Trigger crash, resource exhaustion, or OOB memory write

Impact

Deny service or corrupt inference process memory