Skip to main content

NVIDIA TensorRT-LLM EUVD-2025-209903

| CVE-2025-33255 HIGH
Deserialization of Untrusted Data (CWE-502)
2026-05-20 nvidia GHSA-gvr5-23jj-pf9p
7.5
CVSS 3.1
Share

CVSS VectorNVD

CVSS:3.1/AV:L/AC:H/PR:H/UI:N/S:C/C:H/I:H/A:H
Attack Vector
Local
Attack Complexity
High
Privileges Required
High
User Interaction
None
Scope
Changed
Confidentiality
High
Integrity
High
Availability
High

Lifecycle Timeline

1
Analysis Generated
May 20, 2026 - 04:00 vuln.today

DescriptionNVD

NVIDIA TRT-LLM for any platform contains a vulnerability in MPI server, where an attacker could cause an unsafe deserialization. A successful exploit of this vulnerability might lead to code execution, denial of service, data tampering, and information disclosure.

AnalysisAI

Unsafe deserialization in NVIDIA TensorRT-LLM's MPI server component allows a high-privileged local attacker to achieve code execution, denial of service, data tampering, or information disclosure on systems running the affected library. The CVSS 7.5 score reflects high impact but constrained exploitability (AV:L/AC:H/PR:H), and no public exploit identified at time of analysis. Scope change (S:C) indicates compromise can extend beyond the vulnerable component to impact other resources on the host.

Technical ContextAI

TensorRT-LLM (TRT-LLM) is NVIDIA's open-source library for optimizing large language model inference on NVIDIA GPUs, commonly deployed in multi-GPU and multi-node configurations using MPI (Message Passing Interface) for distributed workload coordination. The vulnerability resides in the MPI server component, which evidently accepts serialized objects from peer processes and reconstructs them without sufficient validation. The root cause is CWE-502 (Deserialization of Untrusted Data) - a class of flaw where attacker-influenced serialized payloads are reconstructed into in-memory objects, allowing gadget chains or type confusion to drive arbitrary code paths during the deserialization process itself, before any application-level authorization checks run.

RemediationAI

Patch available per vendor advisory - consult NVIDIA's official security bulletin at https://nvidia.custhelp.com/app/answers/detail/a_id/5805 for the exact fixed TensorRT-LLM version and upgrade to that release. Until upgrading is feasible, compensating controls include restricting MPI traffic to a dedicated isolated network segment with strict firewall rules (block external access to MPI ports, typically in the 1024-65535 ephemeral range used by Open MPI/MPICH), enforcing host-level authentication and authorization for any user able to join the MPI communicator, and avoiding multi-tenant TRT-LLM deployments where untrusted workloads share the same MPI fabric - the trade-off being reduced flexibility for shared GPU cluster usage. Additionally, monitor MPI server processes for unexpected child processes or outbound connections as a detective control, accepting the limitation that exploitation may occur in-memory without spawning new processes.

Share

EUVD-2025-209903 vulnerability details – vuln.today

This site uses cookies essential for authentication and security. No tracking or analytics cookies are used. Privacy Policy