Critical vulnerability reported in llama-cpp-python can lead to remote code execution
Take action: If you use the llama-cpp-python package and expose a chat interface for your model to users, update the package as soon as possible. An exposed chat interface is discoverable, and attackers who find it can exploit this flaw to execute code on your server.
Learn More
A critical vulnerability in the llama-cpp-python package, a popular Python package for large language models (LLMs), has been reported, potentially affecting over 6,000 models and posing a significant risk of supply chain attacks.
The flaw, tracked as CVE-2024-34359 (CVSS score 9.7), is a server-side template injection (SSTI) that can lead to remote code execution (RCE) due to improper use of the Jinja2 template engine. The llama-cpp-python package, which provides Python bindings for the llama.cpp library used to run LLMs such as Meta’s LLaMA, was found to render chat templates stored in model metadata without sanitization or sandboxing, allowing attackers to inject malicious templates.
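The danger is easiest to see in a minimal sketch (a hypothetical illustration of the vulnerability class, not llama-cpp-python's actual code): Jinja2 evaluates whatever expressions a template contains, so a template arriving in untrusted model metadata can climb through Python's object graph when rendered without a sandbox.

```python
from jinja2 import Template

# A "chat template" an attacker could embed in model metadata. Instead of
# formatting chat messages, it walks from a plain string to its class and
# method-resolution order -- the first rung of a classic SSTI-to-RCE chain
# that ends at __subclasses__ and os-level functions.
untrusted_template = "{{ ''.__class__.__mro__ }}"

# The vulnerable pattern: rendering untrusted input with a plain Template.
# Dunder attributes resolve normally, so the payload leaks internal classes.
leaked = Template(untrusted_template).render()
print(leaked)  # exposes the str and object classes to the template author
```

A real payload extends this same traversal until it reaches `__import__` or `os.popen`, at which point rendering the template executes attacker-chosen commands.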
Security researcher Patrick Peng demonstrated a proof-of-concept exploit on Hugging Face, showing how compromised models could execute arbitrary code upon loading or initiating a chat session.
A fix for CVE-2024-34359 was released in llama-cpp-python version 0.2.72, which adds input validation and sandboxing during template rendering. Users of affected versions are strongly advised to upgrade to 0.2.72 or later to mitigate this critical risk.
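The sandboxing approach can be sketched as follows (a simplified illustration, not the package's actual patch): rendering untrusted templates through Jinja2's `ImmutableSandboxedEnvironment`, which refuses access to the underscore-prefixed attributes that SSTI payloads rely on.

```python
from jinja2.exceptions import SecurityError
from jinja2.sandbox import ImmutableSandboxedEnvironment

# The same attacker-controlled template as before.
untrusted_template = "{{ ''.__class__.__mro__ }}"

# The hardened pattern: a sandboxed environment rejects unsafe attribute
# access at render time and raises SecurityError instead of evaluating it.
env = ImmutableSandboxedEnvironment()
try:
    env.from_string(untrusted_template).render()
    print("rendered")  # never reached for this payload
except SecurityError as exc:
    print(f"blocked: {exc}")
```

Sandboxing is defense in depth; upgrading remains the primary mitigation, e.g. `pip install --upgrade llama-cpp-python` to pull in 0.2.72 or later.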