Advisory

Multiple critical vulnerabilities impacting LangChain generative AI framework

Take action: If you are using the LangChain generative AI framework, apply the available patches promptly. Even if an exploit is complex today, the longer the framework remains unpatched, the more time attackers have to develop working attacks. Blocklists help as a stopgap, but are not always practical.


Learn More

Researchers from Palo Alto Networks have identified two vulnerabilities in LangChain, a popular open-source generative AI framework. LangChain is an open-source library designed to simplify the usage of large language models (LLMs) by providing composable building blocks, connectors to models, and integrations with third-party services. Its partner packages include major names in cloud, AI, databases, and tech development.

The reported vulnerabilities could allow attackers to access sensitive data and execute arbitrary code, respectively.

These vulnerabilities are tracked as:

  • CVE-2023-46229 (CVSS score: 9.3) - Server-side request forgery (SSRF) vulnerability in LangChain versions earlier than 0.0.317. This flaw allows attackers to obtain sensitive information from intranets, potentially bypassing access controls. It's patched in LangChain version 0.0.317. As mitigation, LangChain added the function _extract_scheme_and_domain and an allowlist to control permitted domains.
    • LangChain's SitemapLoader feature, which loads documents from third-party websites, lacked restrictions on the scope of sitemap access, resulting in an SSRF vulnerability. This could be exploited by including URLs to intranet resources in a sitemap, leading to data exfiltration.

      • Code Flow:
        • SitemapLoader class extends WebBaseLoader, accepting a URL (web_path) and parsing it as a sitemap.
        • The scrape_all method uses aiohttp.ClientSession.get to fetch content without filtering or sanitization.
        • Malicious actors could exploit this to access sensitive intranet APIs and obtain confidential information.
  • CVE-2023-44467 (CVSS score: 9.8) - Prompt injection vulnerability in LangChain Experimental versions before 0.0.306. This flaw affects the PALChain feature, allowing attackers to execute harmful commands or code by exploiting PALChain's processing capabilities. It's patched in LangChain Experimental version 0.0.306. As a mitigation, LangChain expanded the blocklist to include additional functions and methods to prevent unauthorized code execution.
    • The PALChain feature of LangChain Experimental converts user queries into executable Python code. This feature is vulnerable to prompt injection attacks, allowing the execution of harmful commands.

      • Code Flow:
        • The from_math_prompt method translates user queries into Python code.
        • Malicious inputs can bypass security checks and execute unauthorized commands using techniques like the built-in __import__() function.
        • LangChain's initial blocklist included only four functions (system, exec, execfile, and eval), which was insufficient to prevent all potential bypasses.
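The weakness of a name-only blocklist can be sketched in a few lines. The following is an illustrative example, not LangChain's actual implementation: it scans generated code for direct calls to the four originally blocked functions, and shows how the built-in __import__() technique mentioned above slips past such a check.

```python
# Sketch of why a name-only blocklist is insufficient for screening
# generated Python code. Illustrative only; not LangChain's actual code.
import ast

# The four entries in the original blocklist described above.
BLOCKLIST = {"system", "exec", "execfile", "eval"}

def names_called(code: str) -> set:
    """Collect the names of all plainly named functions called in the code."""
    called = set()
    for node in ast.walk(ast.parse(code)):
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
            called.add(node.func.id)
    return called

def passes_blocklist(code: str) -> bool:
    """Return True if no blocklisted function is called directly."""
    return not (names_called(code) & BLOCKLIST)

# A direct call to a blocked function is caught:
assert not passes_blocklist("eval('1+1')")
# But the same effect via __import__ evades the four-entry list:
assert passes_blocklist("__import__('os').popen('id').read()")
```

This is why the patch expanded the blocklist: any screening based on enumerating dangerous names must anticipate indirect routes to the same capability.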

Users should update to the patched versions of LangChain (0.0.317 or later) and LangChain Experimental (0.0.306 or later), which apply the allowlist and expanded blocklist to prevent exploitation of the vulnerable functions.
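The allowlist approach taken for the SSRF fix can be sketched as follows. This is a minimal illustration in the spirit of the _extract_scheme_and_domain mitigation described above, not LangChain's actual code; the permitted domains below are hypothetical examples.

```python
# Sketch of a scheme-and-domain allowlist check applied to sitemap URLs
# before fetching. Illustrative only; not LangChain's actual implementation.
from urllib.parse import urlparse

# Hypothetical allowlist of permitted (scheme, domain) pairs.
ALLOWED = {("https", "example.com")}

def extract_scheme_and_domain(url: str) -> tuple:
    """Split a URL into its scheme and network location."""
    parsed = urlparse(url)
    return (parsed.scheme, parsed.netloc)

def url_allowed(url: str) -> bool:
    """Permit fetching only if the URL's scheme and domain are allowlisted."""
    return extract_scheme_and_domain(url) in ALLOWED

# A URL on the permitted domain passes:
assert url_allowed("https://example.com/sitemap.xml")
# An intranet URL smuggled into a sitemap is rejected:
assert not url_allowed("http://10.0.0.5/internal-api/data")
```

Checking the scheme as well as the domain matters: it blocks downgrades to plain HTTP and non-web schemes alongside requests to unlisted hosts.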
