arXiv

From Untrusted Input to Trusted Memory: A Systematic Study of Memory Poisoning Attacks in LLM Agents

Title: From Untrusted Input to Trusted Memory: A Systematic Study of Memory Poisoning Attacks in LLM Agents

Abstract:

Memory serves as a foundational element for AI agents, allowing them to build knowledge bases over time and enhance their operational performance through repeated interactions. Nevertheless, the persistence of this memory creates a vulnerability to memory poisoning, a threat where a single malicious memory update can permanently alter an agent’s future behavior. This paper offers a comprehensive analysis of memory poisoning within Large Language Model (LLM) agents. We pinpoint four distinct channels for memory writes and nine structural weaknesses—ranging from model capabilities and system prompt configurations to overall agent architecture—that render these channels susceptible to exploitation. Leveraging these identified vulnerabilities, we establish a taxonomy comprising six categories of memory poisoning attacks. Additionally, we introduce MPBench, a specialized benchmark designed to assess the efficacy of memory poisoning attacks, demonstrating that agents configured for aggressive memory writing and retrieval are particularly prone to exploitation. Our results also indicate that current prompt injection defenses are insufficient against memory poisoning threats. These insights lay the groundwork for better comprehension and mitigation of memory poisoning risks facing AI agents.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia
Bloomberg

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia

Cerebras confirmed partnerships with all major AI hardware vendors except Nvidia. This broad engagement positions Cerebr...

Putin Turns Russia’s AI Future Into a Kremlin Family Business
Bloomberg

Putin Turns Russia’s AI Future Into a Kremlin Family Business

Putin is consolidating Russia’s AI ambitions into a Kremlin family business, effectively turning the sector into a dynas...

Reuters

Meta repeatedly pushes back new AI model release for developers, WSJ says

Meta has repeatedly delayed the release of its new AI model for developers, according to the WSJ. This ongoing postponem...