MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval

arXiv — cs.LGMonday, December 22, 2025 at 5:00:00 AM
  • The paper titled 'MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval' introduces a novel attack method that targets the long-term memory of Large Language Model (LLM) agents, compromising their behavior by embedding malicious experiences. This indirect injection attack exploits the trust between an agent's reasoning core and its past experiences, raising concerns about the security of LLMs in autonomous applications.
  • This development is significant as it highlights a critical vulnerability in LLMs, which increasingly rely on long-term memory and Retrieval-Augmented Generation (RAG) for improved performance. The introduction of MemoryGraft underscores the need for enhanced security measures to protect against such sophisticated attacks that could undermine the reliability of AI systems in various applications.
  • The emergence of MemoryGraft aligns with ongoing discussions about the security of AI agents, particularly in the context of behavioral backdoor detection and prompt injection vulnerabilities. As AI systems become more integrated into decision-making processes, the risks associated with compromised memory and trust boundaries are becoming more pronounced, necessitating a reevaluation of existing defenses and the development of robust frameworks to safeguard against these threats.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
SwiftMem: Fast Agentic Memory via Query-aware Indexing
PositiveArtificial Intelligence
SwiftMem has been introduced as a query-aware agentic memory system designed to enhance the efficiency of large language model (LLM) agents by enabling sub-linear retrieval through specialized indexing techniques. This system addresses the limitations of existing memory frameworks that rely on exhaustive retrieval methods, which can lead to significant latency issues as memory storage expands.
PrivGemo: Privacy-Preserving Dual-Tower Graph Retrieval for Empowering LLM Reasoning with Memory Augmentation
PositiveArtificial Intelligence
PrivGemo has been introduced as a privacy-preserving framework designed for knowledge graph (KG)-grounded reasoning, addressing the risks associated with using private KGs in large language models (LLMs). This dual-tower architecture maintains local knowledge while allowing remote reasoning through an anonymized interface, effectively mitigating semantic and structural exposure.
ClimateIQA: A New Dataset and Benchmark to Advance Vision-Language Models in Meteorology Anomalies Analysis
PositiveArtificial Intelligence
A new dataset named ClimateIQA has been introduced to enhance the capabilities of Vision-Language Models (VLMs) in analyzing meteorological anomalies. This dataset, which includes 26,280 high-quality images, aims to address the challenges faced by existing models like GPT-4o and Qwen-VL in interpreting complex meteorological heatmaps characterized by irregular shapes and color variations.
LLaVAction: evaluating and training multi-modal large language models for action understanding
PositiveArtificial Intelligence
The research titled 'LLaVAction' focuses on evaluating and training multi-modal large language models (MLLMs) for action understanding, reformulating the EPIC-KITCHENS-100 dataset into a benchmark for MLLMs. The study reveals that leading MLLMs struggle with recognizing correct actions when faced with difficult distractors, highlighting a gap in their fine-grained action understanding capabilities.
DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving
PositiveArtificial Intelligence
DriveRX has been introduced as a vision-language reasoning model aimed at enhancing cross-task autonomous driving by addressing the limitations of traditional end-to-end models, which struggle with complex scenarios due to a lack of structured reasoning. This model is part of a broader framework called AutoDriveRL, which optimizes four core tasks through a unified training approach.
STO-RL: Offline RL under Sparse Rewards via LLM-Guided Subgoal Temporal Order
PositiveArtificial Intelligence
A new offline reinforcement learning (RL) framework named STO-RL has been proposed to enhance policy learning from pre-collected datasets, particularly in long-horizon tasks with sparse rewards. By utilizing large language models (LLMs) to generate temporally ordered subgoal sequences, STO-RL aims to improve the efficiency of reward shaping and policy optimization.
When KV Cache Reuse Fails in Multi-Agent Systems: Cross-Candidate Interaction is Crucial for LLM Judges
NeutralArtificial Intelligence
Recent research highlights that while KV cache reuse can enhance efficiency in multi-agent large language model (LLM) systems, it can negatively impact the performance of LLM judges, leading to inconsistent selection behaviors despite stable end-task accuracy.
LoFT-LLM: Low-Frequency Time-Series Forecasting with Large Language Models
PositiveArtificial Intelligence
The introduction of LoFT-LLM, a novel forecasting pipeline, aims to enhance time-series predictions in finance and energy sectors by integrating low-frequency learning with large language models (LLMs). This approach addresses challenges posed by limited training data and high-frequency noise, allowing for more accurate long-term trend analysis.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about