BaiduTencent AIFeb 22, 2026arXiv:2602.19320

Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations

Dongming Jiang, Yi Li, Songtao Wei, Jinxin Yang, Jinxin Yang, Ayushi Kishore, Ayushi Kishore, Alysa Zhao, Alysa Zhao, Dingyi Kang, Xue Hu, Xu Hu, Feng Chen, Qiannan Li, Qiannan Li, Bingzhe Li, Bingzhe Li

AI Summary

This paper presents a taxonomy of agentic memory (MAG) systems based on four memory structures and analyzes the empirical limitations of these systems. It identifies key pain points such as benchmark saturation, metric validity issues, backbone-dependent accuracy, and system-level overhead. The analysis clarifies why current agentic memory systems underperform expectations and suggests directions for improved evaluation and design.

Key Contribution

Key contribution not extracted.

Abstract

Agentic memory systems enable large language model (LLM) agents to maintain state across long interactions, supporting long-horizon reasoning and personalization beyond fixed context windows. Despite rapid architectural development, the empirical foundations of these systems remain fragile: existing benchmarks are often underscaled, evaluation metrics are misaligned with semantic utility, performance varies significantly across backbone models, and system-level costs are frequently overlooked. This survey presents a structured analysis of agentic memory from both architectural and system perspectives. We first introduce a concise taxonomy of MAG systems based on four memory structures. Then, we analyze key pain points limiting current systems, including benchmark saturation effects, metric validity and judge sensitivity, backbone-dependent accuracy, and the latency and throughput overhead introduced by memory maintenance. By connecting the memory structure to empirical limitations, this survey clarifies why current agentic memory systems often underperform their theoretical promise and outlines directions for more reliable evaluation and scalable system design.

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References80

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations

Related Papers