Mar 19, 2026 | arXiv:2603.18631

D-Mem: A Dual-Process Memory System for LLM Agents

AI Summary

The paper introduces D-Mem, a dual-process memory system for LLM agents that combines fast vector retrieval with a slower, more exhaustive "Full Deliberation" module to improve long-horizon reasoning. D-Mem uses a Multi-dimensional Quality Gating policy to dynamically switch between these two processes based on query complexity, balancing accuracy and computational cost. Experiments on LoCoMo and RealTalk benchmarks show that D-Mem outperforms static retrieval baselines and approaches the performance of full deliberation while significantly reducing computational overhead.
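The switching behaviour described above can be illustrated with a minimal sketch. It is not the paper's implementation: the Draft type, the quality dimensions ("relevance", "completeness"), the threshold values, and the toy fast and slow paths are all hypothetical stand-ins, since the summary does not specify how the gate scores an answer.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class Draft:
    """A fast-path answer plus hypothetical quality scores in [0, 1]."""
    text: str
    scores: Dict[str, float]


def passes_gate(scores: Dict[str, float], thresholds: Dict[str, float]) -> bool:
    # Every gated dimension must meet its threshold; a missing score counts as 0.
    return all(scores.get(dim, 0.0) >= t for dim, t in thresholds.items())


def answer(
    query: str,
    fast_path: Callable[[str], Draft],
    full_deliberation: Callable[[str], str],
    thresholds: Dict[str, float],
) -> Tuple[str, str]:
    # Try the cheap retrieval-based path first.
    draft = fast_path(query)
    if passes_gate(draft.scores, thresholds):
        return draft.text, "fast"
    # Fall back to the exhaustive path only when the gate rejects the draft.
    return full_deliberation(query), "deliberation"


# Assumed thresholds and toy components, for illustration only.
THRESHOLDS = {"relevance": 0.7, "completeness": 0.6}


def toy_fast_path(query: str) -> Draft:
    # Pretend that temporal questions are poorly served by vector retrieval.
    if "when" in query.lower():
        return Draft("unsure", {"relevance": 0.8, "completeness": 0.3})
    return Draft("Alice lives in Paris.", {"relevance": 0.9, "completeness": 0.9})


def toy_full_deliberation(query: str) -> str:
    return "answer from an exhaustive pass over the full history"
```

Requiring every dimension to clear its threshold is one possible gating rule; a weighted sum or a learned classifier would fit the same interface.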

Key Contribution

By routing queries between fast, lossy retrieval and slower, exhaustive deliberation, LLM agents can recover most of the accuracy of full deliberation (96.7% of its F1 on LoCoMo with GPT-4o-mini) at a significantly lower computational cost.

Abstract

Driven by the development of persistent, self-adapting autonomous agents, equipping these systems with high-fidelity memory access for long-horizon reasoning has emerged as a critical requirement. However, prevalent retrieval-based memory frameworks often follow an incremental processing paradigm that continuously extracts and updates conversational memories into vector databases, relying on semantic retrieval when queried. While this approach is fast, it inherently relies on lossy abstraction, frequently missing contextually critical information and struggling to resolve queries that rely on fine-grained contextual understanding. To address this, we introduce D-Mem, a dual-process memory system. It retains lightweight vector retrieval for routine queries while establishing an exhaustive Full Deliberation module as a high-fidelity fallback. To achieve cognitive economy without sacrificing accuracy, D-Mem employs a Multi-dimensional Quality Gating policy to dynamically bridge these two processes. Experiments on the LoCoMo and RealTalk benchmarks using GPT-4o-mini and Qwen3-235B-Instruct demonstrate the efficacy of our approach. Notably, our Multi-dimensional Quality Gating policy achieves an F1 score of 53.5 on LoCoMo with GPT-4o-mini. This outperforms our static retrieval baseline, Mem0* (51.2), and recovers 96.7% of Full Deliberation's performance (55.3), while incurring significantly lower computational costs.
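The headline figures can be checked with a few lines of arithmetic. The three F1 scores below are taken from the abstract; the "gap closed" ratio is a derived quantity computed here for context and is not reported in the abstract itself.

```python
# F1 scores on LoCoMo with GPT-4o-mini, as stated in the abstract.
f1_gated = 53.5      # Multi-dimensional Quality Gating
f1_baseline = 51.2   # static retrieval baseline (Mem0*)
f1_full = 55.3       # Full Deliberation

# Share of Full Deliberation's F1 that the gated policy recovers.
recovered_pct = round(100 * f1_gated / f1_full, 1)

# Absolute gain over the static baseline, in F1 points.
gain_points = round(f1_gated - f1_baseline, 1)

# Derived, not from the abstract: fraction of the baseline-to-full gap closed.
gap_closed_pct = round(100 * (f1_gated - f1_baseline) / (f1_full - f1_baseline), 1)

print(recovered_pct, gain_points, gap_closed_pct)
```

The first value reproduces the 96.7% recovery figure quoted in the abstract.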

Reasoning & Chain-of-Thought | Recommendation & Information Retrieval | Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 39

Year: 2026

Venue: N/A
