Search papers, labs, and topics across Lattice.
The paper introduces MemReader, a family of models for active long-term memory extraction in agents, addressing the limitations of passive transcription methods. MemReader-4B uses Group Relative Policy Optimization (GRPO) to actively decide when and how to write memories based on information value, ambiguity, and completeness. Experiments show MemReader-4B achieves state-of-the-art performance on knowledge updating, temporal reasoning, and hallucination reduction compared to passive extraction baselines.
Stop blindly transcribing everything into agent memory: MemReader selectively writes memories based on reasoning, slashing noise and boosting performance on temporal reasoning and knowledge updates.
Long-term memory is fundamental for personalized and autonomous agents, yet populating it remains a bottleneck. Existing systems treat memory extraction as a one-shot, passive transcription from context to structured entries, which struggles with noisy dialogue, missing references, and cross-turn dependencies, leading to memory pollution, low-value writes, and inconsistency. In this paper, we introduce the MemReader family for active long-term memory extraction in agent systems: MemReader-0.6B, a compact and cost-efficient passive extractor distilled for accurate and schema-consistent structured outputs, and MemReader-4B, an active extractor optimized with Group Relative Policy Optimization (GRPO) to make memory writing decisions. Under a ReAct-style paradigm, MemReader-4B explicitly evaluates information value, reference ambiguity, and completeness before acting, and can selectively write memories, defer incomplete inputs, retrieve historical context, or discard irrelevant chatter. Experiments on LOCOMO, LongMemEval, and HaluMem show that MemReader consistently outperforms existing extraction-based baselines. In particular, MemReader-4B achieves state-of-the-art performance on tasks involving knowledge updating, temporal reasoning, and hallucination reduction. These results suggest that effective agent memory requires not merely extracting more information, but performing reasoning-driven and selective memory extraction to build low-noise and dynamically evolving long-term memory. Furthermore, MemReader has been integrated into MemOS and is being deployed in real-world applications. To support future research and adoption, we release the models and provide public API access.