This paper introduces True Memory, an agent memory architecture that stores events verbatim and relies on a multi-stage retrieval pipeline instead of extraction at ingestion. Because nothing is discarded at storage time, information stays recoverable even when the eventual query is unknown during ingestion. On the LoCoMo, LongMemEval, and BEAM-1M benchmarks, True Memory Pro outperforms most existing memory architectures: it exceeds the prior published result on BEAM-1M (76.6% vs. 73.9%) and trails only EverMemOS on LoCoMo (93.0% vs. 94.5%). The system runs on commodity CPUs as a single SQLite file, with no external database, vector index, graph store, or GPU.
Ditch the vector DB – this new agent architecture achieves SOTA memory recall by storing everything verbatim and optimizing retrieval, all in a single SQLite file.
Extraction at ingestion is the wrong primitive for agent memory: content discarded before the query is known cannot be recovered at retrieval time. We propose True Memory, a six-layer architecture that shifts the center of the system from a storage schema to a multi-stage retrieval pipeline operating over events preserved verbatim. The full system runs as a single SQLite file on commodity CPU with no external database, vector index, graph store, or GPU. On LoCoMo (1,540 questions across 10 multi-session conversations), True Memory Pro reaches 93.0% accuracy (3-run mean) against 61.4% for Mem0, 65.4% for Supermemory, approximately 71% for Zep, and 94.5% for EverMemOS under a matched gpt-4.1-mini answer model. On LongMemEval (500 questions), True Memory Pro reaches 87.8% (3-run mean). On BEAM-1M (700 questions at the 1-million-token scale), True Memory Pro reaches 76.6% (3-run mean), above the prior published result of 73.9% for Hindsight. A 56-configuration ablation shows a 1.3-percentage-point spread within the top-performing configuration family.
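To make the core idea concrete, here is a minimal sketch of verbatim storage with retrieval deferred to query time, using only Python's standard `sqlite3` module. It is not the paper's six-layer architecture. The `events` schema, the function names (`open_store`, `ingest`, `retrieve`), and the two illustrative stages (a `LIKE`-based candidate pass followed by a token-overlap rerank) are all assumptions made for this example.

```python
import re
import sqlite3


def tokenize(text):
    """Lowercase alphanumeric tokens as a set (illustrative tokenizer)."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def open_store(path=":memory:"):
    """Open a single-file store; the schema here is hypothetical."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events ("
        "id INTEGER PRIMARY KEY, session TEXT, speaker TEXT, content TEXT)"
    )
    return conn


def ingest(conn, session, speaker, content):
    """Store the event verbatim: no summarization or fact extraction."""
    conn.execute(
        "INSERT INTO events (session, speaker, content) VALUES (?, ?, ?)",
        (session, speaker, content),
    )
    conn.commit()


def retrieve(conn, query, k=3):
    """Two illustrative stages: broad candidate recall, then rerank."""
    terms = tokenize(query)
    if not terms:
        return []
    # Stage 1: cheap, high-recall candidate pass (substring match on any term).
    where = " OR ".join("content LIKE ?" for _ in terms)
    params = [f"%{t}%" for t in sorted(terms)]
    rows = conn.execute(
        f"SELECT id, content FROM events WHERE {where}", params
    ).fetchall()
    # Stage 2: rerank candidates by whole-token overlap with the query,
    # dropping candidates that only matched as substrings.
    scored = [(len(terms & tokenize(c)), -eid, c) for eid, c in rows]
    scored = [s for s in scored if s[0] > 0]
    scored.sort(reverse=True)  # highest overlap first, earliest event on ties
    return [c for _, _, c in scored[:k]]


conn = open_store()
ingest(conn, "s1", "user", "I adopted a cat named Miso last spring.")
ingest(conn, "s2", "user", "My flight to Lisbon leaves on Friday.")
print(retrieve(conn, "When does my flight leave?"))
```

Because the flight event is stored in full, the departure day is still available at query time, even though no ingestion-time extractor was asked to keep it. A production pipeline would presumably replace both stages with stronger retrieval components. The sketch only shows where the work moves: from the storage schema to the retrieval path.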