This paper introduces Knowledge Objects (KOs), a discrete, hash-addressed, tuple-based memory system, as an alternative to in-context memory for persistent LLM knowledge management. Using Claude Sonnet 4.5 as the in-context baseline, the authors show that in-context memory is accurate within its context window but suffers from capacity limits, fact loss during summarization, and goal drift during iterative compaction. KOs, in contrast, achieve 100% accuracy at 252x lower cost and reach 78.9% versus 31.6% on multi-hop reasoning, highlighting the limits of in-context approaches to persistent memory.
LLMs forget up to 60% of facts when summarizing and erode over half of project constraints during iterative compaction, but a simple discrete memory system (KOs) fixes this while slashing costs by 252x.
Large language models increasingly serve as persistent knowledge workers, with in-context memory (facts stored in the prompt) as the default strategy. We benchmark in-context memory against Knowledge Objects (KOs), discrete hash-addressed tuples with O(1) retrieval. Within the context window, Claude Sonnet 4.5 achieves 100% exact-match accuracy from 10 to 7,000 facts (97.5% of its 200K window). However, production deployment reveals three failure modes: capacity limits (prompts overflow at 8,000 facts), compaction loss (summarization destroys 60% of facts), and goal drift (cascading compaction erodes 54% of project constraints while the model continues with full confidence). KOs achieve 100% accuracy across all conditions at 252x lower cost. On multi-hop reasoning, KOs reach 78.9% versus 31.6% for in-context memory. Cross-model replication across four frontier models confirms that compaction loss is architectural, not model-specific. We additionally show that embedding retrieval fails on adversarial facts (20% precision at 1) and that neural memory (Titans) stores facts but fails to retrieve them on demand. We introduce density-adaptive retrieval as a switching mechanism and release the benchmark suite.
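To make "discrete hash-addressed tuples with O(1) retrieval" concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the tuple schema or the hashing scheme, so the (subject, predicate, value) layout, the SHA-256 key, and the names `KnowledgeObject`, `ko_key`, and `KOStore` are all assumptions made for illustration.

```python
import hashlib
from typing import Dict, NamedTuple, Optional


class KnowledgeObject(NamedTuple):
    """One discrete fact. The three-field schema is an assumption."""
    subject: str
    predicate: str
    value: str


def ko_key(subject: str, predicate: str) -> str:
    """Derive a stable address from the normalized (subject, predicate) pair."""
    normalized = f"{subject.strip().lower()}\x1f{predicate.strip().lower()}"
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


class KOStore:
    """Hypothetical store: a dict keyed by hash, so lookup is O(1) on average."""

    def __init__(self) -> None:
        self._objects: Dict[str, KnowledgeObject] = {}

    def put(self, subject: str, predicate: str, value: str) -> str:
        key = ko_key(subject, predicate)
        # Writing to an existing address overwrites the old fact in place.
        self._objects[key] = KnowledgeObject(subject, predicate, value)
        return key

    def get(self, subject: str, predicate: str) -> Optional[KnowledgeObject]:
        # Exact lookup by address; nothing is summarized or compacted.
        return self._objects.get(ko_key(subject, predicate))


store = KOStore()
store.put("Project Atlas", "max_latency_ms", "250")
store.put("Project Atlas", "database", "PostgreSQL")

fact = store.get("project atlas", "database")
missing = store.get("Project Atlas", "budget")
```

In this sketch each fact is stored verbatim at its own address, so retrieval cost does not grow with the number of facts and no fact is rewritten by a summarization step. Only the facts relevant to a query would need to be placed in the prompt, which is one plausible reading of where the reported cost savings come from.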