Mar 2, 2026 · arXiv:2603.01761

Modular Memory is the Key to Continual Learning Agents

Vaggelis Dorovatas, Malte Schwerin, Andrew D. Bagdanov, Lucas Caccia, Antonio Carta, Laurent Charlin, Barbara Hammer, Tyler L. Hayes, Timm Hess, Christopher Kanan, Dhireesha Kudithipudi, Xialei Liu, Vincenzo Lomonaco, Jorge Mendez-Mendez, Darshan Patil, Tinne Tuytelaars, Gido M. van de Ven, Liyuan Wang, Joost Van de Weijer, Jonghyun Choi, Martin Mundt, Rahaf Aljundi

AI Summary

This paper proposes a modular memory-centric architecture combining In-Weight Learning (IWL) and In-Context Learning (ICL) to address catastrophic forgetting in continual learning. It argues that the limitations of solely relying on IWL can be overcome by integrating ICL for rapid adaptation and knowledge accumulation, while IWL provides stable updates. The paper outlines a conceptual framework for this architecture, suggesting it as a promising path toward creating continually learning agents capable of adapting at scale.

Key Contribution

Forget catastrophic forgetting: modular memory, blending in-context and in-weight learning, offers a practical path to truly continual learning agents.

Abstract

Foundation models have transformed machine learning through large-scale pretraining and increased test-time compute. Despite surpassing human performance in several domains, these models remain fundamentally limited in continuous operation, experience accumulation, and personalization, capabilities that are central to adaptive intelligence. While continual learning research has long targeted these goals, its historical focus on in-weight learning (IWL), i.e., updating a single model's parameters to absorb new knowledge, has rendered catastrophic forgetting a persistent challenge. Our position is that combining the strengths of In-Weight Learning (IWL) and the newly emerged capabilities of In-Context Learning (ICL) through the design of modular memory is the missing piece for continual adaptation at scale. We outline a conceptual framework for modular memory-centric architectures that leverage ICL for rapid adaptation and knowledge accumulation, and IWL for stable updates to model capabilities, charting a practical roadmap toward continually learning agents.
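
The split the abstract describes, fast accumulation through in-context memory and slower, stable updates to the weights, can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the class `ModularMemoryAgent`, its methods, and the use of plain dictionaries are hypothetical and are not the paper's architecture. In particular, the `weights` dictionary only stands in for model parameters; a real in-weight update would involve training, not a dictionary merge.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class ModularMemoryAgent:
    """Toy illustration only: hypothetical names, not the paper's design."""

    # Slow store standing in for model weights (IWL analogue).
    # It changes only when consolidate() is called.
    weights: Dict[str, str] = field(default_factory=dict)
    # Fast store standing in for an external memory read in context (ICL analogue).
    memory: Dict[str, str] = field(default_factory=dict)

    def observe(self, key: str, value: str) -> None:
        """Rapid adaptation: record new experience without touching the weights."""
        self.memory[key] = value

    def answer(self, key: str) -> Optional[str]:
        """Prefer fresh in-context memory, then fall back to in-weight knowledge."""
        if key in self.memory:
            return self.memory[key]
        return self.weights.get(key)

    def consolidate(self) -> int:
        """Infrequent update: fold accumulated memory into the weights, then clear it."""
        moved = len(self.memory)
        self.weights.update(self.memory)
        self.memory.clear()
        return moved


agent = ModularMemoryAgent(weights={"capital_of_france": "Paris"})
agent.observe("user_name", "Ada")
before = agent.answer("user_name")  # served from the fast memory
moved = agent.consolidate()         # number of entries folded into the weights
after = agent.answer("user_name")   # now served from the slow store
```

In this sketch new information is usable immediately after `observe`, while the slow store is modified only at explicit consolidation points, which is one simple way to picture keeping rapid adaptation separate from stable capability updates.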

Architecture Design (Transformers, SSMs, MoE) · Tool Use & Agents · Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
