Search papers, labs, and topics across Lattice.
This paper critiques the current focus of imitation learning on perfect replay, arguing it hinders adaptability to changing contexts and goals. It proposes a shift towards compositional adaptability, where agents learn and recombine behavioral primitives without retraining. The paper introduces metrics for compositional generalization and suggests hybrid architectures to achieve lifelong adaptability in imitation learning.
Imitation learning's obsession with perfect replay is a dead end; true progress lies in compositional adaptability, enabling agents to recombine learned behaviors in novel contexts without retraining.
Imitation learning stands at a crossroads: despite decades of progress, current imitation learning agents remain sophisticated memorisation machines, excelling at replay but failing when contexts shift or goals evolve. This paper argues that this failure is not technical but foundational: imitation learning has been optimised for the wrong objective. We propose a research agenda that redefines success from perfect replay to compositional adaptability. Such adaptability hinges on learning behavioural primitives once and recombining them through novel contexts without retraining. We establish metrics for compositional generalisation, propose hybrid architectures, and outline interdisciplinary research directions drawing on cognitive science and cultural evolution. Agents that embed adaptability at the core of imitation learning thus have an essential capability for operating in an open-ended world.