Search papers, labs, and topics across Lattice.
This paper formalizes the memory requirements for generalist agents to achieve near-optimal performance across diverse environments and goals. It establishes that when faced with observational bottlenecks and conflicting optimal actions, agents must maintain distinct memory distributions to navigate effectively. The findings reveal that successful agents cannot solely depend on current observations; they must retain relevant domain information to facilitate planning and transition modeling.
Generalist agents need to remember distinct information to navigate conflicting optimal actions across environments, challenging the notion that current state observations are sufficient.
This paper develops a formal account of what generalist agents must store in memory in order to act near-optimally across multiple environments and goals. It shows that when two domains share an observational bottleneck but require incompatible optimal actions, any uniformly near-optimal policy must induce distinct memory distributions at that bottleneck. The result yields a separation theorem: sufficiently successful agents cannot rely only on current state observations, but must preserve domain-relevant information in memory. The paper further shows that if an agent's memory contains enough information to estimate values for related goals, then that memory can be used to approximately reconstruct the agent's local transition dynamics. Together, these results characterize memory as the substrate that supports domain disambiguation, transition-model reconstruction, and planning for generalist agents.