Search papers, labs, and topics across Lattice.
AllDayNav introduces a lifelong self-learning navigation framework that enables robots to autonomously navigate dynamic environments by encoding scene dynamics within a large model's parameters through reinforcement learning. This approach leverages a self-evolving multimodal memory system that updates visual keyframes, semantic descriptions, and temporal context, allowing for the generation of open-vocabulary instructions and structured rewards. Experimental results reveal that AllDayNav achieves near-perfect success rates and outperforms traditional map-based and reinforcement learning baselines in terms of path efficiency and robustness across various scenarios.
AllDayNav achieves near-perfect navigation success rates by leveraging a self-evolving memory system that outperforms traditional mapping methods.
Lifelong embodied navigation in dynamic environments requires robots to form persistent scene understanding from fragmentary observations, which remains difficult for existing methods that rely on explicit maps or scene graphs and struggle to generalize beyond structured settings. We propose AllDayNav, a lifelong self-learning navigation framework that implicitly encodes scene dynamics into the billion-scale parameters of a large model via reinforcement learning, powered by a self-evolving multimodal memory that maintains and updates visual keyframes, semantic descriptions, and temporal context while autonomously generating open-vocabulary instructions, image goals, and structured rewards. Experiments in both synthetic and real-world environments across cross-room, cross-episode, and cross-task scenarios show that AllDayNav achieves success rates approaching $100\%$ and consistently surpasses strong map-based, VLM, and RL baselines in path efficiency and robustness, demonstrating implicit, memory-driven reinforcement learning as a scalable alternative to explicit mapping for reliable lifelong navigation.