Search papers, labs, and topics across Lattice.
This paper introduces GUIDE, an end-to-end reinforcement learning framework for visual navigation in legged robots that operates with a single initial goal, eliminating the need for continuous goal updates. By leveraging a spatial anchor predictor and multi-frequency proprioceptive history, GUIDE cultivates internal directional awareness and maintains long-horizon spatial context, allowing robots to navigate complex environments autonomously. Experimental results demonstrate that GUIDE enables quadruped robots to effectively navigate dense clutter and structured mazes without external guidance or pre-existing maps, showcasing significant advancements in mobile autonomy.
Robots can now navigate complex environments solely based on initial goals, achieving autonomy without continuous external guidance.
Learning-based visual navigation for legged robots typically relies on continuous goal updates from hierarchical state estimation to provide a persistent directional reference. This reliance incurs additional sensory and computational overhead and deviates from fully end-to-end mobile autonomy. Furthermore, under partial observability, policies are prone to learn myopic behaviors, easily becoming trapped in dead ends and complex structural layouts. To address these limitations, we investigate a goal-initialized navigation setting, where the target is provided only once at the beginning of an episode, requiring the robot to operate based on intrinsic spatial memory without subsequent goal updates from external modules. In this work, we propose GUIDE, a fully end-to-end reinforcement learning framework designed to cultivate internal directional awareness. Specifically, GUIDE incorporates a spatial anchor predictor that leverages multi-frequency proprioceptive history to extract egomotion representations, thereby maintaining a persistent long-horizon spatial context for navigation. Concurrently, it utilizes raw depth streams to perceive local environmental geometry. We evaluate the proposed framework across both simulation and real-world scenarios on a quadruped robot. Experiments show that GUIDE learns reliable egomotion and directional awareness, enabling a fully end-to-end deployed policy to safely navigate through dense clutter and structured mazes without subsequent goal guidance or prior maps.