This paper introduces SemGeoNav, a hierarchical visual navigation framework that combines high-level semantic reasoning from end-to-end models with the reliable local planning capabilities of geometric methods. By integrating these approaches, SemGeoNav enhances obstacle avoidance in complex environments while maintaining robust image-based navigation. Evaluated on a Unitree Go2 quadruped robot, the framework significantly outperformed existing methods like ViNT and NoMaD, achieving higher success rates and reduced navigation times.
SemGeoNav achieves superior obstacle avoidance and navigation efficiency by merging semantic reasoning with geometric planning, outperforming leading methods in real-world tests.
Learning-based visual navigation has enhanced semantic goal-reaching capabilities. However, due to their black-box nature, purely end-to-end models often lack explicit geometric constraints, leading to unpredictable and unreliable obstacle avoidance in open environments. Conversely, traditional geometric planners ensure safety but struggle with high-dimensional visual targets. To address these limitations, we propose SemGeoNav, a novel hierarchical visual navigation framework. It tightly integrates the high-level semantic reasoning of end-to-end models with the reliable local planning ability of geometry-based methods, achieving robust image-based navigation while significantly improving obstacle avoidance. Furthermore, we introduce a temporal trajectory smoothing mechanism to ensure continuous and stable robot motion. We evaluated SemGeoNav on a Unitree Go2 quadruped robot in real-world environments. The results demonstrate that SemGeoNav outperforms existing representative methods, including ViNT and NoMaD, achieving higher success rates and shorter navigation times.
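The abstract does not say how the temporal trajectory smoothing mechanism works. As an illustration only, the sketch below shows one common way to smooth successive trajectory predictions: an exponential moving average over waypoints. The class name `TrajectorySmoother`, the `alpha` parameter, and the 2D waypoint representation are all assumptions made for this example, not details taken from SemGeoNav.

```python
from typing import List, Optional, Sequence, Tuple

# Hypothetical representation: a waypoint is an (x, y) position in the robot frame.
Waypoint = Tuple[float, float]


class TrajectorySmoother:
    """Blend each new predicted trajectory with the previous smoothed one.

    This is an exponential moving average, chosen here for illustration;
    it is not claimed to be the mechanism used in the paper.
    """

    def __init__(self, alpha: float = 0.5) -> None:
        # alpha = 1.0 trusts only the newest prediction; smaller values
        # give more weight to the history and yield smoother motion.
        if not 0.0 < alpha <= 1.0:
            raise ValueError("alpha must be in (0, 1]")
        self.alpha = alpha
        self._previous: Optional[List[Waypoint]] = None

    def update(self, trajectory: Sequence[Waypoint]) -> List[Waypoint]:
        current = [(float(x), float(y)) for x, y in trajectory]
        # With no history (or a horizon change), pass the prediction through.
        if self._previous is None or len(self._previous) != len(current):
            self._previous = current
            return list(current)
        a = self.alpha
        blended = [
            (a * cx + (1.0 - a) * px, a * cy + (1.0 - a) * py)
            for (cx, cy), (px, py) in zip(current, self._previous)
        ]
        self._previous = blended
        return list(blended)


smoother = TrajectorySmoother(alpha=0.5)
first = smoother.update([(1.0, 0.0), (2.0, 0.0)])
second = smoother.update([(1.0, 1.0), (2.0, 2.0)])
print(first, second)
```

In this sketch a sudden lateral jump in the second prediction is halved before it reaches the controller, which is the kind of frame-to-frame jitter a smoothing step is meant to damp. A real system would also need to account for robot motion between predictions, which this example ignores.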