Search papers, labs, and topics across Lattice.
This study systematically evaluates the impact of various sensor configurations on the performance of multimodal SLAM in quadrupedal robots, addressing a critical gap in understanding how hardware influences navigation in dynamic environments. By analyzing state-of-the-art visual, visual-inertial, and LiDAR-visual-inertial SLAM methods, the authors reveal that stereo configurations consistently outperform monocular setups, while global shutter cameras significantly reduce motion-induced tracking failures. The findings highlight that standard inertial integration can actually degrade performance in vision-centric frameworks, providing essential design guidelines for optimizing sensor payloads in agile legged systems.
Hardware selection can make or break SLAM performance in quadrupedal robots, with stereo setups and global shutter cameras proving crucial for resilience in dynamic environments.
Autonomous navigation of quadrupedal robots in diverse environments fundamentally relies on resilient Simultaneous Localization and Mapping (SLAM). While visual-inertial SLAM has matured across wheeled, handheld, and aerial platforms, a critical evaluation gap remains regarding how hardware-level sensor configurations affect performance under the aggressive dynamics of legged locomotion. Quadrupeds introduce distinct embodiment-induced sensory challenges, including foot-impact shocks, high-frequency mechanical vibrations, and rapid angular rotations, which degrade standard perception pipelines. To address this gap, we present a systematic evaluation of state-of-the-art visual, visual-inertial, and LiDAR-visual-inertial SLAM methods using the GrandTour dataset recorded on an ANYmal D quadruped. We isolate and quantify the impacts of camera modalities, shutter techniques, and inertial sensor tiers, analyzing their trade-offs across localization accuracy, algorithmic robustness, and computational resource utilization. Our empirical findings demonstrate that hardware selection has substantial influence on system resilience: stereo configurations consistently outperform monocular and RGB-D modalities, global shutter cameras significantly mitigate motion-induced tracking failures compared to rolling shutter cameras, and, crucially, standard inertial integration can degrade the performance of primarily vision-based frameworks under harsh legged locomotion. These insights additionally offer concrete design guidelines for tailoring custom sensor payloads to achieve dependable perception on agile legged systems.