Search papers, labs, and topics across Lattice.
This paper introduces a novel approach to quadrupedal locomotion that decouples task specification, operational limits, gait preference, and terrain adaptation into distinct components, allowing for more efficient and adaptable movement. By eliminating explicit gait priors, the method achieves a 56% reduction in cost of transport and a 96% decrease in operational-limit violations compared to traditional complex-reward systems. The resulting locomotion policies demonstrate effective zero-shot transfer to a physical robot, showcasing the practical applicability of the approach in real-world scenarios.
Removing explicit gait priors leads to a 56% reduction in energy costs and significantly improved adaptability in quadrupedal locomotion.
Learning-based quadrupedal locomotion typically relies on complex reward formulations that entangle task specification, operational limits, gait preference, and terrain adaptation within a single optimization objective. We instead treat these functions through distinct mechanisms: rewards for task specification, constraints for operational limits, energy minimization for gait preference, and exteroceptive perception for adapting energy use to terrain difficulty. We show that these components jointly enable efficient, terrain-adaptive locomotion, and that removing each component exposes a distinct failure mode. Our formulation removes explicit gait priors (including air-time, contact-count, and foot-clearance targets) in favor of emergent behavior. Compared to a conventional complex-reward baseline, our formulation achieves comparable terrain traversal while reducing cost of transport by 56% and operational-limit violations by 96%. The resulting policies transfer zero-shot to a physical Unitree Go2 using LiDAR-based elevation mapping. Project website with videos: https://tinyurl.com/locomposition.