Search papers, labs, and topics across Lattice.
X-Loco is introduced as a framework for training a vision-based generalist humanoid locomotion policy by synergistically distilling multiple specialist policies. A case-adaptive specialist selection mechanism dynamically leverages these policies to guide a vision-based student policy, enabling the acquisition of a broad range of locomotion skills. Experiments demonstrate X-Loco's superior performance in tasks like fall recovery and terrain traversal, achieving vision-based humanoid locomotion that integrates upright locomotion, whole-body coordination, and fall recovery without reference motions.
Forget brittle, single-skill robots: X-Loco achieves robust, vision-based humanoid locomotion by intelligently combining multiple expert policies for fall recovery, terrain traversal, and whole-body coordination.
While recent advances have demonstrated strong performance in individual humanoid skills such as upright locomotion, fall recovery and whole-body coordination, learning a single policy that masters all these skills remains challenging due to the diverse dynamics and conflicting control objectives involved. To address this, we introduce X-Loco, a framework for training a vision-based generalist humanoid locomotion policy. X-Loco trains multiple oracle specialist policies and adopts a synergetic policy distillation with a case-adaptive specialist selection mechanism, which dynamically leverages multiple specialist policies to guide a vision-based student policy. This design enables the student to acquire a broad spectrum of locomotion skills, ranging from fall recovery to terrain traversal and whole-body coordination skills. To the best of our knowledge, X-Loco is the first framework to demonstrate vision-based humanoid locomotion that jointly integrates upright locomotion, whole-body coordination and fall recovery, while operating solely under velocity commands without relying on reference motions. Experimental results show that X-Loco achieves superior performance, demonstrated by tasks such as fall recovery and terrain traversal. Ablation studies further highlight that our framework effectively leverages specialist expertise and enhances learning efficiency.