This paper introduces HALO, a two-stage gradient-based system identification framework leveraging MuJoCo XLA to close the sim-to-real gap for humanoid robots carrying heavy, unknown payloads. The first stage calibrates the robot's nominal model, and the second identifies the payload's mass distribution, both using real-world data and differentiable simulation. Results show HALO enables zero-shot transfer of RL policies to hardware, significantly improving agility and robustness under heavy load compared to baselines.
Humanoid robots can now handle heavy, unknown payloads in the real world thanks to a system that identifies mass distribution via differentiable simulation.
Humanoid robots deployed in real-world scenarios often need to carry unknown payloads, which introduce significant model mismatch between simulation and hardware and degrade the effectiveness of simulation-to-reality reinforcement learning methods. To address this challenge, we propose a two-stage gradient-based system identification framework built on the differentiable simulator MuJoCo XLA. The first stage calibrates the nominal robot model using real-world data to reduce intrinsic sim-to-real discrepancies, while the second stage further identifies the mass distribution of the unknown payload. By explicitly reducing structured model bias prior to policy training, our approach enables zero-shot transfer of reinforcement learning policies to hardware under heavy-load conditions. Extensive simulation and real-world experiments demonstrate more precise parameter identification, improved motion tracking accuracy, and substantially enhanced agility and robustness compared to existing baselines.
Project Page: https://mwondering.github.io/halo-humanoid/
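To make the two-stage idea concrete, the following is a minimal toy sketch and not the authors' implementation. It assumes a one-dimensional point mass with viscous damping, u = m*a + b*v, in place of a humanoid, and it uses hand-derived gradients in plain Python instead of MuJoCo XLA's automatic differentiation. It also recovers only a scalar payload mass, whereas the paper identifies a full mass distribution. All function names, constants, and the synthetic data are hypothetical.

```python
import math

def make_data(mass, damping, n=200):
    """Synthetic (acceleration, velocity, force) samples from u = mass*a + damping*v."""
    samples = []
    for i in range(n):
        a = math.sin(0.7 * i)   # assumed excitation, chosen only for good conditioning
        v = math.cos(0.3 * i)
        samples.append((a, v, mass * a + damping * v))
    return samples

def calibrate_nominal(data, lr=0.5, steps=300):
    """Stage 1 (toy): fit robot mass m and damping b on unloaded data by gradient descent."""
    m, b = 1.0, 0.0             # deliberately poor initial guess
    n = len(data)
    for _ in range(steps):
        grad_m = grad_b = 0.0
        for a, v, u in data:
            residual = m * a + b * v - u          # force prediction error
            grad_m += 2.0 * residual * a / n      # d(mean squared error)/dm
            grad_b += 2.0 * residual * v / n      # d(mean squared error)/db
        m -= lr * grad_m
        b -= lr * grad_b
    return m, b

def identify_payload(data, robot_mass, damping, lr=0.5, steps=300):
    """Stage 2 (toy): with the calibrated model frozen, fit only the payload mass."""
    payload = 0.0
    n = len(data)
    for _ in range(steps):
        grad = 0.0
        for a, v, u in data:
            residual = (robot_mass + payload) * a + damping * v - u
            grad += 2.0 * residual * a / n        # d(mean squared error)/d(payload)
        payload -= lr * grad
    return payload

# Hypothetical ground truth used only to generate the synthetic measurements.
TRUE_ROBOT_MASS, TRUE_DAMPING, TRUE_PAYLOAD = 5.0, 0.8, 3.0

m_hat, b_hat = calibrate_nominal(make_data(TRUE_ROBOT_MASS, TRUE_DAMPING))
loaded = make_data(TRUE_ROBOT_MASS + TRUE_PAYLOAD, TRUE_DAMPING)
payload_hat = identify_payload(loaded, m_hat, b_hat)
print(f"robot mass={m_hat:.4f} damping={b_hat:.4f} payload={payload_hat:.4f}")
```

Freezing the stage-1 estimates before fitting the payload mirrors the ordering described in the abstract: errors in the nominal model are removed first so that they are not absorbed into the payload estimate.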