Search papers, labs, and topics across Lattice.
The paper introduces Pro-HOI, a framework for humanoid loco-manipulation that uses root-trajectory conditioning and a persistent object estimation module to improve generalization and robustness. They optimize box-carrying motions using a Signed Distance Field loss to reduce penetration artifacts and train a policy conditioned on a desired root-trajectory, using reference motion as a reward signal. The persistent object estimation module fuses real-time detection with a Digital Twin to enable autonomous slippage detection and re-grasping.
Humanoid robots can now reliably perform long-horizon loco-manipulation tasks in the real world thanks to a novel root-trajectory conditioned policy and persistent object estimation.
Executing reliable Humanoid-Object Interaction (HOI) tasks for humanoid robots is hindered by the lack of generalized control interfaces and robust closed-loop perception mechanisms. In this work, we introduce Perceptive Root-guided Humanoid-Object Interaction, Pro-HOI, a generalizable framework for robust humanoid loco-manipulation. First, we collect box-carrying motions that are suitable for real-world deployment and optimize penetration artifacts through a Signed Distance Field loss. Second, we propose a novel training framework that conditions the policy on a desired root-trajectory while utilizing reference motion exclusively as a reward. This design not only eliminates the need for intricate reward tuning but also establishes root trajectory as a universal interface for high-level planners, enabling simultaneous navigation and loco-manipulation. Furthermore, to ensure operational reliability, we incorporate a persistent object estimation module. By fusing real-time detection with Digital Twin, this module allows the robot to autonomously detect slippage and trigger re-grasping maneuvers. Empirical validation on a Unitree G1 robot demonstrates that Pro-HOI significantly outperforms baselines in generalization and robustness, achieving reliable long-horizon execution in complex real-world scenarios.