Search papers, labs, and topics across Lattice.
TeamHOI introduces a decentralized policy framework for physics-based humanoid agents to perform cooperative human-object interaction (HOI) tasks, scaling to variable team sizes. The framework uses a Transformer-based policy network with teammate tokens for coordination based on local observations. To improve motion realism and address data scarcity, they employ a masked Adversarial Motion Prior (AMP) strategy, leveraging single-human reference motions while focusing task rewards on object-interacting body parts.
A single, decentralized policy can now control teams of physics-based humanoids to cooperatively manipulate objects, even with varying team sizes and object shapes.
Physics-based humanoid control has achieved remarkable progress in enabling realistic and high-performing single-agent behaviors, yet extending these capabilities to cooperative human-object interaction (HOI) remains challenging. We present TeamHOI, a framework that enables a single decentralized policy to handle cooperative HOIs across any number of cooperating agents. Each agent operates using local observations while attending to other teammates through a Transformer-based policy network with teammate tokens, allowing scalable coordination across variable team sizes. To enforce motion realism while addressing the scarcity of cooperative HOI data, we further introduce a masked Adversarial Motion Prior (AMP) strategy that uses single-human reference motions while masking object-interacting body parts during training. The masked regions are then guided through task rewards to produce diverse and physically plausible cooperative behaviors. We evaluate TeamHOI on a challenging cooperative carrying task involving two to eight humanoid agents and varied object geometries. Finally, to promote stable carrying, we design a team-size- and shape-agnostic formation reward. TeamHOI achieves high success rates and demonstrates coherent cooperation across diverse configurations with a single policy.