Search papers, labs, and topics across Lattice.
This study explores how humanoid robots can achieve self-other distinction through proprioceptive-visual correspondence, eliminating the need for identity labels or kinematic models. By establishing this distinction, the robots develop a predictive self-model that accurately maps their joint configurations to three-dimensional body occupancy, enabling them to adapt their actions in multi-agent environments. The findings demonstrate that this self-model supports various downstream tasks, including target reaching and collision-aware motion planning, paving the way for enhanced social interaction in shared spaces with humans and other robots.
Humanoid robots can learn to distinguish themselves from others purely through proprioceptive-visual cues, enabling advanced social interactions without predefined identities.
Distinguishing self from others is a prerequisite for social intelligence, yet humanoid robots that increasingly share workspaces with humans still lack this ability. Here we show that a humanoid robot can learn self-other distinction from proprioceptive-visual correspondence, without any identity labels or kinematic models. Once established, this distinction bootstraps a predictive self-model that maps joint configurations to three-dimensional body occupancy, capturing how the robot's body changes with action. In multi-agent scenes involving humans or morphologically identical robots, the system reliably identifies itself, learns a 3D self-model, and supports downstream tasks including target reaching, collision-aware motion planning, and human-to-robot motion retargeting. Together, these results outline a route toward bodily self-representation in robots that act and coordinate alongside others in shared physical environments. Project page: https://euron-zc.github.io/humanoid-self-model/.