CMU MLD · Eastern Institute of Technology · SJTU · Jun 11, 2026 · arXiv:2606.13222

Proprioceptive-visual correspondence enables self-other distinction in humanoid robots

Yurun Chen, Tianyuan Gao, Yizhong Ge, Shikun Ban, Yizhou Wang, Hongkai Xiong, Wenjun Zeng, Wentao Zhu

AI Summary

This study shows that a humanoid robot can learn self-other distinction from proprioceptive-visual correspondence, without identity labels or kinematic models. Once established, the distinction bootstraps a predictive self-model that maps joint configurations to three-dimensional body occupancy. In multi-agent scenes involving humans or morphologically identical robots, the system identifies itself and the learned self-model supports downstream tasks including target reaching, collision-aware motion planning, and human-to-robot motion retargeting.

Key Contribution

A humanoid robot can learn to distinguish itself from other agents using proprioceptive-visual correspondence alone, without identity labels or kinematic models, and this distinction bootstraps a predictive 3D self-model.

Abstract

Distinguishing self from others is a prerequisite for social intelligence, yet humanoid robots that increasingly share workspaces with humans still lack this ability. Here we show that a humanoid robot can learn self-other distinction from proprioceptive-visual correspondence, without any identity labels or kinematic models. Once established, this distinction bootstraps a predictive self-model that maps joint configurations to three-dimensional body occupancy, capturing how the robot's body changes with action. In multi-agent scenes involving humans or morphologically identical robots, the system reliably identifies itself, learns a 3D self-model, and supports downstream tasks including target reaching, collision-aware motion planning, and human-to-robot motion retargeting. Together, these results outline a route toward bodily self-representation in robots that act and coordinate alongside others in shared physical environments. Project page: https://euron-zc.github.io/humanoid-self-model/.
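The abstract does not spell out how the correspondence is computed, so the following is only a toy sketch of the general idea, not the authors' method. It assumes the robot has a per-frame joint-speed signal from proprioception and a per-frame visual motion-energy signal for each agent in view, and it picks as "self" the agent whose visual motion correlates best with the robot's own joint speed. The names `pearson`, `identify_self`, `joint_speed`, and `visual_motion` are hypothetical, and the data is synthetic.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def identify_self(joint_speed, visual_motion_by_agent):
    """Return (agent_id, scores) for the agent whose visual motion best tracks joint speed."""
    scores = {
        agent: pearson(joint_speed, motion)
        for agent, motion in visual_motion_by_agent.items()
    }
    return max(scores, key=scores.get), scores


# Synthetic demo (assumed data): agent_b moves in step with the robot's joints,
# agent_a moves on an unrelated rhythm.
rng = random.Random(0)
joint_speed = [abs(math.sin(0.3 * t)) for t in range(200)]
visual_motion = {
    "agent_a": [abs(math.cos(0.17 * t)) + rng.gauss(0, 0.05) for t in range(200)],
    "agent_b": [0.8 * s + rng.gauss(0, 0.05) for s in joint_speed],
}

self_id, scores = identify_self(joint_speed, visual_motion)
print(self_id, {k: round(v, 3) for k, v in scores.items()})
```

A correlation score of this kind needs no identity labels or kinematic model, which is the property the abstract emphasizes; a real system would work on images and full joint states rather than scalar signals.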

Multimodal Models, Robotics & Embodied AI

Citation Metrics

Citations: 0
Influential citations: 0
References: 37
Year: 2026
Venue: N/A
