Illinois Institute of Technology | Apr 14, 2026 | arXiv:2604.13001

XRZero-G0: Pushing the Frontier of Dexterous Robotic Manipulation with Interfaces, Quality and Ratios

James Wang, Junming Wang, Teng Pu, Primo Pu, Wing-Shing Fung, Wingmun Fung, Zephyr Fung, Jindong Wang, Jin-Du Wang, Alex Wang, Sam Wang, Shanchang Wang, Yuan Deng, Bender Deng, Shuyuan Wang, Zivid Liu, Ziwei Liu, Chris Pan, Kunhao Pan, Ping Yang, Panda Yang, Peng Zhai, Andy Zhai, Pengxiang Zhai, Lucy Liang, Yuxin Liang, Xiaofang Li, Xiaofan Li, Shalfun Li, Jiabi Sun, Johnny Sun, Jacky Xu, Renchao Xu, Xiaotian Tian, Will Tian, Pengfei Yan, Kai Yan, Kohler Ye, Guo Ye, Guoqiang Ye, Scott Li, Liang Li, Qian Wang, Ruyi Gan, Roy Gan, Hao Wang

AI Summary

The authors introduce XRZero-G0, a hardware-software co-designed system for collecting high-quality, action-aligned robot manipulation data using a VR interface with a top-view camera and dual specialized grippers. They propose a closed-loop collection, inspection, training, and evaluation pipeline to ensure data reliability, achieving an 85% data validity rate. Experiments indicate that combining a small amount of real-robot data with large-scale robot-free data (roughly ten parts robot-free to one part real, the 10:1 ratio cited in the abstract) achieves performance comparable to exclusively real-robot datasets while reducing acquisition costs by a factor of 20.

Key Contribution

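Manipulation policies trained mostly on robot-free demonstrations collected through a VR interface, plus a small amount of real-robot data, can perform comparably to policies trained only on real-robot data, at roughly 1/20th of the acquisition cost.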

Abstract

The acquisition of high-quality, action-aligned demonstration data remains a fundamental bottleneck in scaling foundation models for dexterous robot manipulation. Although robot-free human demonstrations (e.g., the UMI paradigm) offer a scalable alternative to traditional teleoperation, current systems are constrained by sub-optimal hardware ergonomics, open-loop workflows, and a lack of systematic data-mixing strategies. To address these limitations, we present XRZero-G0, a hardware-software co-designed system for embodied data collection and policy learning. The system features an ergonomic, virtual reality interface equipped with a top-view camera and dual specialized grippers to directly improve collection efficiency. To ensure dataset reliability, we propose a closed-loop collection, inspection, training, and evaluation pipeline for non-proprioceptive data. This workflow achieves an 85% data validity rate and establishes a transparent mechanism for quality control. Furthermore, we investigate the empirical scaling behaviors and optimal mixing ratios of robot-free data. Extensive experiments indicate that combining a minimal volume of real-robot data with large-scale robot-free data (e.g., a 10:1 ratio) achieves performance comparable to exclusively real-robot datasets, while reducing acquisition costs by a factor of twenty. Utilizing XRZero-G0, we construct a 2,000-hour robot-free dataset that enables zero-shot cross-embodiment transfer to a target physical robot, demonstrating a highly scalable methodology for generalized real-world manipulation.

Our project repository: https://github.com/X-Square-Robot/XRZero-G0
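To make the mixing-ratio idea concrete, the following is a minimal sketch of how a training set might be assembled at a fixed ratio. It is not the authors' pipeline: the function name `mix_episodes`, the parameter `robot_free_per_real`, and the tuple-based episode records are all hypothetical, and the sketch assumes the 10:1 ratio means ten robot-free episodes for every real-robot episode.

```python
import random


def mix_episodes(real, robot_free, robot_free_per_real=10, seed=0):
    """Combine real-robot and robot-free episodes at a fixed ratio.

    Hypothetical helper: draws `robot_free_per_real` robot-free episodes
    per real episode (capped by availability), then shuffles the result.
    """
    rng = random.Random(seed)
    n_free = min(len(robot_free), robot_free_per_real * len(real))
    mixed = list(real) + rng.sample(list(robot_free), n_free)
    rng.shuffle(mixed)
    return mixed


# Toy episode records: (source, index). Real data would carry observations and actions.
real = [("real", i) for i in range(5)]
free = [("robot_free", i) for i in range(200)]

mixed = mix_episodes(real, free)
counts = {"real": 0, "robot_free": 0}
for source, _ in mixed:
    counts[source] += 1
print(counts)
```

Sampling without replacement keeps each robot-free episode unique within the mix; a training loop that instead weights the two sources per batch would be an equally plausible reading of the ratio.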

Data Curation & Synthetic Data | Robotics & Embodied AI | Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 21

Year: 2026

Venue: N/A
