FudanHKUShanghai InnovationFeb 9, 2026arXiv:2602.09021

$\chi_{0}$: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies

Checheng Yu, Chonghao Sima, Gangcheng Jiang, Hai Zhang, Haoguang Mai, Hongyang Li, Huijie Wang, Jin Chen, Kaiyang Wu, Li Chen, Lirui Zhao, Modi Shi, Ping Luo, Qingwen Bu, Shijia Peng, Tianyu Li, Yibo Yuan

AI Summary

The paper introduces $\chi_{0}$, a resource-efficient framework designed to improve the robustness of long-horizon robotic manipulation by addressing distributional shifts between demonstration, policy learning, and execution. $\chi_{0}$ tackles these inconsistencies using model arithmetic for demonstration diversity, stage-aware advantage estimation for stable training signals, and train-deploy alignment techniques like spatio-temporal augmentation and DAgger corrections. Experiments on dual-arm garment manipulation show $\chi_{0}$ achieves a 250% improvement in success rate over $\pi_{0.5}$ while using less data and compute.

Key Contribution

Achieve 2.5x higher success in long-horizon robotic manipulation with 90% less data and compute by explicitly aligning training and deployment distributions.

Abstract

High-reliability long-horizon robotic manipulation has traditionally relied on large-scale data and compute to understand complex real-world dynamics. However, we identify that the primary bottleneck to real-world robustness is not resource scale alone, but the distributional shift among the human demonstration distribution, the inductive bias learned by the policy, and the test-time execution distribution -- a systematic inconsistency that causes compounding errors in multi-stage tasks. To mitigate these inconsistencies, we propose $\chi_{0}$, a resource-efficient framework with effective modules designated to achieve production-level robustness in robotic manipulation. Our approach builds off three technical pillars: (i) Model Arithmetic, a weight-space merging strategy that efficiently soaks up diverse distributions of different demonstrations, varying from object appearance to state variations; (ii) Stage Advantage, a stage-aware advantage estimator that provides stable, dense progress signals, overcoming the numerical instability of prior non-stage approaches; and (iii) Train-Deploy Alignment, which bridges the distribution gap via spatio-temporal augmentation, heuristic DAgger corrections, and temporal chunk-wise smoothing. $\chi_{0}$ enables two sets of dual-arm robots to collaboratively orchestrate long-horizon garment manipulation, spanning tasks from flattening, folding, to hanging different clothes. Our method exhibits high-reliability autonomy; we are able to run the system from arbitrary initial state for consecutive 24 hours non-stop. Experiments validate that $\chi_{0}$ surpasses the state-of-the-art $\pi_{0.5}$ in success rate by nearly 250%, with only 20-hour data and 8 A100 GPUs. Code, data and models will be released to facilitate the community.

Robotics & Embodied AI Training Efficiency & Optimization World Models & Planning

Citation Metrics

Citations1

Influential citations0

References95

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

$\chi_{0}$: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies

Related Papers