Tsinghua AIFudanShanghai Qi Zhi InstituteApr 12, 2026arXiv:2604.10579

AffordGen: Generating Diverse Demonstrations for Generalizable Object Manipulation with Afford Correspondence

Kaizhe Hu, Yingqian Huang, Yuanchen Ju, Zhengrong Xue, Huazhe Xu

AI Summary

AffordGen addresses the data diversity bottleneck in robot manipulation imitation learning by using 3D generative models and Vision Foundation Models to establish semantic correspondence of keypoints across 3D meshes, creating diverse manipulation trajectories. This affordance-aware dataset is then used to train a closed-loop visuomotor policy, improving both semantic generalizability and reactive robustness. Results demonstrate high success rates and zero-shot generalization to unseen objects in both simulation and real-world experiments, significantly improving data efficiency.

Key Contribution

Unlock zero-shot generalization in robot manipulation by generating diverse, affordance-aware training data with 3D generative models and Vision Foundation Models.

Abstract

Despite the recent success of modern imitation learning methods in robot manipulation, their performance is often constrained by geometric variations due to limited data diversity. Leveraging powerful 3D generative models and vision foundation models (VFMs), the proposed AffordGen framework overcomes this limitation by utilizing the semantic correspondence of meaningful keypoints across large-scale 3D meshes to generate new robot manipulation trajectories. This large-scale, affordance-aware dataset is then used to train a robust, closed-loop visuomotor policy, combining the semantic generalizability of affordances with the reactive robustness of end-to-end learning. Experiments in simulation and the real world show that policies trained with AffordGen achieve high success rates and enable zero-shot generalization to truly unseen objects, significantly improving data efficiency in robot learning.

Computer Vision Data Curation & Synthetic Data Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

AffordGen: Generating Diverse Demonstrations for Generalizable Object Manipulation with Afford Correspondence

Related Papers