Apr 5, 2026 · arXiv:2604.04138

Learning Dexterous Grasping from Sparse Taxonomy Guidance

Juhan Park, Taerim Yoon, Seungmin Kim, Joonggil Kim, Wontae Ye, Jeongeun Park, Yoonbyung Chai, Geonwoo Cho, Geunwoo Cho, Dohyeong Kim, Kyungjae Lee, Yongjae Kim, Sungjoon Choi

AI Summary

The paper introduces GRIT, a two-stage framework for learning dexterous manipulation. It first predicts a taxonomy-based grasp specification from scene and task context, then generates continuous finger motions conditioned on this sparse command. The authors find that certain grasp taxonomies are more effective for specific object geometries; exploiting this relationship improves generalization to novel objects over baselines and yields an overall success rate of 87.9%. Real-world experiments demonstrate controllability: grasp strategies can be adjusted through high-level taxonomy selection.

Key Contribution

Sparse taxonomy guidance replaces dense pose or contact targets for dexterous manipulation, improving generalization to novel objects and letting users steer the grasp strategy through high-level taxonomy selection.

Abstract

Dexterous manipulation requires planning a grasp configuration suited to the object and task, which is then executed through coordinated multi-finger control. However, specifying grasp plans with dense pose or contact targets for every object and task is impractical. Meanwhile, end-to-end reinforcement learning from task rewards alone lacks controllability, making it difficult for users to intervene when failures occur. To this end, we present GRIT, a two-stage framework that learns dexterous control from sparse taxonomy guidance. GRIT first predicts a taxonomy-based grasp specification from the scene and task context. Conditioned on this sparse command, a policy generates continuous finger motions that accomplish the task while preserving the intended grasp structure. Our results show that certain grasp taxonomies are more effective for specific object geometries. By leveraging this relationship, GRIT improves generalization to novel objects over baselines and achieves an overall success rate of 87.9%. Moreover, real-world experiments demonstrate controllability, enabling grasp strategies to be adjusted through high-level taxonomy selection based on object geometry and task intent.
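The two-stage structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the grasp labels in `GraspType`, the `Scene` fields, the hand-written selection rule, and the numeric targets are all hypothetical stand-ins for the learned components. It only shows how a sparse taxonomy command could sit between a predictor and a finger-level policy, and how a user could override that command.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional, Tuple


class GraspType(Enum):
    # Hypothetical taxonomy labels; the paper's actual label set is not listed here.
    POWER = "power"
    PINCH = "pinch"
    LATERAL = "lateral"


@dataclass
class Scene:
    object_width_m: float  # illustrative geometric feature (assumption)
    task: str              # illustrative task label, e.g. "lift" (assumption)


def predict_grasp(scene: Scene) -> GraspType:
    """Stage 1 stand-in: map scene and task context to a sparse taxonomy command.

    A hand-written rule replaces the learned predictor purely for illustration.
    """
    if scene.object_width_m < 0.03:
        return GraspType.PINCH
    if scene.task == "turn":
        return GraspType.LATERAL
    return GraspType.POWER


def finger_policy(grasp: GraspType, joint_pos: List[float]) -> List[float]:
    """Stage 2 stand-in: continuous joint targets conditioned on the command."""
    # Made-up closure targets per grasp type; a learned policy would go here.
    target = {
        GraspType.POWER: 1.0,
        GraspType.PINCH: 0.4,
        GraspType.LATERAL: 0.7,
    }[grasp]
    # Move each joint a fixed fraction of the way toward its target.
    return [q + 0.1 * (target - q) for q in joint_pos]


def step(
    scene: Scene,
    joint_pos: List[float],
    override: Optional[GraspType] = None,
) -> Tuple[GraspType, List[float]]:
    """Run one control step; `override` lets a user pick the grasp type directly."""
    grasp = override if override is not None else predict_grasp(scene)
    return grasp, finger_policy(grasp, joint_pos)
```

Passing `override=GraspType.POWER` to `step` mirrors the controllability idea in the abstract: the low-level policy stays the same, and only the high-level taxonomy command changes.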

RLHF & Preference Learning, Robotics & Embodied AI, World Models & Planning

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
