This paper introduces GraspGen-X, an approach to 6-DOF robot grasping that generalizes across embodiments, covering diverse gripper morphologies and physical grasping processes. The method conditions a diffusion model on a swept-volume heuristic representation of the gripper and is trained on procedural grippers with a large-scale dataset of 2 billion grasps. In simulation, it achieves better zero-shot generalization to novel real-world grippers and objects than baseline methods. The results indicate that the model handles new gripper designs as well as novel objects, and that it can also serve as an initialization for fine-tuning on a new gripper.
Zero-shot generalization across gripper designs would allow a single grasping model to be used with new robot hands without retraining for each one.
We study cross-embodiment 6-DOF robot grasping. Unlike prior work, we require the model to generalize not only to novel objects and scenes but also to novel gripper morphologies and physical grasping processes. Our method extends diffusion-model-based generative 6-DOF grasping models by conditioning them on an additional gripper representation. We propose a swept-volume heuristic for encoding the gripper. We train our cross-embodiment model with procedural grippers and a large-scale dataset of 2 billion grasps. In simulation experiments, our model achieves the best zero-shot generalization to novel real-world grippers and objects among the methods compared. Our model also serves as a good initialization for fine-tuning to adapt to novel grippers. In ablations, we demonstrate the efficiency of our swept-volume gripper representation and our procedural gripper training dataset. Finally, we show zero-shot generalization to novel real-world grippers for 6-DOF grasping, surpassing baselines in cross-embodiment generalization.
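The abstract does not spell out how the swept-volume heuristic is computed, so the following is only an illustrative sketch of the general idea: describe a gripper by the region its fingers pass through while closing, and hand that geometry to the model as conditioning. Everything here is an assumption made for illustration, including the function name `swept_volume_points`, the box-shaped parallel-jaw finger model, and the choice of a uniformly sampled point cloud as the output; none of it is taken from the paper.

```python
import numpy as np


def swept_volume_points(finger_size, opening_width, n_points=256, seed=0):
    """Sample points from the volume swept by a closing parallel-jaw gripper.

    Hypothetical model (not from the paper): two box-shaped fingers of size
    (thickness_x, depth_y, length_z) start with their inner faces at
    x = +/- opening_width / 2 and translate along x until they meet at x = 0.
    """
    rng = np.random.default_rng(seed)
    thickness, depth, length = finger_size

    # The union of all finger positions during closing is a single box that
    # spans the full opening plus one finger thickness on each side.
    half_extent = np.array(
        [opening_width / 2 + thickness, depth / 2, length / 2]
    )
    points = rng.uniform(-half_extent, half_extent, size=(n_points, 3))

    # Shift along z so that z = 0 is the finger base and fingers extend to +z.
    points[:, 2] += length / 2
    return points


if __name__ == "__main__":
    # Example: 1 cm thick, 2 cm deep, 5 cm long fingers with an 8 cm opening.
    cloud = swept_volume_points((0.01, 0.02, 0.05), opening_width=0.08)
    print(cloud.shape)
```

A point cloud like this could be passed through a point encoder to produce a conditioning vector for a grasp diffusion model. Grippers with other kinematics (for example, angular or multi-finger closing) would need a different sweep than the simple translation assumed here.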