Search papers, labs, and topics across Lattice.
Tencent Robotics X, Futian Laboratory
2
0
4
Robots can now focus on the *right* body parts for interaction, thanks to a new vision-language model that understands human motion commands and precisely localizes task-relevant 3D keypoints.
Pre-training on universal 3D poses lets robots learn new tasks from just 100 demonstrations, sidestepping the usual VLA efficiency bottleneck.