Search papers, labs, and topics across Lattice.
2
0
4
8
Robots can now learn complex manipulation tasks directly from human demonstrations using only a pair of smart glasses, achieving zero-shot transfer without specialized hardware.
Current MLLMs are surprisingly bad at understanding human intent in egocentric videos at a step-by-step level, achieving only 33% accuracy on a new benchmark designed to prevent future-frame leakage.