Search papers, labs, and topics across Lattice.
1
0
2
Extracting action signals from 32,041 hours of human video enables CAIP to outperform leading vision encoders in robotic manipulation tasks by over 30%.