Search papers, labs, and topics across Lattice.
1
0
3
5
A single visual backbone can now handle perception, reconstruction, and action in streaming video, rivaling specialized models without task-specific fine-tuning.