Forget sparse sampling – this VLM digests 2x-4x more video frames by compressing each down to a single token.
LLMs can scalably annotate motion capture data to produce semantically rich descriptions of bimanual interactions, enabling higher-quality generation of dexterous hand motions.