Search papers, labs, and topics across Lattice.
2
2
5
17
Achieve near-dense Video-LLM performance on long videos with up to 57% fewer FLOPs by adaptively selecting which video cubes and tokens to process.
Forget slow visual token concatenation: LaVi modulates LLM features directly with visual context, slashing FLOPs by 94% while boosting speed and accuracy.