Search papers, labs, and topics across Lattice.
2
0
4
5
Squeezing the most out of your MLLM's visual budget is now possible: ResAdapt learns to allocate visual tokens intelligently *before* encoding, boosting performance by 15% while processing 16x more frames at the same cost.
MLLMs can now reason about streaming video with significantly improved accuracy and reduced output length thanks to a novel memory-anchored framework that overlaps watching and thinking.