Search papers, labs, and topics across Lattice.
2
0
5
6
Frequency domain analysis unlocks 1.59x speedups in Vision-Language-Navigation by enabling optimal token caching, a feat previously limited by visual domain approaches.
MLLMs can now reason about streaming video with significantly improved accuracy and reduced output length thanks to a novel memory-anchored framework that overlaps watching and thinking.