Search papers, labs, and topics across Lattice.
Key Laboratory of Multimedia Trusted Perception and Efficient Computing
2
0
2
CausalMem achieves over 20x visual token compression while maintaining high accuracy in streaming video understanding, redefining memory efficiency in MLLMs.
AdaQ enables MLLMs to achieve superior long video understanding with just 64 frames, outperforming state-of-the-art methods by a striking margin.