Gaoling School of Artificial Intelligence, Renmin University of China
LLMs can be sped up by over 2x without sacrificing accuracy by compressing the input and predicting multiple output tokens at once within a unified framework.
By jointly training a keyframe sampler with an MLLM, MSJoE achieves state-of-the-art accuracy in long-form video understanding while significantly reducing computational cost.
Stop paying for verbose overthinking: BFS-PO slashes LRM output length while simultaneously boosting accuracy.