Search papers, labs, and topics across Lattice.
1
0
3
4
VLLMs can be made much faster without sacrificing accuracy by intelligently merging redundant tokens across space and time using optimal transport.