Search papers, labs, and topics across Lattice.
1
0
3
Compressing KV caches with multi-granularity representations boosts long video QA accuracy without sacrificing memory or speed.