Search papers, labs, and topics across Lattice.
2
0
4
0
Decoupling the "Thinker" from the "Editor" in image editing allows targeted optimization of reasoning, leading to performance competitive with strong proprietary models using a fixed generative model.
Achieve near-dense Video-LLM performance on long videos with up to 57% fewer FLOPs by adaptively selecting which video cubes and tokens to process.