Search papers, labs, and topics across Lattice.
2
0
3
Video-ToC drastically improves video understanding by forcing Video LLMs to focus on relevant visual cues, leading to state-of-the-art performance and reduced hallucinations.
Amodal SAM unlocks zero-shot amodal segmentation by adapting the Segment Anything Model (SAM) to infer complete object shapes, even in the presence of occlusions.