Search papers, labs, and topics across Lattice.
2
0
4
7
Grounding boosts spatial reasoning in VLMs: explicitly linking language to 2D and 3D scene elements lets models decompose complex spatial problems and improve performance even on non-grounded tasks.
Current video object removal methods leave distracting visual artifacts behind, but EffectErase tackles this problem head-on by jointly removing objects and their pesky visual effects.