Fudan University
Interactive world models still have a long way to go: a comprehensive benchmark reveals that even state-of-the-art models struggle to consistently perform well across video quality, interaction adherence, and physics compliance.
MLLMs can now segment emerging entities with significantly improved accuracy thanks to ROSE, a retrieval-augmented framework that boosts performance by 19.2 gIoU over a Gemini-2.0 Flash baseline.
Current video object removal methods leave distracting visual artifacts behind; EffectErase addresses this by jointly removing objects and the visual effects they cause.