NJU-LINK Team, Nanjing University
Current video editing models falter under the weight of complex user instructions, often omitting critical edits and introducing artifacts.
Open-weight Omni models struggle with binding accuracy, reaching only 41.55% on a new counterfactual benchmark, which points to a critical gap in long-video comprehension.