Nanjing University, Shanghai AI Laboratory
Current LLMs and VLMs struggle with multi-step reasoning in long videos, often failing to maintain temporal coherence and procedural validity, as revealed by a new benchmark of hour-long narratives.