Search papers, labs, and topics across Lattice.
New York University
1
0
3
MLLMs are failing to visually track events in videos, performing only modestly above baseline despite strong results on other benchmarks.