Search papers, labs, and topics across Lattice.
1
0
3
Visual language models can now explicitly reason about object trajectories in videos, thanks to a simple yet effective method that augments training data and uses discrete motion tags.