Search papers, labs, and topics across Lattice.
2
0
5
29
Models excel at recognizing actions in sports videos but falter dramatically when tasked with strategic reasoning, achieving only 5% accuracy in autonomous evidence integration.
Forget paired video-music training data: V2M-Zero aligns video and music by matching the *timing* of changes within each modality, not the content itself.