The YT-NTU-AVQ dataset is 10x larger than previous AVQA datasets, giving multimodal perception models a larger and more diverse basis for training and evaluation.