Search papers, labs, and topics across Lattice.
IRT.
1
0
3
2
Current multimodal benchmarks are full of single-modality shortcuts, but this paper offers a way to prune them, yielding more reliable and efficient evaluations of true cross-modal reasoning.