VLMs still struggle to combine visual and textual information for multi-hop reasoning, but a new automatically generated dataset, CRIT, can help them learn.