Existing multimodal models struggle with multi-image reasoning; a new benchmark exposes these shortcomings, and an inference-time attention fix alleviates them.