ETH Zurich
Multimodal LLMs often perform worse when given more modalities because they struggle to jointly recognize and reason across them, a problem that simple prompting strategies can solve.