Multimodal LLMs primarily rely on language-unique information for final predictions, with visual information decaying across layers and cross-modal synergy remaining surprisingly low (under 2%).