Search papers, labs, and topics across Lattice.
1
3
Multimodal LLMs sometimes perform *worse* on chemistry problems when given images, revealing surprising failures in vision-language fusion.