Search papers, labs, and topics across Lattice.
1
0
3
8
MLLMs don't fuse vision and language uniformly: targeted interventions guided by layer-wise attention analysis can significantly boost multimodal reasoning without retraining.