Multimodal large language models (MLLMs) can learn to reason more faithfully by explicitly anchoring visual attention to relevant image regions and by using counterfactual interventions to reinforce reliance on that evidence during reasoning.