Search papers, labs, and topics across Lattice.
University of Chinese Academy of Sciences
1
0
3
MLLMs can learn to reason more faithfully by explicitly anchoring visual attention to relevant image regions and reinforcing the use of that evidence during reasoning via counterfactual interventions.