University of Science and Technology, State Key Laboratory of Cognitive Intelligence
Training with local visual cues can dramatically enhance MLLMs' ability to extract fine-grained visual details without altering their inference interface.