Zewen Ding

University of Science and Technology of China

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Computer Vision (1)Multimodal Models (1)

Frequent co-authors

Zhou Tao (1)Fang Zhang (1)Shida Wang (1)Xiaokun Sun (1)

Papers (1)

Jun 15, 2026

University of Science and Technology6d ago·also State Key Laboratory of Cognitive, USTC

LOCUS: Local Visual Cue Search for Enhancing Fine-Grained Perception in Multimodal Large Language Models

Training with local visual cues can dramatically enhance MLLMs' ability to extract fine-grained visual details without altering their inference interface.

Zhou Tao, Fang Zhang, Zewen Ding +5

Computer Vision Multimodal Models