Xin Li

University of Science and Technology of China & iFLYTEK Co., Ltd.

Research focus

Multimodal Models (3), Reasoning & Chain-of-Thought (2), Tool Use & Agents (2), Computer Vision (1)

Frequent co-authors

Hao Wang (3), Xuanzhao Dong (2), Xiwen Chen (2), Xiaobing Yu (2)

Papers (3)

May 27, 2026

Xuanzhao Dong +11 · May 27, 2026 · also Nubank, USTC

Mags-RL: Wearing Multimodal LLMs a Magnifying Glass via Agentic Reinforcement Learning For Complex Scene Reasoning

Mags-RL lets multimodal LLMs see the forest *and* the trees, using reinforcement learning to guide a super-resolution agent that selectively enhances image regions for improved reasoning without extra annotations.

Xuanzhao Dong, Peijie Qiu, Xiwen Chen +9

Multimodal Models, Reasoning & Chain-of-Thought, Tool Use & Agents

Xuanzhao Dong +11 · May 27, 2026 · also Nubank, USTC

OphIn-500K: Curating Web-Scale Visual Instructions for Scaling Ophthalmic Multimodal Large Language Models

Training on 500K automatically curated ophthalmology instructions lets a vision-language model leapfrog general medical models in a specialized domain.

Xuanzhao Dong, Xiwen Chen, Hao Wang +9

Computer Vision, Data Curation & Synthetic Data, Multimodal Models

May 27, 2026

MACReD: A Multi-Agent Collaborative Reasoning Framework for Reaction Diagram Parsing

Chemical reaction diagram parsing, a notoriously difficult task for vision-language models, sees a significant leap in performance thanks to a new multi-agent framework that enforces chemical consistency.

Chuang Tang, Yin Xu, Hao Wang +4

Multimodal Models, Reasoning & Chain-of-Thought, Tool Use & Agents
