Mingkun Yang

Qwen Team, Alibaba Group; University of California San Diego; Zhejiang University; Shanghai Jiao Tong University. Equal contribution. Corresponding author. yang.yujiu@sz.tsinghua.edu.cn

Tsinghua AI

Papers on Lattice

Total citations

Topics

Research focus

Multimodal Models (2) · Reasoning & Chain-of-Thought (2) · Code Generation & Program Synthesis (1) · Computer Vision (1)

Frequent co-authors

Tongkun Guan (1) · Jianqiang Wan (1) · Ming-Hsuan Yang (1) · Zhengtao Guo (1)

Papers (2)

Tsinghua AI · Mar 11, 2026 · also DAMO, NanKai University, NJU, Scale +1

CodePercept: Code-Grounded Visual STEM Perception for MLLMs

Forget scaling reasoning – this work shows that scaling visual perception using code-grounded data is the real key to unlocking MLLMs' STEM abilities.

Tongkun Guan, Jianqiang Wan, Mingkun Yang +12

Code Generation & Program Synthesis · Multimodal Models · Reasoning & Chain-of-Thought

Tsinghua AI · Mar 4, 2026 · also DAMO

From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning

Multimodal models are often blind at birth: a new "Visual Attention Score" reveals they struggle to focus on visual inputs during cold-start, but a simple attention-guided fix can boost performance by 7%.

Yizhen Zhang, Ruizhe Chen, Mingkun Yang +1

Computer Vision · Interpretability & Mechanistic Interp · Multimodal Models +1
