Zhibo Yang

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Multimodal Models (2), Reasoning & Chain-of-Thought (2), Code Generation & Program Synthesis (1), Computer Vision (1)

Frequent co-authors

Mingkun Yang (2), Tongkun Guan (1), Jianqiang Wan (1), Ming-Hsuan Yang (1)

Papers (2)

Mar 11, 2026

5d ago

CodePercept: Code-Grounded Visual STEM Perception for MLLMs

Forget scaling reasoning – this work shows that scaling visual perception using code-grounded data is the real key to unlocking MLLMs' STEM abilities.

Tongkun Guan, Zhibo Yang, Jianqiang Wan +13

Code Generation & Program Synthesis, Multimodal Models, Reasoning & Chain-of-Thought

Mar 4, 2026

Tsinghua AI

1w ago

From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning

Multimodal models are often blind at birth: a new "Visual Attention Score" reveals they struggle to focus on visual inputs during cold-start, but a simple attention-guided fix can boost performance by 7%.

Chufan Shi, Yizhen Zhang, Ruizhe Chen +3

Computer Vision, Interpretability & Mechanistic Interp, Multimodal Models, +1
