Text-to-video models can now learn geometrically consistent world dynamics via reinforcement learning, without costly architectural changes.
Counterintuitively, VLMs can achieve higher VQA accuracy when visual inputs are intentionally degraded, suggesting that high-resolution detail can act as noise that hinders reasoning.
Generative video models can now simulate a continuously evolving world, even when objects are out of sight, thanks to a new framework that maintains persistent global state.