Context-augmented RL lets smaller MLLMs punch *way* above their weight, rivaling much larger models on reasoning tasks while dodging reward hacking.
VLMs can gain up to a 39% boost in downstream reasoning by using translator-guided reinforcement learning to improve geometric perception, far outperforming standard supervised fine-tuning.
Cycle consistency unlocks state-of-the-art cross-view object correspondence in videos without ground-truth annotations, and even enables test-time training.