Zheng Yuan

Medical-specific vision-language models surprisingly underutilize visual information in Japanese medical licensing exams, often performing well even when images are removed, highlighting a critical gap in their multimodal reasoning capabilities.

Yue Xun, Junyu Liu, Qian Niu +10

Computer Vision · Eval Frameworks & Benchmarks · Multimodal Models

Mar 4, 2026

Mar 4, 2026 · also Beihang, HIT, HKUST, Tencent AI

ErrorLLM: Modeling SQL Errors for Text-to-SQL Refinement

ErrorLLM tackles the challenge of refining LLM-generated SQL by explicitly modeling and detecting implicit semantic errors, leading to substantial improvements in text-to-SQL performance.

Zijin Hong, Zheng Yuan, Qing Liao +3

Code Generation & Program Synthesis · Eval Frameworks & Benchmarks · Natural Language Processing

Papers (3)