Yujiu Yang

No current MLLM can reliably issue timely safety warnings, with performance sharply varying across domains and a troubling trade-off between recall and false positives.

Yusong Zhao, Yuejin Xie, Youliang Yuan +4

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Mar 12, 2026

Tsinghua AIMar 12, 2026·also HKUST, SUSTech, UCF

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

Scaling up LLMs boosts combinatorial creativity in code generation, but plateaus on exploratory tasks, revealing a "convergence-by-scaling" effect where larger models become less divergent.

Zihu Wang, Zihan Wang, Lam Nguyen +6

Code Generation & Program Synthesis Eval Frameworks & Benchmarks

Search

Yujiu Yang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)