Yang Xiao

Top systems in the ESDD2 challenge achieved a staggering Macro-F1 score of 0.8775, revealing the power of modular design and self-supervised learning in audio deepfake detection.

Xueping Zhang, Han Yin, Yang Xiao +2

Multimodal Models · Speech & Audio

Jun 9, 2026 · also Auckland, HKU, Monash, WHU

RAIL: Rethinking Auditory Intelligence in Large Audio-Language Models with a CHC-Grounded Benchmark

Current LALMs exhibit significant performance disparities across cognitive auditory capabilities, revealing a critical oversight in existing evaluation methods.

Hongyu Jin, Siyi Wang, Yang Xiao +7

Multimodal Models · Speech & Audio

Jun 4, 2026

Kejuan Yang +7 · Jun 4, 2026

UNIVID: Unified Vision-Language Model for Video Moderation

UNIVID cuts violation leakage by 42.7% while consolidating over 1,000 classifiers into a single, interpretable model for video moderation.

Kejuan Yang, Yizhuo Zhang, Mingyuan Du +5

Interpretability & Mechanistic Interp · Multimodal Models

Feb 15, 2026

Tsinghua AI · Feb 15, 2026 · also (Corresponding author: Rui Meng and Xiaodong, Huawei, HUST, PKU +4

LongCLI-Bench: A Preliminary Benchmark and Study for Long-horizon Agentic Programming in Command-Line Interfaces

Today's best AI agents fail at realistic software engineering tasks, stalling before even reaching 30% completion, revealing the urgent need for better long-horizon planning and human-AI collaboration.

Yukang Feng, Jian Sun, Ze Yang +20

Code Generation & Program Synthesis · Eval Frameworks & Benchmarks · Tool Use & Agents

Jan 2, 2026

Haonan Song +14 · Jan 2, 2026 · also China Academy of Space Technology, I, Shenzhen University of Advanced Technology

IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models

Pointwise reward models can finally compete with pairwise models in RLHF, thanks to a new intergroup comparison method that scales linearly with the number of candidates.

Haonan Song, Qingchen Xie, Huan Zhu +12
