Jinyu Li

Unleashing LLMs' reasoning powers on speech unlocks a new ASR paradigm, slashing error rates by up to 17% simply by having the model "think" before transcribing.

Keqi Deng, Ruchao Fan, Bo Ren +1

Natural Language Processing · Reasoning & Chain-of-Thought · Speech & Audio

Feb 15, 2026

Ruiyang Xu +5 · Feb 15, 2026 · also CMU ML, NTU, SNU

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Agent systems leveraging iterative tool orchestration and cross-modal analysis significantly outperform single models in audio reasoning, highlighting a promising path toward explainable audio intelligence.

Ruiyang Xu, Yinghao Ma, Jaeyeon Kim +3

Eval Frameworks & Benchmarks · Reasoning & Chain-of-Thought · Speech & Audio

Jinyu Li

Papers (4)