Feb 15, 2026arXiv:2602.14224

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Ruiyang Xu, Yinghao Ma, Chao-Han Huck Yang, Bohan Li, Jaeyeon Kim, Jin Xu, Jinyu Li, Carlos Busso, Eng Siong Chng

AI Summary

The Interspeech 2026 Audio Reasoning Challenge was organized to evaluate the quality of Chain-of-Thought (CoT) reasoning in Large Audio Language Models (LALMs), addressing their "black-box" nature. The challenge introduced MMAR-Rubrics, a new instance-level protocol for assessing the factuality and logic of reasoning chains in audio. Results from the challenge, which featured Single Model and Agent tracks with 156 teams, indicate that agent systems currently exhibit superior reasoning quality due to iterative tool orchestration and cross-modal analysis, while single models are rapidly improving through reinforcement learning and data pipeline advancements.

Key Contribution

Agent systems leveraging iterative tool orchestration and cross-modal analysis significantly outperform single models in audio reasoning, highlighting a promising path toward explainable audio intelligence.

Abstract

Recent Large Audio Language Models (LALMs) excel in understanding but often lack transparent reasoning. To address this "black-box" limitation, we organized the Audio Reasoning Challenge at Interspeech 2026, the first shared task dedicated to evaluating Chain-of-Thought (CoT) quality in the audio domain. The challenge introduced MMAR-Rubrics, a novel instance-level protocol assessing the factuality and logic of reasoning chains. Featured Single Model and Agent tracks, the competition attracting 156 teams from 18 countries and regions. Results show agent systems currently lead in reasoning quality, utilizing iterative tool orchestration and cross-modal analysis. Besides, single models are rapidly advancing via reinforcement learning and sophisticated data pipeline. We details the challenge design, methodology, and a comprehensive analysis of state-of-the-art systems, providing new insights for explainable audio intelligence.

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought Speech & Audio

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Related Papers