May 11 – May 18, 2026

Speech & Audio - Weekly Roundup

1 paper published across 0 labs.

3500% acceleration

Top Papers

May 18, 2026

1w ago·also Tencent AI, UT Austin

OmniPro: A Comprehensive Benchmark for Omni-Proactive Streaming Video Understanding

Current video understanding models struggle with long-horizon robustness and non-speech audio, as revealed by the new OmniPro benchmark designed for comprehensive omni-modal proactive evaluation.

Ruixiang Zhao, Jie Yang, Zijie Xin +4

Computer Vision Eval Frameworks & Benchmarks Multimodal Models+1

Search

Speech & Audio - Weekly Roundup

Top Papers