Yifu Chen

VoxMind drastically improves task completion rates in spoken dialogue agents, jumping from 34.88% to 74.57%, even surpassing Gemini-2.5-Pro, by integrating "Think-before-Speak" reasoning and asynchronous tool management.

Tianle Liang, Yifu Chen, Shengpeng Ji +7

Natural Language Processing Speech & Audio Tool Use & Agents

Apr 16, 2026

Yifu Chen +6Apr 16, 2026

Dual-Axis Generative Reward Model Toward Semantic and Turn-taking Robustness in Interactive Spoken Dialogue Models

Fine-grained reward signals for semantic quality and interaction timing unlock more human-like spoken dialogue models.

Yifu Chen, Zhengqing Liu, Wen Wang +4

Natural Language Processing RLHF & Preference Learning Speech & Audio

Yifu Chen +12Apr 16, 2026·also ZJU

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Reinforcement learning can now be practically applied to spoken dialogue models thanks to a new post-training recipe that disentangles semantic and acoustic improvements.

Yifu Chen, Shengpeng Ji, Qian Chen +10

Natural Language Processing RLHF & Preference Learning Speech & Audio

Mar 16, 2026

Mar 16, 2026·also CUHK

Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness

Current reward models for spoken dialogue systems are missing crucial paralinguistic and natural speech elements, but this new model closes the gap by operating directly on speech and outperforming existing audio LLMs.

Yuhan Wang, Fan Zhuo, Xize Cheng +6

Natural Language Processing RLHF & Preference Learning Speech & Audio

Feb 12, 2026

Feb 12, 2026·also CAS

WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models

WavBench exposes the limitations of current spoken dialogue models in handling real-world conversational nuances like colloquialisms and paralinguistics, despite advances in reasoning capabilities.

Yangzhuo Li, Yifu Chen, Haorong Ying +3

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought Speech & Audio

Search

Yifu Chen

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (6)