Qian Chen

Papers on Lattice

Research focus

Speech & Audio (5) · Multimodal Models (3) · Natural Language Processing (1)

Frequent co-authors

Xiangang Li (3) · Huadai Liu (2) · Kaicheng Luo (2) · Wei Xue (2)

Papers (5)

Jun 22, 2026

DAMO · 3w ago · also HKUST

STAR-VAE: Structured Topology-Aware Regularization for Audio Reconstruction and Generation

STAR-VAE achieves state-of-the-art audio reconstruction fidelity by aligning latent space geometry with the hierarchical structure of audio signals.

Huadai Liu, Kaicheng Luo, Qian Chen +2

Speech & Audio

DAMO · 3w ago · also HKUST

AudioCALM: Continuous Autoregressive Language Modeling for Universal Audio Generation

AudioCALM achieves state-of-the-art performance in speech, sound, and music generation by seamlessly integrating diverse audio modalities into a single autoregressive framework.

Huadai Liu, Kaicheng Luo, Qian Chen +3

Multimodal Models · Speech & Audio

Jun 8, 2026

Jun 8, 2026 · also Anhui Province Key Laboratory of Digital Security, USTC

BareWave: Waveform-Native Flow-Matching Text-to-Speech

Achieving high-quality voice cloning without any intermediate representations could revolutionize text-to-speech synthesis.

Wei Fan, Chao-Hong Tan, Qian Chen +4

Speech & Audio

Jun 1, 2026

DAMO · Jun 1, 2026 · also USTC

UniVocal: Unified Speech-Singing Code-Switching Synthesis

Seamless transitions between speech and singing modes are now driven purely by text context, achieving state-of-the-art results in code-switching synthesis.

Yufei Shi, Qian Chen, Zhen-Hua Ling +1

Multimodal Models · Speech & Audio

May 6, 2026

Yangchen Yu +6 · May 6, 2026 · also Intelligent Interconnected Systems, NTU, SMU, University of Reading

To Fuse or to Drop? Dual-Path Learning for Resolving Modality Conflicts in Multimodal Emotion Recognition

Standard multimodal fusion can hurt performance in emotion recognition, but this new approach knows when to drop modalities, leading to state-of-the-art results.

Yangchen Yu, Qian Chen, Jia Li +4

Multimodal Models · Natural Language Processing · Speech & Audio
