University of Southern California
Smiling during traumatic recollection is not just a byproduct of distress: it actively enhances emotional recovery and narrative coherence.
Training speaker diarization models solely on adult speech leads to surprisingly poor performance on children and older adults, but a simple multi-age training strategy can fix it.
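As a concrete illustration of what a multi-age strategy might look like, the sketch below draws each training batch evenly from child, adult, and senior corpora instead of from adults alone. The file names and three-way age split are assumptions for illustration, not the paper's actual data pipeline.

```python
import random

# Hypothetical corpora, one list of utterance paths per age group.
CORPORA = {
    "child":  ["child_0001.wav", "child_0002.wav", "child_0003.wav"],
    "adult":  ["adult_0001.wav", "adult_0002.wav", "adult_0003.wav"],
    "senior": ["senior_0001.wav", "senior_0002.wav", "senior_0003.wav"],
}

def balanced_batch(batch_size: int = 6) -> list[str]:
    """Draw an equal number of utterances from each age group,
    so no single group dominates diarization training."""
    per_group = batch_size // len(CORPORA)
    batch = []
    for group, utterances in CORPORA.items():
        batch.extend(random.choices(utterances, k=per_group))
    random.shuffle(batch)
    return batch

print(balanced_batch())
```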
Acoustic and phonetic neural audio codecs (NACs) encode accent in fundamentally different ways, with implications for how we interpret and manipulate these representations.
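One common way to make such a representational claim measurable is a linear probe: fit the same simple classifier on embeddings from each codec type and compare how linearly accessible accent labels are. The sketch below runs on synthetic stand-in features; the dimensions, accent counts, and the direction of the gap are invented for illustration, not taken from the paper.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n, dim, n_accents = 600, 64, 4
labels = rng.integers(0, n_accents, size=n)

# Stand-ins for pooled utterance embeddings from two codec types.
# In a real probe these would come from the NAC encoders themselves.
acoustic = rng.normal(size=(n, dim)) + 0.05 * labels[:, None]  # accent weakly linear
phonetic = rng.normal(size=(n, dim)) + 2.00 * labels[:, None]  # accent strongly linear

for name, feats in [("acoustic", acoustic), ("phonetic", phonetic)]:
    X_tr, X_te, y_tr, y_te = train_test_split(feats, labels, random_state=0)
    probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    print(f"{name} probe accuracy: {probe.score(X_te, y_te):.2f}")
```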
You can reliably decode frustration from facial muscle activity, even when people aren't speaking aloud.
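A hedged sketch of what such a decoding pipeline could look like: window the facial EMG signal, extract per-channel RMS amplitude features, and cross-validate a classifier against frustration labels. The synthetic signal, channel count, and feature choice below are placeholders, not the study's protocol.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)
fs, win = 1000, 500                # 1 kHz EMG, 0.5 s non-overlapping windows

def rms_features(emg: np.ndarray) -> np.ndarray:
    """Root-mean-square amplitude per window and channel -> (windows, channels)."""
    n_win = emg.shape[1] // win
    trimmed = emg[:, : n_win * win].reshape(emg.shape[0], n_win, win)
    return np.sqrt((trimmed ** 2).mean(axis=2)).T

# Synthetic 4-channel EMG: "frustrated" segments carry higher muscle tone.
calm = rng.normal(0, 1.0, size=(4, 30 * fs))
frustrated = rng.normal(0, 1.6, size=(4, 30 * fs))

X = np.vstack([rms_features(calm), rms_features(frustrated)])
y = np.array([0] * 60 + [1] * 60)

clf = RandomForestClassifier(n_estimators=100, random_state=0)
print("CV accuracy:", cross_val_score(clf, X, y, cv=5).mean().round(2))
```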
Wearable sensors and speech AI can now unobtrusively reveal the hidden communication dynamics driving hospital caregiver workload and stress.
Speech tokenizers, despite being crucial for multimodal LLMs, primarily capture phonetic information, creating a mismatch with text-derived semantics that hinders downstream performance.
Achieve accent-specific speech synthesis without any accented training data by cleverly combining phonological rules with multilingual TTS.
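The core idea can be sketched as a rule rewrite over the phoneme string before synthesis: encode the target accent as substitution rules, apply them to a native-accent phoneme sequence, and hand the result to a multilingual TTS front end. The rules and the synthesize call below are illustrative placeholders, not the paper's rule set or API.

```python
# Toy phonological rules for a hypothetical target accent (IPA symbols):
# dental fricatives realized as stops, rhotic consonant dropped (simplified).
RULES = [
    ("θ", "t"),   # "think" -> "tink"
    ("ð", "d"),   # "this"  -> "dis"
    ("ɹ", ""),    # non-rhotic: delete /r/
]

def apply_accent_rules(phonemes: list[str]) -> list[str]:
    """Rewrite a phoneme sequence according to the accent's rules."""
    out = []
    for ph in phonemes:
        for src, dst in RULES:
            if ph == src:
                ph = dst
                break
        if ph:                      # empty string means deletion
            out.append(ph)
    return out

native = ["ð", "ɪ", "s", " ", "θ", "ɪ", "ŋ", "k", "s"]   # "this thinks"
accented = apply_accent_rules(native)
print(accented)
# A multilingual TTS model would then synthesize from `accented`, e.g.
# tts.synthesize(phonemes=accented, language="en")  # hypothetical API
```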
Control the accent of your TTS output without any accented training data by transferring accent characteristics from other languages.
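A minimal sketch of one way such cross-lingual transfer is often framed: treat accent as a conditioning vector learned from another language's speakers and blend it into the target voice embedding at synthesis time. The embeddings, dimensionality, and mixing weight below are invented for illustration, not the paper's method.

```python
import numpy as np

rng = np.random.default_rng(2)

# Placeholder conditioning vectors a TTS model might expose.
english_voice = rng.normal(size=128)   # target speaker, native accent
french_accent = rng.normal(size=128)   # accent vector learned from French data

def mix_accent(voice: np.ndarray, accent: np.ndarray, alpha: float) -> np.ndarray:
    """Blend an accent vector into a voice embedding; alpha controls strength."""
    return (1 - alpha) * voice + alpha * accent

conditioned = mix_accent(english_voice, french_accent, alpha=0.3)
# The blended embedding would condition the decoder, e.g.
# audio = tts.synthesize(text="Hello", embedding=conditioned)  # hypothetical
print(conditioned[:5])
```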
A new multimodal dataset links brain activity, muscle activation, and articulation in speech, opening doors to understanding the causal chain of speech production.