National Taiwan University
LALMs reveal their hidden biases when you let them generate freely from real human voices, and gender stereotypes are more pronounced than accent biases.
Speech-to-speech translation can now convey laughter and tears with human-like fidelity, thanks to a surprisingly data-efficient approach leveraging LoRA experts.
Text-only LLMs already contain surprisingly diverse levels of auditory knowledge, and this pre-existing knowledge strongly predicts their performance when adapted for audio-language tasks.
Speech quality assessment is skewed: male listeners consistently give higher scores than female listeners, and standard MOS models learn and perpetuate this bias.
Contrastive Decoding's gains for audio language models hinge on fixing specific error types, like uncertainty and audio absence, but don't expect it to magically repair flawed reasoning.
Audio watermarks can now survive neural resynthesis, thanks to a latent space embedding technique that resists semantic compression by modern audio codecs.
Overcome LALMs' struggles with localized dialectal prosody: a new Taiwanese audio-text dataset and fine-tuning strategy boost accuracy by 6.5% on the TAU Benchmark.