Hung-Yi Lee

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Speech & Audio (3)Multimodal Models (2)Eval Frameworks & Benchmarks (2)Interpretability & Mechanistic Interp (1)

Frequent co-authors

Tsung-En Lin (1)Yun-Shao Tsai (1)Yi-Cheng Lin (1)Huang-Cheng Chou (1)

Papers (3)

Jun 9, 2026

1w ago

Steering Where to Listen: Instruction-Based Activation Steering Redirects Temporal Attention in Large Audio-Language Models

Instruction-based steering can redirect attention in LALMs to acoustically relevant regions, achieving over 60% overlap with ground-truth sound event locations without any training.

Tsung-En Lin, Hung-Yi Lee

Interpretability & Mechanistic Interp Multimodal Models Speech & Audio

Apr 29, 2026

Apr 29, 2026·also Gilbert AI Lab, USC

The False Resonance: A Critical Examination of Emotion Embedding Similarity for Speech Generation Evaluation

Widely used emotion embedding similarity metrics for speech generation are more sensitive to speaker and linguistic features than actual emotion, rendering them unreliable for evaluating emotional expressiveness.

Yun-Shao Tsai, Yi-Cheng Lin, Huang-Cheng Chou +8

Eval Frameworks & Benchmarks Natural Language Processing Speech & Audio

Apr 28, 2026

Chun-Yi Kuan +2Apr 28, 2026

Walking Through Uncertainty: An Empirical Study of Uncertainty Estimation for Audio-Aware Large Language Models

Semantic-level uncertainty estimation methods significantly enhance the reliability of audio-aware language models, outperforming traditional approaches in critical reasoning tasks.

Chun-Yi Kuan, Wei-Ping Huang, Hung-Yi Lee

Eval Frameworks & Benchmarks Multimodal Models Speech & Audio

Search

Hung-Yi Lee

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)