Search papers, labs, and topics across Lattice.
The paper introduces the Cross-Lingual Transfer Matrix (CLTM) to systematically quantify cross-lingual transfer in paralinguistic speech tasks. They fine-tuned a multilingual HuBERT-based encoder on gender identification and speaker verification tasks, evaluating the impact of different donor languages on target language performance. The CLTM analysis revealed distinct transfer patterns across languages and tasks, demonstrating that paralinguistic tasks exhibit language-dependent effects despite relying on extralinguistic cues.
Paralinguistic speech tasks aren't as language-agnostic as we thought: cross-lingual transfer patterns reveal systematic language dependencies.
Paralinguistic speech tasks are often considered relatively language-agnostic, as they rely on extralinguistic acoustic cues rather than lexical content. However, prior studies report performance degradation under cross-lingual conditions, indicating non-negligible language dependence. Still, these studies typically focus on isolated language pairs or task-specific settings, limiting comparability and preventing a systematic assessment of task-level language dependence. We introduce the Cross-Lingual Transfer Matrix (CLTM), a systematic method to quantify cross-lingual interactions between pairs of languages within a given task. We apply the CLTM to two paralinguistic tasks, gender identification and speaker verification, using a multilingual HuBERT-based encoder, to analyze how donor-language data affects target-language performance during fine-tuning. Our results reveal distinct transfer patterns across tasks and languages, reflecting systematic, language-dependent effects.