Mar 30, 2026arXiv:2603.28378

Membership Inference Attacks against Large Audio Language Models

Jiatang Dong, Jia-Kai Dong, Yu-Xiang Lin, Hung-Yi Lee

AI Summary

This paper presents a membership inference attack (MIA) evaluation of Large Audio Language Models (LALMs), highlighting the confounding effects of train/test distribution shifts in audio data. They establish a multi-modal blind baseline using textual, spectral, and prosodic features to account for these shifts and enable reliable MIA evaluation. Their experiments reveal that LALM memorization is cross-modal, stemming from the association of a speaker's vocal identity with their text.

Key Contribution

LALMs leak speaker identity by memorizing the link between voice and text, not just the content of speech.

Abstract

We present the first systematic Membership Inference Attack (MIA) evaluation of Large Audio Language Models (LALMs). As audio encodes non-semantic information, it induces severe train and test distribution shifts and can lead to spurious MIA performance. Using a multi-modal blind baseline based on textual, spectral, and prosodic features, we demonstrate that common speech datasets exhibit near-perfect train/test separability (AUC approximately 1.0) even without model inference, and the standard MIA scores strongly correlate with these blind acoustic artifacts (correlation greater than 0.7). Using this blind baseline, we identify that distribution-matched datasets enable reliable MIA evaluation without distribution shift confounds. We benchmark multiple MIA methods and conduct modality disentanglement experiments on these datasets. The results reveal that LALM memorization is cross-modal, arising only from binding a speaker's vocal identity with its text. These findings establish a principled standard for auditing LALMs beyond spurious correlations.

Natural Language Processing Red-Teaming & Adversarial Robustness Speech & Audio

Citation Metrics

Citations0

Influential citations0

References36

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Membership Inference Attacks against Large Audio Language Models

Related Papers