The paper introduces Audio Hallucination Attacks (AHA), a suite of query-based and audio-based attacks designed to probe the reliability of Large Audio Language Models (LALMs) by inducing hallucinations. Evaluating state-of-the-art LALMs such as Audio Flamingo 3 and Gemini 3 Pro on AHA-Eval, the authors find attack success rates as high as 95%, indicating a significant reliability gap. To address this, they propose AHA-Guard, a post-alignment dataset that reduces attack success rates by up to 49%.
State-of-the-art Large Audio Language Models are surprisingly vulnerable to hallucination attacks, with success rates as high as 95%, revealing a critical reliability gap masked by standard benchmarks.
Large Audio Language Models (LALMs) achieve strong performance on audio-language tasks; however, their reliability in real-world settings remains underexplored. We introduce Audio Hallucination Attacks (AHA) and an accompanying evaluation suite, AHA-Eval, comprising 6.5K QA pairs designed to test whether LALMs genuinely ground their responses in the audio input. AHA targets two attack surfaces: (i) query-based attacks, which exploit question structure to induce hallucinations about absent sounds, and (ii) audio-based attacks, which inject synthetic speech describing non-existent events into the audio stream. Evaluating state-of-the-art LALMs, including Audio Flamingo 3 and Gemini 3 Pro, we observe high attack success rates of 95.35% and 79.65%, respectively, revealing a reliability gap that is hidden by standard benchmark performance. To mitigate this, we propose AHA-Guard, a 120K QA post-alignment dataset that reduces attack success rates by up to 49%.
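To make the audio-based attack surface concrete, the sketch below mixes a synthetic-speech claim about a non-existent event into an otherwise benign recording, which would then be paired with a question about that event. The function name, offset, mixing gain, and the stand-in waveforms are illustrative assumptions, not the paper's exact pipeline.

```python
# Minimal sketch of an audio-based hallucination attack, assuming the adversary
# overlays TTS audio asserting a fabricated event onto a clean clip.
import numpy as np


def inject_spoken_claim(clean: np.ndarray, spoken_claim: np.ndarray,
                        sr: int, offset_s: float = 1.0, gain: float = 0.5) -> np.ndarray:
    """Mix a synthetic speech waveform (e.g. TTS of "a dog barks loudly")
    into the clean recording at a chosen offset."""
    out = clean.copy()
    start = int(offset_s * sr)
    end = min(start + len(spoken_claim), len(out))
    out[start:end] += gain * spoken_claim[: end - start]
    # Keep samples in [-1, 1] so the adversarial clip remains a valid waveform.
    return np.clip(out, -1.0, 1.0)


if __name__ == "__main__":
    sr = 16_000
    clean = 0.1 * np.random.randn(10 * sr)                            # stand-in for a real recording
    spoken = 0.3 * np.sin(2 * np.pi * 220 * np.arange(2 * sr) / sr)   # stand-in for TTS speech
    adversarial = inject_spoken_claim(clean, spoken, sr)
    # The LALM would then be asked about the fabricated event, e.g.
    # "What was the dog doing in this recording?"
```

A query-based attack, by contrast, leaves the audio untouched and instead phrases the question so that it presupposes a sound that never occurs in the clip.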