HKUST · Apr 14, 2026 · arXiv:2604.12424

Decoding by Perturbation: Mitigating MLLM Hallucinations via Dynamic Textual Perturbation

AI Summary

This paper tackles hallucinations in Multimodal Large Language Models (MLLMs), arguing that they stem in part from visual grounding being overly sensitive to textual phrasing during decoding. The authors introduce Decoding by Perturbation (DeP), a training-free framework that dynamically probes the model with multi-level perturbations of the input text to elicit latent language priors, then suppresses those priors using attention variance and logit statistics. The reported experiments show that DeP reduces hallucinations and improves performance across multiple benchmarks by enhancing stable evidence regions and counteracting probability biases from textual co-occurrences.

Key Contribution

MLLM hallucinations aren't just about bad vision: language priors hijack visual grounding during decoding, and a little text perturbation can expose and counteract them.

Abstract

Multimodal Large Language Models frequently suffer from inference hallucinations, partially stemming from language priors dominating visual evidence. Existing training-free mitigation methods either perturb the visual representation and deviate from the natural image distribution, or enforce intrusive manipulations that compromise the model's inherent generative fluency. We introduce a novel perspective that multimodal hallucination manifests as the hypersensitivity of visual grounding to textual phrasing during the decoding phase. Building on this insight, we propose Decoding by Perturbation (DeP), a training-free framework mitigating prior-induced hallucinations via controlled textual interventions. DeP employs a dynamic probe applying multi-level textual perturbations to elicit latent language priors. Leveraging attention variance, it enhances stable evidence regions while suppressing suspicious noise in the feature space. Furthermore, it constructs an interpretable prior drift direction using logits statistics to counteract probability biases from textual co-occurrences. Extensive experiments confirm DeP effectively reduces hallucinations and achieves superior performance across multiple benchmarks.
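The abstract gives no formulas, so the following is only a minimal sketch of the two ideas it names, under stated assumptions rather than the authors' implementation. It assumes attention over visual tokens and next-token logits have already been collected for the original prompt and for K perturbed prompts. The function names `stability_weights` and `debias_logits`, the parameters `alpha` and `beta`, and the specific formulas (min-max normalized variance, mean logit shift as the drift direction) are all hypothetical choices made for illustration.

```python
import numpy as np

def stability_weights(attn, alpha=1.0):
    """Illustrative re-weighting of visual tokens by attention stability.

    attn: array of shape (K, N), attention mass on N visual tokens under
    K textual perturbations of the same prompt (assumed to be given).
    Returns N weights: above 1 for low-variance (stable) tokens, below 1
    for high-variance (suspicious) ones. The formula is an assumption.
    """
    attn = np.asarray(attn, dtype=float)
    var = attn.var(axis=0)  # per-token variance across perturbations
    span = var.max() - var.min()
    if span == 0:
        stability = np.ones_like(var)  # all tokens equally stable
    else:
        stability = 1.0 - (var - var.min()) / span  # 1 = most stable
    return 1.0 + alpha * (stability - 0.5)

def debias_logits(base_logits, perturbed_logits, beta=0.5):
    """Illustrative correction of next-token logits along a drift direction.

    base_logits: shape (V,), logits under the original prompt.
    perturbed_logits: shape (K, V), logits under K perturbed prompts.
    The drift is taken here as the mean shift caused by perturbation
    (an assumption); a fraction beta of it is subtracted from the base.
    """
    base = np.asarray(base_logits, dtype=float)
    pert = np.asarray(perturbed_logits, dtype=float)
    drift = pert.mean(axis=0) - base
    return base - beta * drift

# Toy inputs: 2 perturbations, 3 visual tokens, 2 vocabulary entries.
weights = stability_weights([[0.5, 0.1, 0.4], [0.5, 0.3, 0.2]])
adjusted = debias_logits([2.0, 1.0], [[1.0, 2.0], [1.0, 2.0]])
```

In this toy run the first visual token receives identical attention under both perturbations, so it gets the largest weight, and the vocabulary entry whose logit rises under perturbation is pushed down in the adjusted logits.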

Multimodal Models · Natural Language Processing · Red-Teaming & Adversarial Robustness

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
