Feb 25, 2026arXiv:2602.21704

Dynamic Multimodal Activation Steering for Hallucination Mitigation in Large Vision-Language Models

Jianghao Yin, Qin Chen, Kedi Chen, Jie Zhou, Xingjiao Wu, Liang He

AI Summary

This paper investigates hallucination in LVLMs, finding that truthfulness and visual perception rely on distinct attention head subsets and that truthfulness steering vectors are context-dependent. To address this, they propose Dynamic Multimodal Activation Steering, a training-free inference-time method. The approach dynamically selects and applies contextually relevant truthfulness steering vectors from a semantic database, combined with visual perception steering vectors, to influential attention heads, achieving state-of-the-art hallucination mitigation.

Key Contribution

By dynamically steering attention heads based on semantic context, LVLMs can significantly reduce hallucinations without any additional training.

Abstract

Large Vision-Language Models (LVLMs) exhibit outstanding performance on vision-language tasks but struggle with hallucination problems. Through in-depth analysis of LVLM activation patterns, we reveal two key findings: 1) truthfulness and visual perception capabilities predominantly engage different subsets of attention heads within the model architecture; and 2) truthfulness steering vectors vary significantly across different semantic contexts. Based on these observations, we propose Dynamic Multimodal Activation Steering, a training-free approach for hallucination mitigation. Our method constructs a semantic-based truthfulness steering vector database and computes visual perception steering vectors, enabling context-aware interventions during inference by dynamically selecting the most relevant steering vectors based on input semantic similarity and applying them to the most influential attention heads. We conduct comprehensive experiments across multiple models and datasets, demonstrating that our approach significantly enhances model performance, outperforming existing state-of-the-art methods.

Computer Vision Interpretability & Mechanistic Interp Multimodal Models

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Dynamic Multimodal Activation Steering for Hallucination Mitigation in Large Vision-Language Models

Related Papers