NTU Taiwan | Mar 15, 2026 | arXiv:2603.14636

Nudging Hidden States: Training-Free Model Steering for Chain-of-Thought Reasoning in Large Audio-Language Models

Lok-Lam Ieong, Chia-Chien Chen, Chih-Kai Yang, Yu-Han Huang, An-Yu Cheng, Hung-yi Lee

AI Summary

This paper explores training-free, inference-time model steering to enhance chain-of-thought (CoT) reasoning in large audio-language models (LALMs). The authors introduce three steering strategies that draw on diverse information sources and evaluate them on four LALMs across four benchmarks, reporting accuracy gains of up to 4.4% over standard CoT prompting. A key finding is cross-modal transfer: steering vectors derived from a few text samples effectively guide speech-based reasoning, which points to high data efficiency.

Key Contribution

Training-free, inference-time steering raises the reasoning accuracy of large audio-language models by up to 4.4% over CoT prompting, and steering vectors derived from a few text samples transfer to speech-based reasoning.

Abstract

Chain-of-thought (CoT) prompting has been extended to large audio-language models (LALMs) to elicit reasoning, yet enhancing its effectiveness without training remains challenging. We study inference-time model steering as a training-free approach to improve LALM reasoning. We introduce three strategies using diverse information sources and evaluate them across four LALMs and four benchmarks. Results show general accuracy gains of up to 4.4% over CoT prompting. Notably, we identify a cross-modal transfer in which steering vectors derived from a few text samples effectively guide speech-based reasoning, demonstrating high data efficiency. We also examine hyperparameter sensitivity to understand the robustness of these approaches. Our findings position model steering as a practical direction for strengthening LALM reasoning.
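The abstract does not spell out the three steering strategies, so the following is only a generic illustration of what inference-time steering usually means: a direction is estimated from hidden states and added, scaled, to a hidden state at inference time, with no weight updates. The sketch assumes a difference-of-means steering vector, which is a common choice in the steering literature and not necessarily the authors' method. All function names, the toy data, and the scaling factor `alpha` are hypothetical.

```python
from typing import List

Vector = List[float]


def mean_vector(rows: List[Vector]) -> Vector:
    """Element-wise mean of a list of equal-length hidden-state vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def steering_vector(target: List[Vector], baseline: List[Vector]) -> Vector:
    """Difference of means: mean(target states) - mean(baseline states).

    Assumption: 'target' holds hidden states from prompts showing the
    desired behaviour and 'baseline' holds states from prompts that do not.
    """
    mu_target = mean_vector(target)
    mu_baseline = mean_vector(baseline)
    return [t - b for t, b in zip(mu_target, mu_baseline)]


def steer(hidden: Vector, vector: Vector, alpha: float) -> Vector:
    """Nudge one hidden state along the steering direction, scaled by alpha."""
    return [h + alpha * v for h, v in zip(hidden, vector)]


# Toy 2-dimensional hidden states (made up for illustration only).
target_states = [[1.0, 2.0], [3.0, 4.0]]
baseline_states = [[0.0, 1.0], [2.0, 1.0]]

v = steering_vector(target_states, baseline_states)
steered = steer([0.5, 0.5], v, alpha=2.0)
print(v, steered)
```

In a real model the vectors would be layer activations, and the layer and `alpha` would be hyperparameters of the kind whose sensitivity the abstract says is examined. Under this framing, the cross-modal finding corresponds to estimating `v` from text inputs and applying it while the model processes speech.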

Multimodal Models, Reasoning & Chain-of-Thought, Speech & Audio

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
