ZJUMay 28, 2026arXiv:2605.30219

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

Haoming Xu, Weihong Xu, Zongrui Li, Mengru Wang, Yunzhi Yao, Chiyu Wu, Jingbo Shang, Jin Shang, Yu Gong, Yujia Gong, Shumin Deng

AI Summary

The paper introduces Contextual Belief Management (CBM) to evaluate how well LLMs maintain and update beliefs based on evidence while ignoring irrelevant information. They present BeliefTrack, a benchmark with Rule Discovery and Circuit Diagnosis tasks, enabling precise turn-level evaluation of belief state accuracy. Results show that vanilla LLMs struggle with CBM, but reinforcement learning with belief-state rewards and representation-level steering significantly reduce failure rates.

Key Contribution

LLMs often fail to maintain accurate beliefs in multi-turn interactions, but targeted reinforcement learning and representation steering can dramatically improve their contextual reasoning.

Abstract

Long-horizon interactions require language models to manage accumulating information: when to update their state, when to preserve their state, and what to ignore. We study this challenge as \textbf{Contextual Belief Management (CBM)}: maintaining a predicted belief state aligned with formal evidence while isolating task-irrelevant noise. To make CBM measurable, we introduce BeliefTrack, a closed-world benchmark spanning Rule Discovery and Circuit Diagnosis, where a finite belief space and symbolic verifiers enable exact turn-level evaluation. BeliefTrack diagnoses three failures: Failed Stay, Failed Update, and Failed Isolation. Across multiple LLMs, vanilla models exhibit severe CBM failures, while explicit belief-tracking prompts provide limited gains. In contrast, reinforcement learning with belief-state rewards reduces failure rates by 70.9\% on average. Further probing reveals latent belief-state dynamics behind these failures, and representation-level steering reduces failure rates by 46.1\% across two tasks\footnote{Code is coming soon at https://github.com/zjunlp/CBM.

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References45

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

Related Papers