Tsinghua AIHKUPKUJun 9, 2026arXiv:2606.10298

From Context-Aware to Conflict-Aware: Generalizing Contrastive Decoding for Knowledge Conflict in LLMs

AI Summary

This paper introduces a novel approach to contrastive decoding in large language models (LLMs) by shifting from a context-aware to a conflict-aware paradigm, allowing for dynamic authority allocation between external context and parametric priors based on conflict signals. The authors demonstrate that existing methods tend to amplify errors or under-correct outputs due to their static nature, and they propose a new evaluation protocol, TriState-Bench, to assess the model's handling of knowledge conflicts. Their Adaptive Regime Routing (ARR) method significantly improves error management, achieving resistance error metrics from below 6 to 16–33 without compromising correction or agreement.

Key Contribution

A conflict-aware approach to decoding can triple resistance to errors in LLMs while maintaining accuracy, fundamentally changing how we handle knowledge conflicts in AI.

Abstract

When large language models generate from retrieved or augmented contexts, conflicts between external context and parametric priors remain a central reliability bottleneck. Existing contrastive decoding methods follow a \emph{context-aware} paradigm that unilaterally amplifies context over parametric priors, overwriting correct priors when the context is erroneous. We generalize this to the \textbf{conflict-aware} paradigm that dynamically allocates authority between prior and context based on conflict signals, rather than presupposing context trustworthiness. We show that the affine combination of prior and context logits yields a \textbf{power family} with an inherent \textbf{regime asymmetry}: extrapolation amplifies errors unboundedly when the prior is correct, interpolation under-corrects when the context is correct, and no static regime covers both. Existing contrastive decoding methods are instances of this family, mostly extrapolative. To evaluate both conflict directions, we propose TriState-Bench, a model-aware evaluation protocol that calibrates per-model prior knowledge to measure three conflict states: correction, resistance, and agreement. To resolve the asymmetry, we propose Adaptive Regime Routing (ARR), which routes between regimes at each step, lifting resistance EM from below 6 to 16--33 without sacrificing correction or agreement. Our code is available at https://github.com/keith-Jiang/conflict-aware-decoding.

Natural Language Processing Red-Teaming & Adversarial Robustness

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

From Context-Aware to Conflict-Aware: Generalizing Contrastive Decoding for Knowledge Conflict in LLMs

Related Papers