Search papers, labs, and topics across Lattice.
This paper investigates the problem of "Context Reliance" in next-token prediction (NTP) based unstructured knowledge editing of LLMs, where edited knowledge becomes overly dependent on the context used during editing. They demonstrate both empirically and theoretically that gradient-based optimization leads to this context reliance, causing recall failures when the context is absent during inference. To mitigate this, they propose COIN, a context-independent editing framework that encourages the model to focus on local knowledge, achieving a 45.2% reduction in Context Reliance and a 23.6% improvement in editing success rate.
LLM knowledge editing often fails because the "fix" only works when you ask about it in the same way it was trained, revealing a surprising brittleness in how models incorporate new information.
Editing Large language models (LLMs) with real-world, unstructured knowledge is essential for correcting and updating their internal parametric knowledge. In this work, we revisit the fundamental next-token prediction (NTP) as a candidate paradigm for unstructured editing. We identify Context Reliance as a critical failure mode of NTP-based approaches, where knowledge acquired from edited text becomes highly dependent on its preceding context, leading to recall failures when that context is absent during inference. This hypothesis is supported by our empirical validation that prepending context during inference recovers knowledge recall. We further theoretically demonstrate that Context Reliance is an inherent consequence of gradient-based optimization, which tends to bind acquired knowledge to a specific aggregated contextual representation. To address this, we propose a simple yet effective COntext-INdependent editing framework (COIN), encouraging model to focus on knowledge within local scope rather than memorizing contextual patterns. Evaluations show that COIN reduces Context Reliance by 45.2% and outperforms strong baselines by 23.6% in editing success rate, highlighting the vital role of mitigating Context Reliance for robust editing.