CASMar 16, 2026arXiv:2603.15518

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

Xiyu Liu, Qingyi Si, Zhengxiao Liu, Chenxu Yang, Naibin Gu, Zheng Lin

AI Summary

The paper identifies a generalization failure in same-subject knowledge editing of LLMs, where models struggle to recall updated knowledge under prompt variations. It attributes this to a "Covariance Trap" caused by orthogonal gradients and the standard covariance constraint, leading to instability and sensitivity to input perturbations. To address this, the authors propose RoSE (Robust Same-subject Editing), which uses Isotropic Geometric Alignment and Hierarchical Knowledge Integration to improve instruction-following capabilities.

Key Contribution

LLMs can fail to generalize knowledge edits to instruction-following scenarios due to a "Covariance Trap," but RoSE unlocks robust interactive parametric memory by aligning representations and smoothing the optimization landscape.

Abstract

While locate-then-edit knowledge editing efficiently updates knowledge encoded within Large Language Models (LLMs), a critical generalization failure mode emerges in the practical same-subject knowledge editing scenario: models fail to recall the updated knowledge when following user instructions, despite successfully recalling it in the original edited form. This paper identifies the geometric root of this generalization collapse as a fundamental conflict where the inner activation drifts induced by prompt variations exceed the model's geometric tolerance for generalization after editing. We attribute this instability to a dual pathology: (1) The joint optimization with orthogonal gradients collapses solutions into sharp minima with narrow stability, and (2) the standard covariance constraint paradoxically acts as a Covariance Trap that amplifies input perturbations. To resolve this, we introduce RoSE (Robust Same-subject Editing), which employs Isotropic Geometric Alignment to minimize representational deviation and Hierarchical Knowledge Integration to smooth the optimization landscape. Extensive experiments demonstrate that RoSE significantly improves instruction-following capabilities, laying the foundation for robust interactive parametric memory of LLM agents.

Eval Frameworks & Benchmarks Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

Related Papers