Tsinghua AIApr 29, 2026arXiv:2604.26768

Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation

Weihang Su, Hanwen Zhang, Qingyao Ai, Yiqun Liu

AI Summary

This paper investigates the entanglement of task-solving behavior and document-specific knowledge within adapters in Parametric Retrieval-Augmented Generation (PRAG). They propose Orthogonal Subspace Decomposition (OSD) to train Task LoRAs and document LoRAs in orthogonal subspaces, thereby decoupling reusable task behavior from document-specific knowledge. Experiments demonstrate that OSD improves the compositional robustness of PRAG, particularly when merging multiple document adapters across knowledge-intensive tasks and model scales.

Key Contribution

Untangling task-solving skills from factual knowledge in PRAG adapters makes them play better together, boosting performance when you combine multiple documents.

Abstract

Parametric Retrieval-Augmented Generation (PRAG) encodes external documents into lightweight parameter modules that can be retrieved and merged at inference time, offering a promising alternative to in-context retrieval augmentation. Despite its potential, many PRAG implementations train document adapters with task-supervised objectives, which may cause each adapter to encode both document-specific facts and reusable task-solving behavior. This entanglement may make adapter composition less reliable: when multiple adapters are merged at inference time, their overlapping task behaviors can accumulate together with document-specific updates, potentially making the merged adapter less stable and less focused on the intended document knowledge. To examine this issue, we explore Orthogonal Subspace Decomposition (OSD), an adapter-training setup that separates reusable task behavior from document-specific knowledge adapters. Concretely, we first train a Task LoRA to capture reusable task behavior, and then train document LoRAs to encode document-specific knowledge in a orthogonal subspace. This setup provides a controlled way to examine how orthogonalizing task and document LoRA updates affects adapter composition in multi-document PRAG. Experiments across multiple knowledge-intensive tasks and model scales suggest that this orthogonalization strategy can improve compositional robustness in parametric RAG, especially when multiple document adapters are merged.

Architecture Design (Transformers, SSMs, MoE)Natural Language Processing Recommendation & Information Retrieval

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation

Related Papers