Search papers, labs, and topics across Lattice.
The paper investigates the performance degradation observed when incorporating Chain-of-Thought (CoT) reasoning into Semantic ID-based recommendation foundation models like OpenOneRec. They diagnose the issue as "textual inertia" stemming from the General Subspace, where verbose reasoning overshadows the crucial Semantic ID information. To rectify this, they introduce a training-free Inference-Time Subspace Alignment framework that compresses reasoning chains and employs bias-subtracted contrastive decoding, leading to improved recommendation accuracy.
CoT reasoning can hurt recommender performance by drowning out important ID signals – unless you compress reasoning chains and use bias-subtracted contrastive decoding to realign the inference subspace.
Integrating Chain-of-Thought (CoT) reasoning into Semantic ID-based recommendation foundation models (such as OpenOneRec) often paradoxically degrades recommendation performance. We identify the root cause as textual inertia from the General Subspace, where verbose reasoning dominates inference and causes the model to neglect critical Semantic ID. To address this, we propose a training-free Inference-Time Subspace Alignment framework. By compressing reasoning chains and applying bias-subtracted contrastive decoding, our approach mitigates ungrounded textual drift. Experiments show this effectively calibrates inference, allowing foundation models to leverage reasoning without sacrificing ID-grounded accuracy.