This paper introduces RecoChain, a unified framework that integrates generative retrieval and ranking within a single Transformer architecture to enhance next-item recommendation systems. By employing hierarchical semantic ID prediction for candidate generation followed by a SIM-based ranking process, the model effectively narrows the performance gap between generating potential items and selecting the most relevant ones. Experimental results on large-scale datasets show that RecoChain significantly improves Top-K recommendation performance while preserving robust generative capabilities.
Bridging the gap between generative retrieval and ranking, RecoChain achieves superior Top-K recommendation performance without sacrificing generative strength.
Generative recommender systems have recently emerged as a promising paradigm that formulates next-item prediction as auto-regressive generation of semantic IDs, as in the OneRec series. However, because this prediction paradigm is next-item-agnostic, the model can surface potential next items via beam search over semantic IDs but struggles to estimate which of them are better (for example, selecting the top 10 from 256 beam candidates), which leaves a gap between generation and ranking performance. To bridge this gap, we propose RecoChain, a unified generative retrieval and ranking framework that integrates candidate generation and ranking within a single Transformer backbone. Specifically, at inference time, the model first generates candidate items via hierarchical semantic ID prediction, then continues with a SIM-based ranking process that estimates the click probability of each candidate item. Extensive experiments on large-scale real-world datasets demonstrate that our approach effectively bridges the gap between generative retrieval and ranking, achieving improved Top-K recommendation performance while maintaining strong generative capability.
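The two-stage inference flow described in the abstract (beam generation of semantic IDs, then re-ranking of the beam to pick the Top-K) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names `beam_generate` and `rank_top_k`, the toy probability tables, and the toy click scores are hypothetical and do not come from the paper. In particular, the click scorer is a plain lookup that stands in for the SIM-based ranking process, whose details the abstract does not give, and the two stages are separate callables here, whereas RecoChain runs both within a single Transformer backbone.

```python
import heapq
import math
from typing import Callable, Dict, List, Tuple

SemanticID = Tuple[int, ...]


def beam_generate(
    next_log_probs: Callable[[SemanticID], Dict[int, float]],
    depth: int,
    beam_width: int,
) -> List[Tuple[SemanticID, float]]:
    """Beam search over hierarchical semantic IDs, one code level per step."""
    beams: List[Tuple[SemanticID, float]] = [((), 0.0)]
    for _ in range(depth):
        expanded = []
        for prefix, score in beams:
            for token, log_p in next_log_probs(prefix).items():
                expanded.append((prefix + (token,), score + log_p))
        # Keep only the highest-likelihood partial IDs.
        beams = heapq.nlargest(beam_width, expanded, key=lambda b: b[1])
    return beams


def rank_top_k(
    candidates: List[Tuple[SemanticID, float]],
    click_score: Callable[[SemanticID], float],
    k: int,
) -> List[Tuple[SemanticID, float]]:
    """Re-score generated candidates and keep the k best by click score."""
    scored = [(sid, click_score(sid)) for sid, _ in candidates]
    return heapq.nlargest(k, scored, key=lambda s: s[1])


# Toy generation model (assumed): conditional token probabilities per prefix.
TOY_PROBS: Dict[SemanticID, Dict[int, float]] = {
    (): {0: 0.6, 1: 0.4},
    (0,): {0: 0.7, 1: 0.3},
    (1,): {0: 0.2, 1: 0.8},
}

# Toy click scores (assumed): a stand-in for the ranking stage's output.
TOY_CLICK: Dict[SemanticID, float] = {
    (0, 0): 0.1,
    (0, 1): 0.9,
    (1, 0): 0.99,
    (1, 1): 0.5,
}


def toy_next_log_probs(prefix: SemanticID) -> Dict[int, float]:
    return {tok: math.log(p) for tok, p in TOY_PROBS[prefix].items()}


candidates = beam_generate(toy_next_log_probs, depth=2, beam_width=3)
top = rank_top_k(candidates, TOY_CLICK.__getitem__, k=2)
```

In this toy setup the beam keeps (0, 0), (1, 1), and (0, 1) in order of generation likelihood, while the ranking stage reorders them and selects (0, 1) and (1, 1). The item (1, 0) has the highest click score but was pruned during generation, so ranking never sees it: the ranking stage can only reorder what the generation stage produces.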