Search papers, labs, and topics across Lattice.
The paper introduces Adaptive Prefix-Aware Optimization (APAO), a novel framework to address the training-inference discrepancy in generative recommendation models caused by beam search pruning. APAO incorporates prefix-level optimization losses, focusing on vulnerable prefixes through an adaptive worst-prefix optimization strategy, to better align training with the inference process. Empirical results across multiple datasets demonstrate that APAO consistently improves performance by enhancing the model's ability to retain correct candidates during beam search.
Generative recommendation gets a boost: APAO tackles the training-inference gap by intelligently optimizing for prefixes, leading to better candidate retention during beam search.
Generative recommendation has recently emerged as a promising paradigm in sequential recommendation. It formulates the task as an autoregressive generation process, predicting discrete tokens of the next item conditioned on user interaction histories. Existing generative recommendation models are typically trained with token-level likelihood objectives, such as cross-entropy loss, while employing multi-step beam search during inference to generate ranked item candidates. However, this leads to a fundamental training-inference inconsistency: standard training assumes ground-truth history is always available, ignoring the fact that beam search prunes low-probability branches during inference. Consequently, the correct item may be prematurely discarded simply because its initial tokens (prefixes) have low scores. To address this issue, we propose the Adaptive Prefix-Aware Optimization (APAO) framework, which introduces prefix-level optimization losses to better align the training objective with the inference setting. Furthermore, we design an adaptive worst-prefix optimization strategy that dynamically focuses on the most vulnerable prefixes during training, thereby enhancing the model's ability to retain correct candidates under beam search constraints. We provide theoretical analyses to demonstrate the effectiveness and efficiency of our framework. Extensive experiments on multiple datasets further show that APAO consistently alleviates the training-inference inconsistency and improves performance across various generative recommendation backbones. Our codes are publicly available at https://github.com/yuyq18/APAO.