Search papers, labs, and topics across Lattice.
This paper introduces a practical ranking-stage Long-Term Value (LTV) prediction framework for short-video recommendation, tackling position bias, attribution ambiguity, and temporal limitations. The framework incorporates a Position-aware Debias Quantile (PDQ) module for position normalization, a multi-dimensional attribution module for learning continuous attribution strengths, and a cross-temporal author modeling module for capturing creator-driven re-engagement. Experimental results, including online A/B tests on Taobao, demonstrate significant improvements in LTV metrics and stable trade-offs with short-term objectives, validating the framework's effectiveness and compatibility with industrial constraints.
Taobao's new LTV ranking framework boosts long-term user engagement by learning nuanced video influence and creator-driven re-engagement, all while fitting within existing industrial constraints.
Accurately modeling long-term value (LTV) at the ranking stage of short-video recommendation remains challenging. While delayed feedback and extended engagement have been explored, fine-grained attribution and robust position normalization at billion-scale are still underdeveloped. We propose a practical ranking-stage LTV framework addressing three challenges: position bias, attribution ambiguity, and temporal limitations. (1) Position bias: We introduce a Position-aware Debias Quantile (PDQ) module that normalizes engagement via quantile-based distributions, enabling position-robust LTV estimation without architectural changes. (2) Attribution ambiguity: We propose a multi-dimensional attribution module that learns continuous attribution strengths across contextual, behavioral, and content signals, replacing static rules to capture nuanced inter-video influence. A customized hybrid loss with explicit noise filtering improves causal clarity. (3) Temporal limitations: We present a cross-temporal author modeling module that builds censoring-aware, day-level LTV targets to capture creator-driven re-engagement over longer horizons; the design is extensible to other dimensions (e.g., topics, styles). Offline studies and online A/B tests show significant improvements in LTV metrics and stable trade-offs with short-term objectives. Implemented as task augmentation within an existing ranking model, the framework supports efficient training and serving, and has been deployed at billion-scale in Taobao's production system, delivering sustained engagement gains while remaining compatible with industrial constraints.