Tsinghua AI | Taobao | Feb 19, 2026 | arXiv:2602.17058

A Long-term Value Prediction Framework In Video Ranking

Huabin Chen, Xinao Wang, Huiping Chu, Keqin Xu, Chenhao Zhai, Chenyi Wang, Kai Meng, Yuning Jiang

AI Summary

This paper introduces a practical ranking-stage Long-Term Value (LTV) prediction framework for short-video recommendation, tackling position bias, attribution ambiguity, and temporal limitations. The framework incorporates a Position-aware Debias Quantile (PDQ) module for position normalization, a multi-dimensional attribution module for learning continuous attribution strengths, and a cross-temporal author modeling module for capturing creator-driven re-engagement. Experimental results, including online A/B tests on Taobao, demonstrate significant improvements in LTV metrics and stable trade-offs with short-term objectives, validating the framework's effectiveness and compatibility with industrial constraints.
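The position normalization idea above can be illustrated with a small sketch. The summary does not give PDQ's actual formulation, so the following assumes the simplest quantile-based reading: each engagement value is replaced by its empirical quantile among samples logged at the same feed position. The function names and the toy watch-time data are hypothetical.

```python
from bisect import bisect_right
from collections import defaultdict


def fit_position_quantiles(samples):
    """Group observed engagement values by feed position and sort each group.

    samples: iterable of (position, engagement) pairs (hypothetical log format).
    """
    by_pos = defaultdict(list)
    for position, engagement in samples:
        by_pos[position].append(engagement)
    return {pos: sorted(vals) for pos, vals in by_pos.items()}


def position_quantile(table, position, engagement):
    """Empirical CDF of `engagement` among samples logged at the same position."""
    vals = table.get(position)
    if not vals:
        return None  # unseen position: the caller decides the fallback
    return bisect_right(vals, engagement) / len(vals)


# Toy logs: watch time in seconds at position 1 (top of feed) and position 5.
logs = [(1, 30.0), (1, 60.0), (1, 90.0), (1, 120.0),
        (5, 5.0), (5, 10.0), (5, 20.0), (5, 40.0)]
table = fit_position_quantiles(logs)

top = position_quantile(table, 1, 60.0)   # 60 s at position 1
deep = position_quantile(table, 5, 20.0)  # 20 s at position 5
```

In this toy data, 20 seconds at position 5 maps to a higher quantile than 60 seconds at position 1, which is the kind of position-relative comparison a quantile normalization is meant to enable. A production version would likely need smoothing for sparse positions, which this sketch omits.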

Key Contribution

Taobao's new LTV ranking framework boosts long-term user engagement by learning nuanced video influence and creator-driven re-engagement, all while fitting within existing industrial constraints.

Abstract

Accurately modeling long-term value (LTV) at the ranking stage of short-video recommendation remains challenging. While delayed feedback and extended engagement have been explored, fine-grained attribution and robust position normalization at billion-scale are still underdeveloped. We propose a practical ranking-stage LTV framework addressing three challenges: position bias, attribution ambiguity, and temporal limitations. (1) Position bias: We introduce a Position-aware Debias Quantile (PDQ) module that normalizes engagement via quantile-based distributions, enabling position-robust LTV estimation without architectural changes. (2) Attribution ambiguity: We propose a multi-dimensional attribution module that learns continuous attribution strengths across contextual, behavioral, and content signals, replacing static rules to capture nuanced inter-video influence. A customized hybrid loss with explicit noise filtering improves causal clarity. (3) Temporal limitations: We present a cross-temporal author modeling module that builds censoring-aware, day-level LTV targets to capture creator-driven re-engagement over longer horizons; the design is extensible to other dimensions (e.g., topics, styles). Offline studies and online A/B tests show significant improvements in LTV metrics and stable trade-offs with short-term objectives. Implemented as task augmentation within an existing ranking model, the framework supports efficient training and serving, and has been deployed at billion-scale in Taobao's production system, delivering sustained engagement gains while remaining compatible with industrial constraints.
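As a rough illustration of what a censoring-aware, day-level target might look like, the sketch below sums one user's engagement with one author over the days following an impression and flags the label as censored when the horizon runs past the last logged day. The abstract does not specify the construction, so the data layout, the `AuthorLtvTarget` type, and the `build_target` helper are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class AuthorLtvTarget:
    value: float        # engagement with the author summed over observed days
    observed_days: int  # how many horizon days fall inside the log window
    censored: bool      # True when the horizon extends past the last logged day


def build_target(daily_engagement, impression_day, horizon, last_logged_day):
    """Build a day-level target for one (user, author) pair.

    daily_engagement: dict mapping day index -> engagement on that day
    (hypothetical format). Days after the impression, up to `horizon`, count.
    """
    end_day = min(impression_day + horizon, last_logged_day)
    observed_days = max(0, end_day - impression_day)
    value = sum(daily_engagement.get(day, 0.0)
                for day in range(impression_day + 1, end_day + 1))
    return AuthorLtvTarget(value, observed_days, observed_days < horizon)


daily = {11: 40.0, 13: 25.0, 16: 10.0}

# Full 7-day horizon is inside the log window.
full = build_target(daily, impression_day=10, horizon=7, last_logged_day=30)

# Logs stop at day 14, so only 4 of the 7 horizon days are observed.
partial = build_target(daily, impression_day=10, horizon=7, last_logged_day=14)
```

Carrying `observed_days` and `censored` alongside the value lets a downstream loss treat partially observed labels differently (for example, as lower bounds) instead of reading them as low long-term value.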

Recommendation & Information Retrieval

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
