NTURUCShopee Pte. Ltd.Jun 15, 2026arXiv:2606.16838

OneRank: Unified Transformer-Native Ranking Architecture for Multi-Task Recommendation

Jiakai Tang, Sunhao Dai, Kun Wang, Zhiluohan Guo, Yu Zhao, Cong Fu, Kangle Wu, Yabo Ni, Anxiang Zeng, Xu Chen, Jun Xu

AI Summary

This paper introduces OneRank, a Transformer-native multi-task ranking architecture that integrates feature encoding with multi-task prediction to enhance performance in recommendation systems. By eliminating the separation between the encoder and predictor, OneRank addresses key issues such as information bottlenecks and gradient interference, leading to improved task-specific learning and reduced inter-task interference. Experimental results demonstrate that OneRank significantly outperforms existing state-of-the-art methods while maintaining computational efficiency on large-scale datasets.

Key Contribution

OneRank achieves superior multi-task recommendation performance by seamlessly integrating task-specific learning within a unified Transformer framework, eliminating the traditional encoder-predictor bottleneck.

Abstract

Multi-task learning (MTL) is essential in recommender systems to enable complementary learning among diverse user feedback. While modern industrial practices have shifted from DNNs to Transformer-centric architectures to strengthen sequence modeling and scaling capacity, they still decouple feature encoding from multi-task prediction, treating the Transformer as a task-agnostic encoder. This design fundamentally limits the performance and scalability by (1) creating an information bottleneck under heterogeneous task objectives, (2) inducing gradient interference that leads to the seesaw phenomenon, and (3) forcing a dataflow transition in which attention-based, context-adaptive representation learning is converted to static feed-forward task prediction with incompatible information read-write dynamics. We propose OneRank, a Transformer-native multi-task ranking framework that eliminates encoder-predictor separation and introduces task-private channels for forward representation learning and backward optimization, enabling task-specialized learning while reducing inter-task interference. In the forward pass, OneRank learns task-specific representations bottom-up through task-conditioned information selection, candidate-aware contextualization, and controlled cross-task interaction. In the backward pass, cross-task gradient detachment isolates task-private parameter updates from shared knowledge extraction modules, preventing negative transfer. We further replace static task-specific MLP scorers with dynamic matching-based scoring for context-aware personalized ranking. By internalizing multi-task reasoning within the Transformer stack, OneRank establishes a unified and scalable architectural paradigm. Offline and online experiments on large-scale industrial datasets show that OneRank significantly outperforms state-of-the-art baselines while maintaining computational efficiency.

Architecture Design (Transformers, SSMs, MoE)Recommendation & Information Retrieval

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

OneRank: Unified Transformer-Native Ranking Architecture for Multi-Task Recommendation

Related Papers