Mar 15, 2026arXiv:2603.14422

MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

Yuantong Li, Lei Yuan, Zhihao Zheng, Weimiao Wu, Songbin Liu, Jeong Min Lee, Ali Selman Aydin, Shaofeng Deng, Junbo Chen, Xinyi Zhang, Hongjing Xia, Sam Fieldman, Matthew Kosko, Wei Fu, Du Zhang, Peiyu Yang, Albert Jin Chung, Xianlei Qiu, Miao Yu, Zhongwei Teng, Hao Chen, Sunny Baek, Hui Tang, Yang Lv, Renze Wang, Qifan Wang, Zhan Li, Tiantian Xu, Peng Wu, Ji Liu

AI Summary

This paper introduces Model-Based Debiasing (MBD), a framework to transform biased behavioral signals in recommendation systems into unbiased representations by augmenting the existing ranking model with distributional modeling. MBD estimates the contextual mean and variance of engagement distributions for arbitrary cohorts by conditioning on a flexible subset of features, allowing for the conversion of biased raw signals into calibrated signals like percentiles or z-scores. The framework is implemented as a lightweight branch of the existing MTML ranking model, enabling flexible and controllable debiasing without requiring separate serving infrastructure.

Key Contribution

Recommendation systems can now systematically debias engagement signals across user, content, and model dimensions using a lightweight, in-model approach, leading to more accurate value models and stable ecosystem dynamics.

Abstract

Modern recommendation systems rank candidates by aggregating multiple behavioral signals through a value model. However, many commonly used signals are inherently affected by heterogeneous biases. For example, watch time naturally favors long-form content, loop rate favors short - form content, and comment probability favors videos over images. Such biases introduce two critical issues: (1) value model scores may be systematically misaligned with users' relative preferences - for instance, a seemingly low absolute like probability may represent exceptionally strong interest for a user who rarely engages; and (2) changes in value modeling rules can trigger abrupt and undesirable ecosystem shifts. In this work, we ask a fundamental question: can biased behavioral signals be systematically transformed into unbiased signals, under a user - defined notion of ``unbiasedness'', that are both personalized and adaptive? We propose a general, model-based debiasing (MBD) framework that addresses this challenge by augmenting it with distributional modeling. By conditioning on a flexible subset of features (partial feature set), we explicitly estimate the contextual mean and variance of the engagement distribution for arbitrary cohorts (e.g., specific video lengths or user regions) directly alongside the main prediction. This integration allows the framework to convert biased raw signals into unbiased representations, enabling the construction of higher-level, calibrated signals (such as percentiles or z - scores) suitable for the value model. Importantly, the definition of unbiasedness is flexible and controllable, allowing the system to adapt to different personalization objectives and modeling preferences. Crucially, this is implemented as a lightweight, built-in branch of the existing MTML ranking model, requiring no separate serving infrastructure.

Constitutional AI & AI Ethics Recommendation & Information Retrieval

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

Related Papers