Search papers, labs, and topics across Lattice.
This paper introduces Model-Based Debiasing (MBD), a framework to transform biased behavioral signals in recommendation systems into unbiased representations by augmenting the existing ranking model with distributional modeling. MBD estimates the contextual mean and variance of engagement distributions for arbitrary cohorts by conditioning on a flexible subset of features, allowing for the conversion of biased raw signals into calibrated signals like percentiles or z-scores. The framework is implemented as a lightweight branch of the existing MTML ranking model, enabling flexible and controllable debiasing without requiring separate serving infrastructure.
Recommendation systems can now systematically debias engagement signals across user, content, and model dimensions using a lightweight, in-model approach, leading to more accurate value models and stable ecosystem dynamics.
Modern recommendation systems rank candidates by aggregating multiple behavioral signals through a value model. However, many commonly used signals are inherently affected by heterogeneous biases. For example, watch time naturally favors long-form content, loop rate favors short - form content, and comment probability favors videos over images. Such biases introduce two critical issues: (1) value model scores may be systematically misaligned with users' relative preferences - for instance, a seemingly low absolute like probability may represent exceptionally strong interest for a user who rarely engages; and (2) changes in value modeling rules can trigger abrupt and undesirable ecosystem shifts. In this work, we ask a fundamental question: can biased behavioral signals be systematically transformed into unbiased signals, under a user - defined notion of ``unbiasedness'', that are both personalized and adaptive? We propose a general, model-based debiasing (MBD) framework that addresses this challenge by augmenting it with distributional modeling. By conditioning on a flexible subset of features (partial feature set), we explicitly estimate the contextual mean and variance of the engagement distribution for arbitrary cohorts (e.g., specific video lengths or user regions) directly alongside the main prediction. This integration allows the framework to convert biased raw signals into unbiased representations, enabling the construction of higher-level, calibrated signals (such as percentiles or z - scores) suitable for the value model. Importantly, the definition of unbiasedness is flexible and controllable, allowing the system to adapt to different personalization objectives and modeling preferences. Crucially, this is implemented as a lightweight, built-in branch of the existing MTML ranking model, requiring no separate serving infrastructure.