HuaweiApr 22, 2026arXiv:2604.20777

Efficient Multi-Cohort Inference for Long-Term Effects and Lifetime Value in A/B Testing with User Learning

Dario Simionato, Andrea Tonon, Mingxue Wang, Weiguo Wang, Tong Gui, Xiaoyue Li

AI Summary

This paper introduces a method for estimating long-term treatment effects (LTE) and residual lifetime value change ($ΔERLV$) in A/B tests, accounting for user learning and churn, by modeling the treatment effect trajectory as a parametric decay. They propose an inverse-variance weighted estimator that combines multiple cohort estimates to improve precision in estimating time-varying treatment effects. Empirical results demonstrate that the proposed framework improves the precision of LTE and $ΔERLV$ estimation, enabling more accurate product decisions compared to relying solely on short-term or long-term metrics.

Key Contribution

Short-term A/B test metrics can be misleading: this paper shows how to accurately estimate long-term value changes by modeling treatment effects as a decaying function learned from multiple cohorts.

Abstract

In streaming platforms churn is extremely costly, yet A/B tests are typically evaluated using outcomes observed within a limited experimental horizon. Even when both short- and predicted long-term engagement metrics are considered, they may fail to capture how a treatment affects users' retention. Consequently, an intervention may appear beneficial in the short term and neutral in the long term while still generating lower total value than the control due to users churn. To address this limitation, we introduce a method that estimates long-term treatment effects (LTE) and residual lifetime value change ($ΔERLV$) in short multi-cohort A/B tests under user learning. To estimate time-varying treatment effects efficiently, we introduce an inverse-variance weighted estimator that combines multiple cohorts estimates, reducing variance relative to standard approaches in the literature. The estimated treatment trajectory is then modeled as a parametric decay to recover both the asymptotic treatment effect and the cumulative value generated over time. Our framework enables simultaneous evaluation of steady-state impact and residual user value within a single experiment. Empirical results show improved precision in estimating LTE and $ΔERLV$ and identify scenarios in which relying on either short-term or long-term metrics alone would lead to incorrect product decisions.

Recommendation & Information Retrieval

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Efficient Multi-Cohort Inference for Long-Term Effects and Lifetime Value in A/B Testing with User Learning

Related Papers