DAMO · Mar 2, 2026 · arXiv:2603.02184

MAC: A Conversion Rate Prediction Benchmark Featuring Labels Under Multiple Attribution Mechanisms

AI Summary

This paper introduces the Multi-Attribution Benchmark (MAC), a new public dataset for conversion rate (CVR) prediction that includes labels generated by multiple attribution mechanisms, addressing the lack of such data for multi-attribution learning (MAL) research. Through comprehensive experiments on MAC, the authors demonstrate the benefits of MAL, particularly for users with long conversion paths, and identify key architectural design principles for effective MAL models. They then propose Mixture of Asymmetric Experts (MoAE), a novel MAL approach that incorporates multi-attribution knowledge learning and main task-centric knowledge utilization, achieving state-of-the-art results on the MAC benchmark.

Key Contribution

A public CVR prediction dataset with labels from multiple attribution mechanisms, together with experiments showing that multi-attribution learning yields consistent gains, provided the architecture and auxiliary objectives are chosen carefully.

Abstract

Multi-attribution learning (MAL), which enhances model performance by learning from conversion labels yielded by multiple attribution mechanisms, has emerged as a promising learning paradigm for conversion rate (CVR) prediction. However, the conversion labels in public CVR datasets are generated by a single attribution mechanism, hindering the development of MAL approaches. To address this data gap, we establish the Multi-Attribution Benchmark (MAC), the first public CVR dataset featuring labels from multiple attribution mechanisms. Besides, to promote reproducible research on MAL, we develop PyMAL, an open-source library covering a wide array of baseline methods. We conduct comprehensive experimental analyses on MAC and reveal three key insights: (1) MAL brings consistent performance gains across different attribution settings, especially for users featuring long conversion paths. (2) The performance growth scales up with objective complexity in most settings; however, when predicting first-click conversion targets, simply adding auxiliary objectives is counterproductive, underscoring the necessity of careful selection of auxiliary objectives. (3) Two architectural design principles are paramount: first, to fully learn the multi-attribution knowledge, and second, to fully leverage this knowledge to serve the main task. Motivated by these findings, we propose Mixture of Asymmetric Experts (MoAE), an effective MAL approach incorporating multi-attribution knowledge learning and main task-centric knowledge utilization. Experiments on MAC show that MoAE substantially surpasses the existing state-of-the-art MAL method. We believe that our benchmark and insights will foster future research in the MAL field. Our MAC benchmark and the PyMAL algorithm library are publicly available at https://github.com/alimama-tech/PyMAL.
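To make the idea of "labels from multiple attribution mechanisms" concrete, the sketch below shows how a single conversion path can produce different per-click labels depending on the mechanism. It is an illustration only: the abstract names first-click attribution, while last-click and linear attribution are included here as common industry mechanisms, on the assumption that they resemble what a multi-attribution dataset might contain. The function name `attribute` and the ad identifiers are hypothetical and are not taken from MAC or PyMAL.

```python
from typing import Dict, List


def attribute(clicks: List[str], mechanism: str) -> Dict[str, float]:
    """Split the credit for one conversion across the ads clicked on its path.

    `clicks` is the ordered list of ad ids clicked before the conversion.
    The mechanism names are illustrative assumptions, not the MAC schema.
    """
    if not clicks:
        return {}
    credit = {ad: 0.0 for ad in clicks}
    if mechanism == "last_click":
        # All credit goes to the final click before the conversion.
        credit[clicks[-1]] += 1.0
    elif mechanism == "first_click":
        # All credit goes to the click that started the path.
        credit[clicks[0]] += 1.0
    elif mechanism == "linear":
        # Credit is shared equally by every click on the path.
        for ad in clicks:
            credit[ad] += 1.0 / len(clicks)
    else:
        raise ValueError(f"unknown attribution mechanism: {mechanism}")
    return credit


# One user path with three clicks, labeled under each mechanism.
path = ["ad_a", "ad_b", "ad_c"]
labels = {m: attribute(path, m) for m in ("last_click", "first_click", "linear")}
```

Under this sketch the same click receives a different label per mechanism, and the labels diverge more as the path grows longer. That is one plausible reading of why the abstract reports larger gains for users with long conversion paths.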

Data Curation & Synthetic Data · Eval Frameworks & Benchmarks · Recommendation & Information Retrieval

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
