This paper introduces a constraint-aware generative reranking framework for advertising feeds that unifies sequence generation and reward estimation in a single network. Constraint-aware reward pruning integrates constraint satisfaction directly into decoding, which makes sequence generation more efficient. Experiments on large-scale industrial feeds and online A/B tests show improved revenue and user engagement while meeting strict latency requirements.
Baking constraints directly into neural decoding lets generative reranking of advertising feeds sidestep its usual latency and constraint-handling limitations.
Optimizing reranking in advertising feeds is a constrained combinatorial problem, requiring simultaneous maximization of platform revenue and preservation of user experience. Recent generative ranking methods enable listwise optimization via autoregressive decoding, but their deployment is hindered by high inference latency and limited constraint handling. We propose a constraint-aware generative reranking framework that transforms constrained optimization into bounded neural decoding. Unlike prior approaches that separate generator and evaluator models, our framework unifies sequence generation and reward estimation into a single network. We further introduce constraint-aware reward pruning, integrating constraint satisfaction directly into decoding to efficiently generate optimal sequences. Experiments on large-scale industrial feeds and online A/B tests show that our method improves revenue and user engagement while meeting strict latency requirements, providing an efficient neural solution for constrained listwise optimization.
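To make the idea of folding constraints into decoding concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy setting: each candidate carries a fixed `value` that stands in for a learned reward estimate, the constraints are a hypothetical cap on ads per list (`max_ads`) and a minimum spacing between ads (`min_gap`), and decoding is a plain beam search. The names `Item`, `violates`, `partial_reward`, and `constrained_beam_search` are all illustrative. Partial lists that already break a constraint are pruned before they are scored, so the search never expands infeasible prefixes.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Item:
    item_id: str
    is_ad: bool
    value: float  # hypothetical stand-in for a learned reward estimate


def violates(seq: Sequence[Item], max_ads: int, min_gap: int) -> bool:
    """Return True if the partial list breaks the ad-load or ad-spacing constraint."""
    ad_positions = [i for i, it in enumerate(seq) if it.is_ad]
    if len(ad_positions) > max_ads:
        return True
    # Consecutive ads must sit at least `min_gap` positions apart.
    return any(b - a < min_gap for a, b in zip(ad_positions, ad_positions[1:]))


def partial_reward(seq: Sequence[Item]) -> float:
    """Toy position-discounted reward (an assumption, not the paper's reward model)."""
    return sum(it.value / (pos + 1) for pos, it in enumerate(seq))


def constrained_beam_search(
    candidates: Sequence[Item],
    length: int,
    beam_width: int,
    max_ads: int,
    min_gap: int,
) -> Tuple[List[str], float]:
    """Decode a list step by step, pruning infeasible prefixes before scoring them."""
    beams: List[Tuple[Tuple[Item, ...], float]] = [((), 0.0)]
    for _ in range(length):
        expanded = []
        for seq, _ in beams:
            used = {it.item_id for it in seq}
            for it in candidates:
                if it.item_id in used:
                    continue
                new_seq = seq + (it,)
                if violates(new_seq, max_ads, min_gap):
                    continue  # constraint-aware pruning: drop the prefix, skip scoring
                expanded.append((new_seq, partial_reward(new_seq)))
        if not expanded:
            break  # no feasible extension exists; return the shorter list
        expanded.sort(key=lambda pair: pair[1], reverse=True)
        beams = expanded[:beam_width]
    best_seq, best_score = beams[0]
    return [it.item_id for it in best_seq], best_score


feed = [
    Item("ad1", True, 5.0),
    Item("ad2", True, 4.0),
    Item("o1", False, 3.0),
    Item("o2", False, 2.0),
    Item("o3", False, 1.0),
]
ids, score = constrained_beam_search(feed, length=4, beam_width=3, max_ads=2, min_gap=2)
print(ids, round(score, 4))
```

In this sketch the feasibility check runs before the reward call, so in a real system the expensive model evaluation would be skipped for pruned prefixes. Unlike the unified network described in the abstract, the generator and the reward here are simple hand-written functions.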