This paper introduces QueryAgent-R1, a memory-augmented agentic framework that improves query recommendation in e-commerce by aligning query generation with product retrieval. It combines chain-of-retrieval optimization with a consistency reward in the reinforcement learning process to close the gap between high query click-through rates (CTR) and low product conversion rates (CVR). In online A/B tests on a large-scale production platform, it improves query CTR by 2.9% and guided CVR by 3.1%.
QueryAgent-R1 improves query CTR by 2.9% and guided CVR by 3.1% in online A/B tests by grounding query generation in actual product retrieval.
Query recommendation in e-commerce search aims to proactively suggest queries that match users' potential interests. However, existing methods mainly optimize query-level relevance while neglecting whether the retrieved products align with users' downstream preferences. This mismatch often leads to high query click-through rates (CTR) but low product conversion rates (CVR). To bridge this gap, we propose QueryAgent-R1, a memory-augmented agentic framework that improves end-to-end alignment via chain-of-retrieval optimization. QueryAgent-R1 grounds query generation in real inventory retrieval, allowing the agent to validate and refine queries based on the retrieved products. We also design a consistency reward in the agentic reinforcement learning (RL) process to jointly optimize query relevance and downstream engagement. In addition, we build a memory abstraction module for efficient user profiling. To support offline evaluation, we construct two datasets from proprietary industrial data and public datasets, on which QueryAgent-R1 consistently outperforms strong baselines. Moreover, on a large-scale production platform, QueryAgent-R1 improves query CTR by 2.9% and guided CVR by 3.1% in online A/B tests.
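To make the idea of grounding queries in retrieval more concrete, the following is a minimal Python sketch, not the paper's implementation. It assumes a reward that linearly blends a query-level relevance score with the fraction of retrieved products the user prefers, and a selection step that retrieves products for each candidate query and keeps the highest-reward one. The names `consistency_reward`, `select_query`, the weight `alpha`, and the `retrieve` and `relevance` callables are all hypothetical; the abstract does not specify the actual reward form or refinement procedure.

```python
from typing import Callable, Iterable, List, Optional, Set, Tuple


def consistency_reward(
    query_relevance: float,
    retrieved: List[str],
    preferred: Set[str],
    alpha: float = 0.5,
) -> float:
    """Hypothetical reward: blend query relevance with downstream product fit.

    `alpha` weights query-level relevance; (1 - alpha) weights the share of
    retrieved products that appear in the user's preferred set.
    """
    if not retrieved:
        return 0.0  # assumption: a query that retrieves nothing earns no reward
    hit_rate = sum(1 for p in retrieved if p in preferred) / len(retrieved)
    return alpha * query_relevance + (1.0 - alpha) * hit_rate


def select_query(
    candidates: Iterable[str],
    retrieve: Callable[[str], List[str]],
    relevance: Callable[[str], float],
    preferred: Set[str],
    alpha: float = 0.5,
) -> Tuple[Optional[str], float]:
    """Retrieve products for each candidate query and keep the best-scoring one."""
    best_query: Optional[str] = None
    best_reward = float("-inf")
    for query in candidates:
        products = retrieve(query)  # ground the query in actual inventory
        reward = consistency_reward(relevance(query), products, preferred, alpha)
        if reward > best_reward:
            best_query, best_reward = query, reward
    return best_query, best_reward
```

Under these assumptions, a broad query with high relevance but no preferred products in its results can lose to a narrower query whose results match the user, which is the CTR-versus-CVR mismatch the abstract describes.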