Search papers, labs, and topics across Lattice.
The paper introduces Multi-Faceted Self-Consistent Preference Aligned CQR (MSPA-CQR) to improve conversational query rewriting by incorporating feedback from rewriting, retrieval, and response generation. They construct self-consistent preference alignment data across these three dimensions to generate diverse rewritten queries and use prefix-guided multi-faceted direct preference optimization to learn preferences. Experiments demonstrate MSPA-CQR's effectiveness in both in- and out-of-distribution settings, indicating improved query rewriting performance.
Conversational search can be dramatically improved by aligning query rewriting with retrieval and response preferences, leading to better results even out-of-distribution.
Conversational Query Rewriting (CQR) aims to rewrite ambiguous queries to achieve more efficient conversational search. Early studies have predominantly focused on the rewriting in isolation, ignoring the feedback from query rewrite, passage retrieval and response generation in the rewriting process. To address this issue, we propose Multi-Faceted Self-Consistent Preference Aligned CQR (MSPA-CQR). Specifically, we first construct self-consistent preference alignment data from three dimensions (rewriting, retrieval, and response) to generate more diverse rewritten queries. Then we propose prefix guided multi-faceted direct preference optimization to learn preference information from three different dimensions. The experimental results show that our MSPA-CQR is effective in both in- and out-of-distribution scenarios.