Jun 11, 2026 · arXiv:2606.13537

When Does Mixing Help? Analyzing Query Embedding Interpolation in Multilingual Dense Retrieval

Tongyao Zhu, Chao-Ming Huang, Min-Yen Kan

AI Summary

This study investigates how interpolating query embeddings affects multilingual dense retrieval, focusing on the mixing ratio between parallel query translations. In a ratio-controlled analysis on the mMARCO dataset with BGE-M3, the authors find that the best mixing ratio outperforms the best monolingual query in 88 of 105 cases. The gains are asymmetric: mixing is uniformly beneficial when retrieving from non-English document indices, whereas indices containing English are best served by pure English queries. English is also the strongest mixing partner for every non-English document language, and once English dominance is controlled for, mixing gains correlate negatively with typological distance.

Key Contribution

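Interpolating the embeddings of parallel query translations at the best ratio beats the best monolingual query in 88 of 105 cases, but the benefit is asymmetric: it holds for non-English document indices, while indices containing English favor pure English queries.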

Abstract

While mixed-language querying is ubiquitous in multilingual communities, the sensitivity of dense retrievers to such queries remains poorly understood. We present a ratio-controlled study on mMARCO that systematically evaluates retrieval performance by varying the mixing proportion of parallel query translations via embedding-level mixing -- constructing mixed queries as an interpolation of monolingual embeddings. Experiments with BGE-M3 demonstrate that an optimal mixing ratio outperforms the best monolingual endpoint in 88/105 cases. We uncover a distinct asymmetry driven by English dominance: mixing is uniformly beneficial when retrieving from non-English document indices, whereas indices containing English are best served by pure English queries. Furthermore, English acts as the strongest mixing partner for every non-English document language. Finally, when controlling for English dominance, mixing gains correlate negatively with typological distance. We conclude that language-mix sensitivity is structured and predictable, and we validate the robustness of these patterns across model families and scales.
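To make "embedding-level mixing" concrete, the following is a minimal sketch, assuming the mixed query is a linear interpolation of two monolingual query embeddings followed by L2 normalization, scored against a document embedding by dot product. The abstract does not specify the exact interpolation or normalization, so these choices, the function names (mix_query_embeddings, sweep_ratios), and the toy 3-dimensional vectors are all illustrative assumptions rather than the authors' implementation. In practice the embeddings would come from a dense retriever such as BGE-M3.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def dot(u, v):
    """Dot product; equals cosine similarity for unit-length inputs."""
    return sum(a * b for a, b in zip(u, v))


def mix_query_embeddings(emb_a, emb_b, alpha):
    """Interpolate two query embeddings (assumed form, not from the paper).

    alpha = 1.0 yields the pure language-A query, alpha = 0.0 the pure
    language-B query. The result is re-normalized to unit length.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if len(emb_a) != len(emb_b):
        raise ValueError("embeddings must have the same dimension")
    mixed = [alpha * a + (1.0 - alpha) * b for a, b in zip(emb_a, emb_b)]
    return l2_normalize(mixed)


def sweep_ratios(emb_a, emb_b, doc_emb, steps=10):
    """Score the mixed query against one document at evenly spaced ratios."""
    results = []
    for i in range(steps + 1):
        alpha = i / steps
        query = mix_query_embeddings(emb_a, emb_b, alpha)
        results.append((alpha, dot(query, doc_emb)))
    return results


# Toy vectors standing in for real encoder outputs (hypothetical values).
query_lang_a = l2_normalize([1.0, 0.0, 0.0])
query_lang_b = l2_normalize([0.0, 1.0, 0.0])
document = l2_normalize([1.0, 1.0, 0.0])

scores = sweep_ratios(query_lang_a, query_lang_b, document, steps=4)
best_alpha, best_score = max(scores, key=lambda pair: pair[1])
```

In this constructed toy case the highest score falls at an interior ratio rather than at either monolingual endpoint, which is the kind of outcome the 88/105 figure counts. A real study would replace the single-document dot product with a retrieval metric computed over a full index.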

Natural Language Processing · Recommendation & Information Retrieval

Citation Metrics

Citations: 0

Influential citations: 0

References: 29

Year: 2026

Venue: N/A
