Feb 19, 2026 | arXiv:2602.16989

WSDM Cup 2026 Multilingual Retrieval: A Low-Cost Multi-Stage Retrieval Pipeline

AI Summary

This paper introduces a low-cost, four-stage retrieval pipeline for the WSDM Cup 2026 multilingual retrieval task. The pipeline combines LLM-based query expansion with BM25 candidate retrieval, dense ranking with jina-embeddings-v4, and pointwise reranking of the top-20 candidates with Qwen3-Reranker-4B. On the official evaluation the system achieves an nDCG@20 of 0.403 and a Judged@20 of 0.95; ablation studies quantify the contribution of each stage under a limited compute budget.
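The two reported metrics can be illustrated with a minimal sketch. It is not the official evaluation code: the function names are hypothetical, nDCG is assumed to use linear gain with a log2 rank discount (other variants use an exponential gain), relevance grades are assumed non-negative, and Judged@k is assumed to be the share of the top-k positions holding a document that has any relevance judgment.

```python
import math
from typing import Dict, List


def judged_at_k(ranking: List[str], qrels: Dict[str, int], k: int = 20) -> float:
    """Fraction of the top-k positions filled by a judged document (assumed definition)."""
    return sum(1 for doc_id in ranking[:k] if doc_id in qrels) / k


def ndcg_at_k(ranking: List[str], qrels: Dict[str, int], k: int = 20) -> float:
    """nDCG@k with linear gain; unjudged documents count as grade 0."""
    dcg = sum(
        qrels.get(doc_id, 0) / math.log2(rank + 2)
        for rank, doc_id in enumerate(ranking[:k])
    )
    # Ideal ranking: all judged grades sorted from highest to lowest.
    ideal_gains = sorted(qrels.values(), reverse=True)[:k]
    idcg = sum(gain / math.log2(rank + 2) for rank, gain in enumerate(ideal_gains))
    return dcg / idcg if idcg > 0 else 0.0


# Toy judgments for one query: "x" is unjudged.
toy_qrels = {"a": 2, "b": 1, "c": 0}
toy_ranking = ["a", "x", "b"]
print(judged_at_k(toy_ranking, toy_qrels, k=3))
print(ndcg_at_k(toy_ranking, toy_qrels, k=3))
```

A Judged@20 of 0.95 under this reading means that, on average, 19 of the top 20 returned documents had been assessed, so the nDCG@20 figure is only mildly affected by unjudged documents being treated as non-relevant.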

Key Contribution

Competitive multilingual retrieval is achievable with a low-cost pipeline that combines LLM query expansion, BM25 candidate retrieval, dense embeddings, and a pointwise reranker applied only to the top-ranked candidates.

Abstract

We present a low-cost retrieval system for the WSDM Cup 2026 multilingual retrieval task, where English queries are used to retrieve relevant documents from a collection of approximately ten million news articles in Chinese, Persian, and Russian, and to output the top-1000 ranked results for each query. We follow a four-stage pipeline that combines LLM-based GRF-style query expansion with BM25 candidate retrieval, dense ranking using long-text representations from jina-embeddings-v4, and pointwise re-ranking of the top-20 candidates using Qwen3-Reranker-4B while preserving the dense order for the remaining results. On the official evaluation, the system achieves nDCG@20 of 0.403 and Judged@20 of 0.95. We further conduct extensive ablation experiments to quantify the contribution of each stage and to analyze the effectiveness of query expansion, dense ranking, and top-$k$ reranking under limited compute budgets.
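The final stage described in the abstract, reranking only the top-20 candidates and keeping the dense order for the rest, can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: `rerank_top_k` and `score_fn` are hypothetical names, and `score_fn` stands in for whatever relevance score a pointwise reranker such as Qwen3-Reranker-4B would return for a query-document pair.

```python
from typing import Callable, List


def rerank_top_k(
    dense_ranked: List[str],
    score_fn: Callable[[str], float],
    k: int = 20,
) -> List[str]:
    """Reorder the first k document ids by score_fn; leave the tail untouched."""
    head, tail = dense_ranked[:k], dense_ranked[k:]
    # sorted() is stable, so documents with equal scores keep their dense order.
    reranked_head = sorted(head, key=score_fn, reverse=True)
    return reranked_head + tail


# Toy example: hypothetical reranker scores for the first three dense results.
toy_scores = {"d1": 0.1, "d2": 0.9, "d3": 0.5}
dense_order = ["d1", "d2", "d3", "d4", "d5"]
final_order = rerank_top_k(dense_order, lambda d: toy_scores.get(d, 0.0), k=3)
print(final_order)  # ['d2', 'd3', 'd1', 'd4', 'd5']
```

Scoring only the head keeps the number of reranker calls fixed at k per query regardless of the 1000-document output depth, which is where the cost saving comes from when the metric of interest is nDCG@20.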

Natural Language Processing | Open-Source Models & Weights | Recommendation & Information Retrieval

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
