Search papers, labs, and topics across Lattice.
This paper replicates Graph-based Adaptive Reranking (GAR) on the BRIGHT benchmark for reasoning-intensive information retrieval, demonstrating its effectiveness in addressing the bounded recall problem without modifying the first-stage retriever. They test GAR with both reasoning and non-reasoning reranking models, finding that the quality of the reranker's signal is crucial for identifying relevant documents within the corpus graph. The results show that GAR significantly improves retrieval effectiveness across various models with minimal computational overhead, making it a practical solution for reasoning-intensive queries.
Iteratively exploring a corpus graph during reranking can substantially boost reasoning-intensive retrieval performance, even with weaker rerankers, offering a surprisingly effective alternative to computationally expensive retriever improvements.
The classical cascading pipeline of retrieve--rerank suffers from a bounded recall problem, stemming from limitations of the first-stage retriever. Most current approaches address the bounded recall problem by improving the first-stage retriever, but this incurs substantial training and inference costs, especially to handle queries that require substantial reasoning. To circumvent the computational costs of reasoning-based retrievers, we replicate the findings of GAR, Graph-based Adaptive Reranking, on the BRIGHT reasoning-intensive retrieval benchmark. GAR addresses the bounded recall problem by modifying the reranking process itself through iterative exploration of a corpus graph, but it was previously only tested on models designed for topical and question-answering-style queries. Hence, reproduce GAR in reasoning-intensive settings with reasoning and non-reasoning reranking models. We observe that the quality of the reranker's signal plays an important role in identifying additional relevant documents within the corpus graph. Overall, we find that GAR boosts the effectiveness of reasoning-intensive retrieval across a variety of models while contributing minimally to computational overheads. Ultimately, this work enables more practical deployment of retrieval systems that can address reasoning-intensive queries.