Search papers, labs, and topics across Lattice.
IndexRAG pre-computes "bridging facts" by identifying shared entities across documents and generating explicit statements linking them, enabling more effective cross-document reasoning. This approach shifts the computational burden from online inference to offline indexing, avoiding the need for iterative retrieval or graph-based processing. Experiments on multi-hop QA benchmarks demonstrate that IndexRAG significantly improves F1 scores compared to naive RAG and outperforms graph-based methods when combined with IRCoT.
Achieve state-of-the-art multi-hop question answering by pre-computing bridging facts at index time, eliminating the need for complex online reasoning or graph traversal.
Multi-hop question answering (QA) requires reasoning across multiple documents, yet existing retrieval-augmented generation (RAG) approaches address this either through graph-based methods requiring additional online processing or iterative multi-step reasoning. We present IndexRAG, a novel approach that shifts cross-document reasoning from online inference to offline indexing. IndexRAG identifies bridge entities shared across documents and generates bridging facts as independently retrievable units, requiring no additional training or fine-tuning. Experiments on three widely-used multi-hop QA benchmarks (HotpotQA, 2WikiMultiHopQA, MuSiQue) show that IndexRAG improves F1 over Naive RAG by 4.6 points on average, while requiring only single-pass retrieval and a single LLM call at inference time. When combined with IRCoT, IndexRAG outperforms all graph-based baselines on average, including HippoRAG and FastGraphRAG, while relying solely on flat retrieval. Our code will be released upon acceptance.