Search papers, labs, and topics across Lattice.
The paper introduces MASS-RAG, a multi-agent RAG framework that decomposes evidence processing into specialized agents for summarization, extraction, and reasoning. By synthesizing outputs from these agents, MASS-RAG facilitates improved integration of heterogeneous retrieved contexts. Experiments across four benchmarks demonstrate that MASS-RAG outperforms standard RAG baselines, especially when relevant evidence is distributed.
LLMs can reason better over noisy and distributed information when you break down RAG into specialized agent roles for summarization, extraction, and reasoning.
Large language models (LLMs) are widely used in retrieval-augmented generation (RAG) to incorporate external knowledge at inference time. However, when retrieved contexts are noisy, incomplete, or heterogeneous, a single generation process often struggles to reconcile evidence effectively. We propose \textbf{MASS-RAG}, a multi-agent synthesis approach to retrieval-augmented generation that structures evidence processing into multiple role-specialized agents. MASS-RAG applies distinct agents for evidence summarization, evidence extraction, and reasoning over retrieved documents, and combines their outputs through a dedicated synthesis stage to produce the final answer. This design exposes multiple intermediate evidence views, allowing the model to compare and integrate complementary information before answer generation. Experiments on four benchmarks show that MASS-RAG consistently improves performance over strong RAG baselines, particularly in settings where relevant evidence is distributed across retrieved contexts.