Search papers, labs, and topics across Lattice.
This paper introduces agentic hybrid RAG, a novel evidence-grounded retrieval-augmented generation framework specifically designed for muon collider research. By integrating a hybrid retriever that combines sparse lexical and dense semantic retrieval with an agentic reasoning module, the framework effectively enhances query decomposition, evidence expansion, and answer generation. Extensive evaluations demonstrate that this approach outperforms traditional retrieval and RAG baselines in terms of retrieval effectiveness, answer quality, evidence coverage, and factual grounding, establishing a robust foundation for future high-energy physics analysis workflows.
Agentic hybrid RAG outperforms existing methods in retrieving and synthesizing evidence for muon collider research, setting a new standard for scientific question answering in high-energy physics.
Muon collider research spans accelerator physics, detector instrumentation, and high-energy phenomenology, with relevant evidence scattered across a rapidly expanding and heterogeneous body of scientific literature. As high-energy physics (HEP) increasingly explores agent-assisted analysis workflows, efficiently locating, integrating, and verifying scientific evidence becomes an essential capability. While retrieval-augmented generation (RAG) offers a promising framework for scientific question answering, integrating agentic reasoning without compromising retrieval precision remains a key challenge. In this work, we present agentic hybrid RAG, an evidence-grounded RAG framework for muon collider research. The framework combines a hybrid retriever, integrating sparse lexical and dense semantic retrieval, with an agentic reasoning module for query decomposition, evidence expansion, and grounded answer generation. To enable systematic evaluation, we construct the first benchmark for retrieval-augmented scientific question answering in the muon collider domain, comprising a curated literature corpus together with dedicated retrieval and answer-generation benchmarks covering major detector and physics research topics. Extensive evaluation shows that hybrid retrieval provides the strongest retrieval backbone, while agentic reasoning is most effective for controlled evidence expansion and answer synthesis. Built on this principle, agentic hybrid RAG consistently outperforms representative retrieval and RAG baselines in retrieval effectiveness, answer quality, evidence coverage, and factual grounding. Together, the benchmark and framework provide a foundation for evidence-grounded scientific question answering and future HEP analysis agents operating over large-scale scientific literature.