Search papers, labs, and topics across Lattice.
The paper introduces HyperRAG, a retrieval-augmented generation framework designed to leverage n-ary hypergraphs for more efficient and expressive reasoning in multi-hop question answering. HyperRAG incorporates two retrieval variants: HyperRetriever, which learns structural-semantic reasoning to construct query-conditioned relational chains, and HyperMemory, which uses the LLM's parametric memory to guide beam search over n-ary facts. Experiments on WikiTopics and open-domain QA benchmarks demonstrate that HyperRAG, particularly HyperRetriever, achieves improved answer accuracy and interpretable multi-hop reasoning compared to existing graph-based RAG methods.
N-ary hypergraphs can boost RAG performance by encoding richer relational facts, enabling more accurate and interpretable multi-hop reasoning for both open and closed-domain question answering.
Graph-based retrieval-augmented generation (RAG) methods, typically built on knowledge graphs (KGs) with binary relational facts, have shown promise in multi-hop open-domain QA. However, their rigid retrieval schemes and dense similarity search often introduce irrelevant context, increase computational overhead, and limit relational expressiveness. In contrast, n-ary hypergraphs encode higher-order relational facts that capture richer inter-entity dependencies and enable shallower, more efficient reasoning paths. To address this limitation, we propose HyperRAG, a RAG framework tailored for n-ary hypergraphs with two complementary retrieval variants: (i) HyperRetriever learns structural-semantic reasoning over n-ary facts to construct query-conditioned relational chains. It enables accurate factual tracking, adaptive high-order traversal, and interpretable multi-hop reasoning under context constraints. (ii) HyperMemory leverages the LLM's parametric memory to guide beam search, dynamically scoring n-ary facts and entities for query-aware path expansion. Extensive evaluations on WikiTopics (11 closed-domain datasets) and three open-domain QA benchmarks (HotpotQA, MuSiQue, and 2WikiMultiHopQA) validate HyperRAG's effectiveness. HyperRetriever achieves the highest answer accuracy overall, with average gains of 2.95% in MRR and 1.23% in Hits@10 over the strongest baseline. Qualitative analysis further shows that HyperRetriever bridges reasoning gaps through adaptive and interpretable n-ary chain construction, benefiting both open and closed-domain QA.