Search papers, labs, and topics across Lattice.
This paper introduces HyGRAG, a hierarchical graph retrieval-augmented generation framework that integrates contextual and relational information to enhance the retrieval capabilities of large language models. By addressing the limitations of existing entity-centric and chunk-centric approaches, HyGRAG enables the synthesis of knowledge from multiple sources, allowing for more effective multi-hop reasoning. Experimental results demonstrate a 9.7% improvement in accuracy for multi-hop reasoning tasks, showcasing the framework's potential for dynamic knowledge integration and retrieval efficiency.
HyGRAG achieves a 9.7% boost in multi-hop reasoning accuracy by seamlessly integrating contextual and relational knowledge from diverse sources.
Retrieval-Augmented Generation (RAG) has emerged as a paradigm for enhancing large language models (LLMs) with external knowledge, yet existing graph-based methods face a fundamental limitation: entity-centric and chunk-centric approaches operate on representations anchored to original text without true knowledge fusion. While entity-centric methods connect logically related content and chunk-centric methods preserve context, both retrieve information separately through similarity search, missing emergent understanding from their synthesis. In this paper, we propose HyGRAG, a hierarchical graph RAG framework that transcends source documents by addressing three core challenges: constructing summaries that genuinely integrate contextual and relational information, leveraging these synthesized representations to access emergent knowledge during retrieval, and efficiently updating hierarchical structures for dynamic corpora. Specifically, we design hierarchical index structures over hybrid graphs with both chunk and entity nodes, then iteratively cluster them and generate LLM-based summaries. Then, we design context and relation-aware retrieval that searches across all abstraction levels while expanding through community membership. Moreover, we enable dynamic knowledge update through attachment-based algorithms with only local re-summarization. Experimental results show that HyGRAG improves the average accuracy of multi-hop reasoning tasks by 9.7%, while maintaining reasonable efficiency.