Tsinghua AI | BUPT | Shanghai University | Xiangtan | XJTU | Feb 23, 2026 | arXiv:2602.19543

Hyper-KGGen: A Skill-Driven Knowledge Extractor for High-Quality Knowledge Hypergraph Generation

Rizhuo Huang, Yifan Feng, Rundong Xue, Shihui Ying, Jun-Hai Yong, Chuan Shi, Shaoyi Du, Yue Gao

AI Summary

The paper introduces Hyper-KGGen, a skill-driven framework for knowledge hypergraph extraction that addresses the scenario gap between generic extractors and domain-specific jargon. Hyper-KGGen uses a coarse-to-fine decomposition of documents and an adaptive skill acquisition module to distill domain expertise into a Global Skill Library, guided by a stability-based feedback loop. Experiments on the newly introduced HyperDocRED benchmark demonstrate that Hyper-KGGen outperforms existing methods by leveraging evolved skills for richer guidance in multi-scenario settings.

Key Contribution

Domain-specific knowledge hypergraphs can now be extracted with significantly improved quality by dynamically learning and applying extraction skills, outperforming static few-shot learning.

Abstract

Knowledge hypergraphs surpass traditional binary knowledge graphs by encapsulating complex n-ary atomic facts, providing a more comprehensive paradigm for semantic representation. However, constructing high-quality hypergraphs remains challenging due to the scenario gap: generic extractors struggle to generalize across diverse domains with specific jargon, while existing methods often fail to balance structural skeletons with fine-grained details. To bridge this gap, we propose Hyper-KGGen, a skill-driven framework that reformulates extraction as a dynamic skill-evolving process. First, Hyper-KGGen employs a coarse-to-fine mechanism to systematically decompose documents, ensuring full-dimensional coverage from binary links to complex hyperedges. Crucially, it incorporates an adaptive skill acquisition module that actively distills domain expertise into a Global Skill Library. This is achieved via a stability-based feedback loop, where extraction stability serves as a relative reward signal to induce high-quality skills from unstable traces and missed predictions. Additionally, we present HyperDocRED, a rigorously annotated benchmark for document-level knowledge hypergraph extraction. Experiments demonstrate that Hyper-KGGen significantly outperforms strong baselines, validating that evolved skills provide substantially richer guidance than static few-shot examples in multi-scenario settings.
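
To make two ideas from the abstract concrete, the sketch below shows one way an n-ary fact might be stored as a hyperedge, and one possible reading of "extraction stability" as the fraction of repeated extraction runs in which a hyperedge reappears. This is an illustrative assumption, not the paper's implementation: the names Hyperedge, stability_scores, and unstable_edges, the frequency-based stability measure, and the example facts are all hypothetical.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Hyperedge:
    """One n-ary fact: a relation label connecting any number of entities.

    Hypothetical representation; a binary link is the special case with
    exactly two entities.
    """
    relation: str
    entities: frozenset[str]


def stability_scores(runs: list[set[Hyperedge]]) -> dict[Hyperedge, float]:
    """Fraction of extraction runs in which each hyperedge appears.

    Assumed stand-in for a stability signal: 1.0 means every run agreed.
    """
    counts = Counter(edge for run in runs for edge in run)
    return {edge: n / len(runs) for edge, n in counts.items()}


def unstable_edges(runs: list[set[Hyperedge]], threshold: float = 1.0) -> set[Hyperedge]:
    """Hyperedges whose stability falls below the threshold.

    In a feedback loop, these would be candidates for closer inspection.
    """
    return {e for e, s in stability_scores(runs).items() if s < threshold}


# A binary link and a ternary hyperedge (made-up example facts).
binary = Hyperedge("born_in", frozenset({"Marie Curie", "Warsaw"}))
nary = Hyperedge(
    "awarded",
    frozenset({"Marie Curie", "Nobel Prize in Physics", "1903"}),
)

# Three simulated extraction runs over the same document.
runs = [{binary, nary}, {binary}, {binary, nary}]
scores = stability_scores(runs)
flagged = unstable_edges(runs)
```

Here the binary link appears in all three runs while the ternary hyperedge is missed once, so only the latter is flagged. A frozen dataclass with a frozenset of entities keeps hyperedges hashable and order-independent, which makes comparing runs a matter of plain set operations.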

Topics: Data Curation & Synthetic Data; Natural Language Processing; Reasoning & Chain-of-Thought; Recommendation & Information Retrieval

Citation Metrics

Citations: 0

Influential citations: 0

References: 35

Year: 2026

Venue: N/A
