The paper introduces Meta-path-aware Universal heterogeneous Graph pre-training (MUG), a novel approach for pre-training graph encoders on heterogeneous graphs to learn transferable representations. MUG addresses the challenges of diverse node/relation types and varying meta-path semantics across datasets by using an input unification module and a shared encoder trained on meta-path views. Experiments demonstrate MUG's effectiveness on real-world datasets, suggesting improved generalization capabilities for heterogeneous graph representation learning.
Overcome the heterogeneity hurdle: MUG pre-trains a single graph encoder that transfers across diverse heterogeneous graphs by unifying node/relation types and learning consistent structural patterns across meta-paths.
Universal graph pre-training has emerged as a key paradigm in graph representation learning, offering a promising way to train encoders that learn transferable representations from unlabeled graphs and generalize effectively across a wide range of downstream tasks. However, recent work on universal graph pre-training has focused primarily on homogeneous graphs; heterogeneous graphs, which exhibit greater structural and semantic complexity, remain unexplored. This heterogeneity makes it highly challenging to train a universal encoder for diverse heterogeneous graphs: (i) the diverse node and relation types, with their dataset-specific semantics, hinder the construction of a unified representation space; (ii) the number and semantics of meta-paths vary across datasets, making encoding and aggregation patterns learned from one dataset difficult to apply to others. To address these challenges, we propose a novel Meta-path-aware Universal heterogeneous Graph pre-training (MUG) approach. Specifically, for challenge (i), MUG introduces an input unification module that integrates information from multiple node and relation types within each heterogeneous graph into a unified representation. This representation is then projected into a shared space by a dimension-aware encoder, enabling alignment across graphs with diverse schemas. Furthermore, for challenge (ii), MUG trains a shared encoder to capture consistent structural patterns across diverse meta-path views rather than relying on dataset-specific aggregation strategies, while a global objective encourages discriminability and reduces dataset-specific biases. Extensive experiments on real-world datasets demonstrate the effectiveness of MUG.
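To make the idea of a shared encoder over meta-path views more concrete, here is a minimal NumPy sketch. It is not the MUG implementation: the abstract does not specify the architecture, so the toy author/paper/venue graph, the linear input projection standing in for the input unification module and dimension-aware encoder, the single mean-aggregation layer, and every name below (`metapath_adjacency`, `shared_encoder`, `input_proj`, `shared_weight`) are assumptions chosen for illustration. The only standard ingredient is that a meta-path view can be obtained by multiplying relation adjacency matrices along the path.

```python
import numpy as np

def metapath_adjacency(relations):
    """Compose relation adjacency matrices along a meta-path, then binarize.

    Assumes the meta-path starts and ends at the same node type, so the
    result is square and self-loops can be added.
    """
    adj = relations[0]
    for rel in relations[1:]:
        adj = adj @ rel
    adj = (adj > 0).astype(float)
    np.fill_diagonal(adj, 1.0)  # self-loops keep every row sum positive
    return adj

def shared_encoder(adj, features, weight):
    """One mean-aggregation layer; the same weight is reused for every view."""
    norm_adj = adj / adj.sum(axis=1, keepdims=True)
    return np.tanh(norm_adj @ features @ weight)

rng = np.random.default_rng(0)

# Hypothetical toy graph: 4 authors, 3 papers, 2 venues.
author_paper = np.array([[1, 0, 0],
                         [1, 1, 0],
                         [0, 1, 0],
                         [0, 0, 1]], dtype=float)
paper_venue = np.array([[1, 0],
                        [1, 0],
                        [0, 1]], dtype=float)

# Two meta-path views over author nodes.
apa = metapath_adjacency([author_paper, author_paper.T])
apvpa = metapath_adjacency(
    [author_paper, paper_venue, paper_venue.T, author_paper.T]
)

# Assumption: a plain linear map from the dataset-specific feature size (8)
# into a shared space (16) stands in for the paper's input modules.
raw_features = rng.normal(size=(4, 8))
input_proj = rng.normal(size=(8, 16))
unified = raw_features @ input_proj

# The same encoder weights are applied to every meta-path view.
shared_weight = rng.normal(size=(16, 16))
views = [shared_encoder(adj, unified, shared_weight) for adj in (apa, apvpa)]
embeddings = np.mean(views, axis=0)  # simple average, not a learned fusion
```

Because `shared_weight` never depends on which meta-path produced the view, the same encoder could in principle be applied to a dataset with a different number of meta-paths. Only the per-dataset input projection would need to change, which is the kind of separation the abstract describes.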