The paper introduces the Spurious Correlation Learning Graph Neural Network (SCL-GNN), which improves GNN generalization by mitigating spurious correlations between node features and labels. SCL-GNN uses the Hilbert-Schmidt Independence Criterion (HSIC) to quantify and reduce irrelevant correlations between node representations and class scores. Experiments on real-world and synthetic datasets show that SCL-GNN outperforms existing methods in both independent and identically distributed (IID) and out-of-distribution (OOD) settings, indicating improved robustness and generalization.
GNNs can be made more robust to distribution shifts by explicitly identifying and mitigating spurious correlations between node features and labels using a Hilbert-Schmidt Independence Criterion (HSIC) based regularization.
Graph Neural Networks (GNNs) have demonstrated remarkable success across diverse tasks. However, their generalization capability is often hindered by spurious correlations between node features and labels in the graph. Our analysis reveals that GNNs tend to exploit imperceptible statistical correlations in training data, even when such correlations are unreliable for prediction. To address this challenge, we propose the Spurious Correlation Learning Graph Neural Network (SCL-GNN), a novel framework designed to enhance generalization on both Independent and Identically Distributed (IID) and Out-of-Distribution (OOD) graphs. SCL-GNN incorporates a principled spurious correlation learning mechanism, leveraging the Hilbert-Schmidt Independence Criterion (HSIC) to quantify correlations between node representations and class scores. This enables the model to identify and mitigate irrelevant but influential spurious correlations effectively. Additionally, we introduce an efficient bi-level optimization strategy to jointly optimize modules and GNN parameters, preventing overfitting. Extensive experiments on real-world and synthetic datasets demonstrate that SCL-GNN consistently outperforms state-of-the-art baselines under various distribution shifts, highlighting its robustness and generalization capabilities.
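To make the HSIC measurement mentioned in the abstract concrete, the following is a minimal sketch of the standard biased empirical HSIC estimator, tr(KHLH) / (n - 1)^2, with Gaussian (RBF) kernels. It is not taken from the paper: the kernel choice, the bandwidths, and the names `rbf_kernel`, `hsic`, `z`, `scores_dependent`, and `scores_independent` are assumptions made for illustration, and the synthetic arrays merely stand in for node representations and class scores. How SCL-GNN actually uses HSIC inside its training objective may differ.

```python
import numpy as np


def rbf_kernel(x, sigma=1.0):
    """Gaussian (RBF) kernel matrix for the rows of x (shape: n x d)."""
    sq_norms = np.sum(x ** 2, axis=1)
    # Pairwise squared Euclidean distances, clipped to guard against
    # small negative values caused by floating-point round-off.
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (x @ x.T)
    sq_dists = np.maximum(sq_dists, 0.0)
    return np.exp(-sq_dists / (2.0 * sigma ** 2))


def hsic(x, y, sigma_x=1.0, sigma_y=1.0):
    """Biased empirical HSIC estimate: tr(K H L H) / (n - 1)^2.

    x and y hold one sample per row and must have the same number of rows.
    Larger values indicate stronger statistical dependence.
    """
    n = x.shape[0]
    K = rbf_kernel(x, sigma_x)
    L = rbf_kernel(y, sigma_y)
    H = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return float(np.trace(K @ H @ L @ H)) / (n - 1) ** 2


# Illustrative data only (hypothetical stand-ins, not from the paper).
rng = np.random.default_rng(0)
z = rng.normal(size=(200, 4))                                   # "node representations"
scores_dependent = z[:, :2] + 0.1 * rng.normal(size=(200, 2))   # depends on z
scores_independent = rng.normal(size=(200, 2))                  # independent of z

hsic_dep = hsic(z, scores_dependent)
hsic_ind = hsic(z, scores_independent)
print(f"HSIC (dependent):   {hsic_dep:.5f}")
print(f"HSIC (independent): {hsic_ind:.5f}")
```

In this toy setup the dependent pair should score noticeably higher than the independent pair. A regularizer in the spirit of the abstract would penalize such a dependence score for the components judged to be spurious, but the exact form of that penalty is not specified here.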