Mar 5, 2026arXiv:2603.05395

On the Necessity of Learnable Sheaf Laplacians

Ferran Hernandez Caralt, M. Catala, Mar Gonzàlez i Català, A. Bazaga, Adrián Bazaga, Pietro Lio

AI Summary

This paper investigates whether learnable restriction maps in Sheaf Neural Networks (SNNs) are necessary for mitigating oversmoothing on heterophilous graphs, as originally hypothesized. They introduce an Identity Sheaf Network baseline, where restriction maps are fixed to the identity, and compare its performance to SNN variants across five heterophilic benchmarks. The key finding is that the identity baseline achieves comparable performance to SNNs, and that the diffusion-based analysis of SNNs does not empirically reflect the oversmoothing behavior in trained networks.

Key Contribution

Turns out, the fancy learnable restriction maps in Sheaf Neural Networks might not be worth the complexity – a simple identity mapping works just as well for mitigating oversmoothing on heterophilic graphs.

Abstract

Sheaf Neural Networks (SNNs) were introduced as an extension of Graph Convolutional Networks to address oversmoothing on heterophilous graphs by attaching a sheaf to the input graph and replacing the adjacency-based operator with a sheaf Laplacian defined by (learnable) restriction maps. Prior work motivates this design through theoretical properties of sheaf diffusion and the kernel of the sheaf Laplacian, suggesting that suitable non-identity restriction maps can avoid representations converging to constants across connected components. Since oversmoothing can also be mitigated through residual connections and normalization, we revisit a trivial sheaf construction to ask whether the additional complexity of learning restriction maps is necessary. We introduce an Identity Sheaf Network baseline, where all restriction maps are fixed to the identity, and use it to ablate the empirical improvements reported by sheaf-learning architectures. Across five popular heterophilic benchmarks, the identity baseline achieves comparable performance to a range of SNN variants. Finally, we introduce the Rayleigh quotient as a normalized measure for comparing oversmoothing across models and show that, in trained networks, the behavior predicted by the diffusion-based analysis of SNNs is not reflected empirically. In particular, Identity Sheaf Networks do not appear to suffer more significant oversmoothing than their SNN counterparts.

Architecture Design (Transformers, SSMs, MoE)Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References15

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

On the Necessity of Learnable Sheaf Laplacians

Related Papers