Mar 16, 2026 · arXiv:2603.14946

Spiking Layer-Adaptive Magnitude-based Pruning

AI Summary

This paper introduces Spiking Layer-Adaptive Magnitude-based Pruning (SLAMP), a novel pruning framework tailored for Spiking Neural Networks (SNNs) that addresses the limitations of applying standard magnitude pruning techniques to SNNs. SLAMP formulates sparsity allocation as a temporal distortion-constrained optimization problem, considering the temporal dynamics and non-uniform timestep contributions inherent in SNNs. Experiments show SLAMP significantly reduces connectivity and spiking operations on CIFAR datasets while maintaining accuracy, facilitating efficient SNN deployment.

Key Contribution

SNNs can be pruned to extreme sparsity without sacrificing accuracy by explicitly controlling temporal distortion across layers and timesteps.

Abstract

Spiking Neural Networks (SNNs) provide energy-efficient computation, but their deployment is constrained by dense connectivity and high spiking operation costs. Existing magnitude-based pruning strategies, when naively applied to SNNs, fail to account for temporal accumulation, non-uniform timestep contributions, and membrane stability, often leading to severe performance degradation. This paper proposes Spiking Layer-Adaptive Magnitude-based Pruning (SLAMP), a theory-guided pruning framework that generalizes layer-adaptive magnitude pruning to temporal SNNs by explicitly controlling worst-case output distortion across layers and timesteps. SLAMP formulates sparsity allocation as a temporal distortion-constrained optimization problem, yielding time-aware layer importance scores that reduce to conventional layer-adaptive pruning in the single-timestep limit. An efficient two-stage procedure is derived, combining temporal score estimation, global sparsity allocation, and magnitude pruning with retraining for stability recovery. Experiments on CIFAR10, CIFAR100, and the event-based CIFAR10-DVS datasets demonstrate that SLAMP achieves substantial connectivity and spiking operation reductions while preserving accuracy, enabling efficient and deployable SNN inference.
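
The abstract does not state SLAMP's time-aware score, so the following is only a minimal sketch of the conventional single-timestep baseline it says the method reduces to. It assumes that baseline is the LAMP-style score, in which each squared weight is divided by the sum of all squared weights in the same layer that are at least as large, followed by one global threshold across layers. The function names `lamp_scores` and `global_prune_masks` are hypothetical, and nothing here models timesteps, membrane dynamics, or retraining.

```python
import numpy as np

def lamp_scores(weights):
    """LAMP-style score for every weight of a single layer (assumed baseline)."""
    w2 = np.asarray(weights, dtype=np.float64).ravel() ** 2
    order = np.argsort(w2)                      # indices, ascending by magnitude
    sorted_w2 = w2[order]
    # Suffix sums: squared mass of all weights at least as large as each one.
    suffix = np.cumsum(sorted_w2[::-1])[::-1]
    ratio = np.divide(sorted_w2, suffix,
                      out=np.zeros_like(sorted_w2), where=suffix > 0)
    scores = np.empty_like(w2)
    scores[order] = ratio                       # undo the sort
    return scores.reshape(np.shape(weights))

def global_prune_masks(layers, sparsity):
    """Boolean keep-masks after removing the globally lowest-scoring weights."""
    scores = [lamp_scores(w) for w in layers]
    flat = np.concatenate([s.ravel() for s in scores])
    k = int(sparsity * flat.size)               # number of weights to remove
    if k == 0:
        return [np.ones(np.shape(w), dtype=bool) for w in layers]
    threshold = np.partition(flat, k - 1)[k - 1]  # k-th smallest score
    # Ties at the threshold are pruned too, so slightly more than k may go.
    return [s > threshold for s in scores]

layers = [np.array([1.0, 2.0, 3.0]), np.array([0.1, 0.2])]
masks = global_prune_masks(layers, sparsity=0.4)
```

Because the largest weight in each layer always scores 1, a single global threshold still leaves every layer with at least one connection until the sparsity target forces otherwise, which is what makes the resulting allocation layer-adaptive rather than uniform.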

Architecture Design (Transformers, SSMs, MoE) · Inference & Quantization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
