Search papers, labs, and topics across Lattice.
The paper introduces SpectralMamba-UNet, a frequency-disentangled framework for medical image segmentation that aims to improve both global structure and fine-grained detail modeling. It uses a Spectral Decomposition and Modeling (SDM) module with discrete cosine transform to separate low- and high-frequency features, processing the former with a frequency-domain Mamba and preserving the latter for boundary details. Experiments on five benchmarks show that SpectralMamba-UNet achieves consistent improvements in segmentation accuracy across different modalities and targets.
By decoupling texture and structure learning in the spectral domain, SpectralMamba-UNet achieves state-of-the-art medical image segmentation, outperforming standard Vision Mamba approaches.
Accurate medical image segmentation requires effective modeling of both global anatomical structures and fine-grained boundary details. Recent state space models (e.g., Vision Mamba) offer efficient long-range dependency modeling. However, their one-dimensional serialization weakens local spatial continuity and high-frequency representation. To this end, we propose SpectralMamba-UNet, a novel frequency-disentangled framework to decouple the learning of structural and textural information in the spectral domain. Our Spectral Decomposition and Modeling (SDM) module applies discrete cosine transform to decompose low- and high-frequency features, where low frequency contributes to global contextual modeling via a frequency-domain Mamba and high frequency preserves boundary-sensitive details. To balance spectral contributions, we introduce a Spectral Channel Reweighting (SCR) mechanism to form channel-wise frequency-aware attention, and a Spectral-Guided Fusion (SGF) module to achieve adaptively multi-scale fusion in the decoder. Experiments on five public benchmarks demonstrate consistent improvements across diverse modalities and segmentation targets, validating the effectiveness and generalizability of our approach.