Search papers, labs, and topics across Lattice.
TopoMamba addresses limitations in visual SSMs for medical image segmentation by introducing topology-aware scanning with diagonal/anti-diagonal scans alongside standard cross-scans, and a lightweight HSIC Gate for dependence-aware feature fusion. The ScanCache mechanism optimizes scan index construction across resolutions, enhancing efficiency. Experiments on CT, dermoscopy, and endoscopy datasets demonstrate improved segmentation quality, especially for thin or curved structures, compared to CNN, Transformer, and SSM baselines.
Forget axis-aligned biases: TopoMamba's topology-aware scanning and HSIC-gated fusion unlocks more accurate medical image segmentation, especially for tricky curved structures.
Visual state-space models (SSMs) have shown strong potential for medical image segmentation, yet their effectiveness is often limited by two practical issues: axis-biased scan ordering weakens the modeling of oblique and curved structures, and naive multi-branch fusion tends to amplify redundant responses. We present TopoMamba, a topology-aware scan-and-fuse framework for segmenting heterogeneous medical visual media. The method combines a diagonal/anti-diagonal TopoA-Scan branch with the standard Cross-Scan branch to provide complementary structural priors, and introduces ScanCache, a device-aware caching mechanism that amortizes explicit scan-index construction across recurring resolutions. To fuse heterogeneous scan features efficiently, we further propose a lightweight HSIC Gate that regulates branch interaction using a dependence-aware scalar gating rule. We also instantiate a volumetric TopoMamba-3D for practical 3D clinical segmentation. Experiments on Synapse CT, ISIC 2017 dermoscopy, and CVC-ClinicDB endoscopy show that TopoMamba consistently improves segmentation quality over strong CNN, Transformer, and SSM baselines, with particularly clear gains on thin or curved targets such as the pancreas and gallbladder, while maintaining favorable deployment efficiency under dynamic input resolutions. These results suggest that topology-aware scan ordering and lightweight dependence-aware fusion form an effective and practical design for medical multimedia segmentation. The code will be made publicly available.