School of Computing and Artificial Intelligence | Feb 25, 2026 | arXiv:2602.21917

Scan Clusters, Not Pixels: A Cluster-Centric Paradigm for Efficient Ultra-high-definition Image Restoration

Chen Wu, Ling Wang, Zhuoran Zheng, Yuning Cui, Zhixiong Yang, Xiangyu Chen, Weidong Jiang, Jingyuan Xia

AI Summary

The paper introduces C$^2$SSM, a visual state space model for UHD image restoration that replaces pixel-serial scanning with cluster-serial scanning to improve computational efficiency. C$^2$SSM distills UHD image features into semantic centroids using a neural-parameterized mixture model, enabling global modeling through cluster center scanning and context diffusion back to pixels. The approach achieves state-of-the-art results on five UHD restoration tasks with significantly reduced computational costs.

Key Contribution

C$^2$SSM scans a sparse set of semantic cluster centers instead of individual pixels, achieving state-of-the-art UHD image restoration at significantly reduced computational cost.

Abstract

Ultra-High-Definition (UHD) image restoration is trapped in a scalability crisis: existing models, bound to pixel-wise operations, demand unsustainable computation. While state space models (SSMs) like Mamba promise linear complexity, their pixel-serial scanning remains a fundamental bottleneck for the millions of pixels in UHD content. We ask: must we process every pixel to understand the image? This paper introduces C$^2$SSM, a visual state space model that breaks this taboo by shifting from pixel-serial to cluster-serial scanning. Our core discovery is that the rich feature distribution of a UHD image can be distilled into a sparse set of semantic centroids via a neural-parameterized mixture model. C$^2$SSM leverages this to reformulate global modeling into a novel dual-path process: it scans and reasons over a handful of cluster centers, then diffuses the global context back to all pixels through a principled similarity distribution, all while a lightweight modulator preserves fine details. This cluster-centric paradigm achieves a decisive leap in efficiency, slashing computational costs while establishing new state-of-the-art results across five UHD restoration tasks. More than a solution, C$^2$SSM charts a new course for efficient large-scale vision: scan clusters, not pixels.
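The cluster-serial idea in the abstract can be illustrated with a toy sketch. This is not the paper's implementation: the softmax-over-distances assignment stands in for the neural-parameterized mixture model, a fixed-decay linear recurrence stands in for the state space scan, and the detail-preserving modulator is omitted. All function names and parameters (`soft_assign`, `scan_centroids`, `cluster_scan`, `temperature`, `decay`, `num_clusters`) are hypothetical and chosen for illustration only.

```python
import numpy as np

def soft_assign(feats, centroids, temperature=1.0):
    """Soft similarity of each pixel feature to each centroid.

    feats: (N, C), centroids: (K, C). Returns (N, K) with rows summing to 1.
    Assumption: a softmax over negative squared distances, used here as a
    simple stand-in for a learned mixture model.
    """
    d2 = ((feats[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
    logits = -d2 / temperature
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    w = np.exp(logits)
    return w / w.sum(axis=1, keepdims=True)

def scan_centroids(centroids, decay=0.9):
    """Toy sequential scan over K centroids: h_k = decay * h_{k-1} + x_k.

    Assumption: a fixed-decay recurrence replaces the real SSM scan.
    """
    h = np.zeros(centroids.shape[1])
    out = np.empty_like(centroids)
    for k, x in enumerate(centroids):
        h = decay * h + x
        out[k] = h
    return out

def cluster_scan(feats, num_clusters=8, seed=0):
    """Distill N pixel features into K centroids, scan them, diffuse back."""
    rng = np.random.default_rng(seed)
    # Initialize centroids from randomly chosen pixel features.
    init = feats[rng.choice(len(feats), num_clusters, replace=False)]
    w = soft_assign(feats, init)                                  # (N, K)
    centroids = (w.T @ feats) / (w.sum(axis=0)[:, None] + 1e-8)   # (K, C)
    context = scan_centroids(centroids)                           # (K, C)
    # Diffuse global context back to every pixel via the similarity weights.
    return feats + w @ context, w

if __name__ == "__main__":
    pixels = np.random.default_rng(1).standard_normal((64 * 64, 16))
    out, weights = cluster_scan(pixels, num_clusters=8)
    print(out.shape, weights.shape)
```

In this sketch the sequential loop runs over K cluster centers rather than N pixels, so the serial part of the computation has length K; the per-pixel work is limited to the assignment and the diffusion step, both of which are dense matrix operations.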

Architecture Design (Transformers, SSMs, MoE) | Computer Vision | Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
