Search papers, labs, and topics across Lattice.
This paper introduces DDM-SSCC, a novel discrete-diffusion-model-based framework for lossless pixel-level image transmission that enhances symbol probability modeling and reliable delivery over noisy channels. By adapting a diffusion language model for pixel-token restoration and employing synchronized reverse arithmetic coding with bidirectional attention, the method allows for multiple masked tokens to be coded simultaneously, improving recovery accuracy. Experimental results demonstrate that DDM-SSCC outperforms existing lossless and semantic communication methods, particularly in challenging noise conditions, while ablation studies confirm the efficacy of its innovative design components.
Achieving superior lossless image transmission, DDM-SSCC outperforms traditional methods by leveraging a diffusion model for pixel-token restoration and synchronized coding.
Lossless pixel-level image transmission is a fundamental regime beyond semantic communications, because exact recovery requires both accurate symbol probability modeling and reliable delivery over noisy channels. This paper proposes DDM-SSCC, a discrete-diffusion-model-based separate source-channel coding framework for lossless image transmission. Different from raster-order autoregressive coding, the proposed source codec adapts a diffusion language model to pixel-token restoration and performs synchronized reverse arithmetic coding under bidirectional attention, allowing multiple masked tokens to be coded within one reverse denoising step. This progressive restoration process also yields a more favorable source representation for noisy transmission, since newly restored tokens can serve as bidirectional context in subsequent denoising steps. To bridge the gap between generation-oriented masked denoising and lossless arithmetic coding, we further introduce a Halton-guided denoising order, a mask-ratio-aware cosine schedule, and a lightweight temperature calibration module. These designs respectively improve spatial coverage, adapt the denoising pace to context reliability, and calibrate the probability tables used by arithmetic coding. Experiments on CIFAR10, DIV2K-LR-X4, and Kodak over additive white Gaussian noise and Rayleigh fading channels show that DDM-SSCC achieves better exact-recovery performance than representative lossless and semantic communication baselines, while ablation studies verify the effectiveness of the proposed denoising order, schedule, and calibration modules.