Jun 4, 2026 | arXiv:2606.06273

Adapting Diffusion Language Models for Lossless Pixel-Level Image Transmission

Tianqi Ren, Rongpeng Li, Xianfu Chen, Yingyu Li, Zhifeng Zhao

AI Summary

This paper introduces DDM-SSCC, a discrete-diffusion-model-based separate source-channel coding framework for lossless pixel-level image transmission over noisy channels. The source codec adapts a diffusion language model to pixel-token restoration and performs synchronized reverse arithmetic coding under bidirectional attention, so multiple masked tokens can be coded within one reverse denoising step. Experiments on CIFAR10, DIV2K-LR-X4, and Kodak over additive white Gaussian noise and Rayleigh fading channels show better exact-recovery performance than representative lossless and semantic communication baselines, and ablation studies support the Halton-guided denoising order, the mask-ratio-aware cosine schedule, and the temperature calibration module.

Key Contribution

DDM-SSCC achieves better exact-recovery performance than representative lossless and semantic communication baselines by combining a diffusion language model adapted to pixel-token restoration with synchronized reverse arithmetic coding.

Abstract

Lossless pixel-level image transmission is a fundamental regime beyond semantic communications, because exact recovery requires both accurate symbol probability modeling and reliable delivery over noisy channels. This paper proposes DDM-SSCC, a discrete-diffusion-model-based separate source-channel coding framework for lossless image transmission. Different from raster-order autoregressive coding, the proposed source codec adapts a diffusion language model to pixel-token restoration and performs synchronized reverse arithmetic coding under bidirectional attention, allowing multiple masked tokens to be coded within one reverse denoising step. This progressive restoration process also yields a more favorable source representation for noisy transmission, since newly restored tokens can serve as bidirectional context in subsequent denoising steps. To bridge the gap between generation-oriented masked denoising and lossless arithmetic coding, we further introduce a Halton-guided denoising order, a mask-ratio-aware cosine schedule, and a lightweight temperature calibration module. These designs respectively improve spatial coverage, adapt the denoising pace to context reliability, and calibrate the probability tables used by arithmetic coding. Experiments on CIFAR10, DIV2K-LR-X4, and Kodak over additive white Gaussian noise and Rayleigh fading channels show that DDM-SSCC achieves better exact-recovery performance than representative lossless and semantic communication baselines, while ablation studies verify the effectiveness of the proposed denoising order, schedule, and calibration modules.
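To make the idea of a Halton-guided denoising order paired with a cosine schedule more concrete, the following is a minimal sketch, not the paper's implementation. It assumes a small 2D token grid, a Halton sequence in bases 2 and 3 to spread positions across that grid, and a plain cosine masking schedule of the kind commonly used in masked generative models. The abstract does not give the exact mask-ratio-aware schedule or the temperature calibration, so neither is reproduced here. All function names (`radical_inverse`, `halton_order`, `cosine_step_sizes`, `denoising_groups`) are hypothetical.

```python
import math


def radical_inverse(index: int, base: int) -> float:
    """Van der Corput radical inverse of `index` in the given base."""
    result, fraction = 0.0, 1.0 / base
    while index > 0:
        result += (index % base) * fraction
        index //= base
        fraction /= base
    return result


def halton_order(height: int, width: int) -> list[tuple[int, int]]:
    """Visit every cell of a height x width grid once, in Halton-sequence order.

    Assumption: bases 2 and 3; repeated cells are skipped until all are covered.
    """
    seen, order, i = set(), [], 1
    while len(order) < height * width:
        row = min(int(radical_inverse(i, 2) * height), height - 1)
        col = min(int(radical_inverse(i, 3) * width), width - 1)
        if (row, col) not in seen:
            seen.add((row, col))
            order.append((row, col))
        i += 1
    return order


def cosine_step_sizes(num_tokens: int, num_steps: int) -> list[int]:
    """Tokens to unmask per step so the masked ratio follows cos(pi/2 * t/T)."""
    remaining = [
        int(num_tokens * math.cos(0.5 * math.pi * t / num_steps))
        for t in range(num_steps + 1)
    ]
    remaining[-1] = 0  # guard against floating-point residue at t = T
    return [remaining[t - 1] - remaining[t] for t in range(1, num_steps + 1)]


def denoising_groups(height: int, width: int, num_steps: int):
    """Split the Halton order into per-step groups sized by the cosine schedule."""
    order = halton_order(height, width)
    groups, start = [], 0
    for size in cosine_step_sizes(len(order), num_steps):
        groups.append(order[start:start + size])
        start += size
    return groups


if __name__ == "__main__":
    for step, group in enumerate(denoising_groups(8, 8, 6), start=1):
        print(f"step {step}: restore {len(group)} tokens, first few {group[:3]}")
```

Under these assumptions, early steps restore only a few spatially scattered tokens while context is sparse, and later steps restore larger groups once more bidirectional context is available. In a codec of the kind the abstract describes, encoder and decoder would need to derive the same groups deterministically so that their arithmetic-coding probability tables stay synchronized.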

Computer Vision, Multimodal Models

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
