DAMO · Apr 17, 2026 · arXiv:2604.16044

Elucidating the SNR-t Bias of Diffusion Probabilistic Models

AI Summary

The paper identifies a Signal-to-Noise Ratio-timestep (SNR-t) bias in diffusion models: during inference, the SNR of a denoising sample is misaligned with its timestep, which leads to error accumulation. The authors show through empirical evidence and theoretical analysis that this bias impairs generation quality. To address it, they propose a differential correction method that decomposes samples into frequency components and corrects each component individually, improving generation quality across various diffusion models and datasets.
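To make the SNR-timestep coupling concrete, here is a minimal sketch that assumes a standard DDPM-style linear beta schedule, where the training-time SNR at step t is alpha_bar_t / (1 - alpha_bar_t). The schedule parameters and the helper `nearest_timestep` are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def snr_schedule(num_steps=1000, beta_start=1e-4, beta_end=2e-2):
    """Training-time SNR per timestep for a linear beta schedule (assumed)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    alpha_bar = np.cumprod(1.0 - betas)      # cumulative signal retention
    return alpha_bar / (1.0 - alpha_bar)     # SNR(t) = alpha_bar_t / (1 - alpha_bar_t)

def nearest_timestep(snr_value, snr):
    """Hypothetical helper: timestep whose training SNR is closest (in log space)."""
    return int(np.argmin(np.abs(np.log(snr) - np.log(snr_value))))

snr = snr_schedule()
# During training, a sample at step 300 has exactly snr[300].
# If inference errors halve its SNR, it "looks like" a later timestep.
matched = nearest_timestep(0.5 * snr[300], snr)
print(matched > 300)
```

Under this schedule the SNR decreases strictly with t, so any sample whose actual SNR drifts from the scheduled value matches a different timestep than the one the network is conditioned on.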

Key Contribution

Diffusion models are making mistakes because they're losing track of time, but a simple frequency-aware correction can get them back on track.

Abstract

Diffusion Probabilistic Models have demonstrated remarkable performance across a wide range of generative tasks. However, we have observed that these models often suffer from a Signal-to-Noise Ratio-timestep (SNR-t) bias. This bias refers to the misalignment between the SNR of the denoising sample and its corresponding timestep during the inference phase. Specifically, during training, the SNR of a sample is strictly coupled with its timestep. However, this correspondence is disrupted during inference, leading to error accumulation and impairing the generation quality. We provide comprehensive empirical evidence and theoretical analysis to substantiate this phenomenon and propose a simple yet effective differential correction method to mitigate the SNR-t bias. Recognizing that diffusion models typically reconstruct low-frequency components before focusing on high-frequency details during the reverse denoising process, we decompose samples into various frequency components and apply differential correction to each component individually. Extensive experiments show that our approach significantly improves the generation quality of various diffusion models (IDDPM, ADM, DDIM, A-DPM, EA-DPM, EDM, PFGM++, and FLUX) on datasets of various resolutions with negligible computational overhead. The code is at https://github.com/AMAP-ML/DCW.
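The idea of correcting frequency components separately can be illustrated with a rough sketch. Everything here is an assumption for illustration: the radial FFT mask, the two-band split, the update rule, and the weights `lam_low` and `lam_high` are hypothetical and are not the paper's method; the authors' actual implementation is in the linked repository.

```python
import numpy as np

def split_frequencies(x, cutoff=0.25):
    """Split a 2D array into low- and high-frequency parts (assumed FFT mask)."""
    h, w = x.shape
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    low_mask = np.sqrt(fx**2 + fy**2) <= cutoff   # keep frequencies inside the radius
    low = np.fft.ifft2(np.fft.fft2(x) * low_mask).real
    high = x - low                                # remainder is the high band
    return low, high

def bandwise_correction(x, ref, lam_low=0.1, lam_high=0.02, cutoff=0.25):
    """Hypothetical update: scale the difference (x - ref) separately per band."""
    diff_low, diff_high = split_frequencies(x - ref, cutoff)
    return x + lam_low * diff_low + lam_high * diff_high

rng = np.random.default_rng(0)
x = rng.standard_normal((16, 16))     # stand-in for the current denoising sample
ref = rng.standard_normal((16, 16))   # stand-in for a reference signal
corrected = bandwise_correction(x, ref)
print(corrected.shape)
```

Using separate weights per band reflects the abstract's observation that low-frequency content is reconstructed before high-frequency detail, so the two bands may warrant different correction strengths at a given step.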

Computer Vision

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
