Search papers, labs, and topics across Lattice.
This paper introduces a spectrally-guided approach to designing per-instance noise schedules for pixel diffusion models, deriving theoretical bounds on minimum and maximum noise levels based on an image's spectral properties. By eliminating redundant steps, the resulting "tight" noise schedules, conditionally sampled during inference, improve generative quality, especially in low-step regimes. Empirically, the proposed method enhances the performance of single-stage pixel diffusion models.
Ditch the handcrafted noise schedules: spectral analysis unlocks per-image diffusion schedules that boost generative quality, especially when you're racing against the clock with few steps.
Denoising diffusion models are widely used for high-quality image and video generation. Their performance depends on noise schedules, which define the distribution of noise levels applied during training and the sequence of noise levels traversed during sampling. Noise schedules are typically handcrafted and require manual tuning across different resolutions. In this work, we propose a principled way to design per-instance noise schedules for pixel diffusion, based on the image's spectral properties. By deriving theoretical bounds on the efficacy of minimum and maximum noise levels, we design ``tight''noise schedules that eliminate redundant steps. During inference, we propose to conditionally sample such noise schedules. Experiments show that our noise schedules improve generative quality of single-stage pixel diffusion models, particularly in the low-step regime.