Search papers, labs, and topics across Lattice.
This paper introduces a training-free framework for traversing the rate-distortion-perception (RDP) surface in lossy compression by leveraging pre-trained diffusion models. The approach integrates a reverse channel coding (RCC) module with a score-scaled probability flow ODE decoder, proven optimal for the distortion-perception tradeoff under AWGN observations. Experiments across datasets validate the framework's flexibility and effectiveness in navigating the RDP tradeoff without retraining.
Achieve adaptive, perception-aware image compression without any training by simply steering a pre-trained diffusion model.
The rate-distortion-perception (RDP) tradeoff characterizes the fundamental limits of lossy compression by jointly considering bitrate, reconstruction fidelity, and perceptual quality. While recent neural compression methods have improved perceptual performance, they typically operate at fixed points on the RDP surface, requiring retraining to target different tradeoffs. In this work, we propose a training-free framework that leverages pre-trained diffusion models to traverse the entire RDP surface. Our approach integrates a reverse channel coding (RCC) module with a novel score-scaled probability flow ODE decoder. We theoretically prove that the proposed diffusion decoder is optimal for the distortion-perception tradeoff under AWGN observations and that the overall framework with the RCC module achieves the optimal RDP function in the Gaussian case. Empirical results across multiple datasets demonstrate the framework's flexibility and effectiveness in navigating the ternary RDP tradeoff using pre-trained diffusion models. Our results establish a practical and theoretically grounded approach to adaptive, perception-aware compression.