This paper presents a theoretical analysis of Classifier-Free Guidance (CFG) in diffusion models, deriving upper bounds on the score discrepancy between conditional and unconditional distributions across timesteps. Based on this analysis, the authors propose Control Classifier-Free Guidance (C$^2$FG), a training-free, plug-in method that dynamically adjusts guidance strength based on an exponential decay control function. Experiments demonstrate the effectiveness and broad applicability of C$^2$FG across various generative tasks, showing it is orthogonal to existing guidance strategies.
Fixed guidance weights in diffusion models are suboptimal: C$^2$FG offers a training-free, theoretically grounded approach to dynamically adjust guidance strength, improving performance across diverse generative tasks.
Classifier-Free Guidance (CFG) is a cornerstone of modern conditional diffusion models, yet its reliance on fixed or heuristic dynamic guidance weights is predominantly empirical and overlooks the inherent dynamics of the diffusion process. In this paper, we provide a rigorous theoretical analysis of CFG. Specifically, we establish strict upper bounds on the score discrepancy between conditional and unconditional distributions at different timesteps of the diffusion process. This finding explains the limitations of fixed-weight strategies and establishes a principled foundation for time-dependent guidance. Motivated by this insight, we introduce Control Classifier-Free Guidance (C$^2$FG), a novel, training-free, plug-in method that aligns the guidance strength with the diffusion dynamics via an exponential decay control function. Extensive experiments demonstrate that C$^2$FG is effective and broadly applicable across diverse generative tasks, while also exhibiting orthogonality to existing strategies.
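To make the idea of time-dependent guidance concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not give the control function's exact form, so the schedule below (its parameters `w_max`, `w_min`, `decay_rate`, and the choice to decay as normalized time `t` increases) is purely an illustrative assumption. Only the combination rule in `guided_noise` is the standard CFG formula.

```python
import numpy as np


def decayed_guidance_weight(t, w_max=7.5, w_min=1.0, decay_rate=3.0):
    """Hypothetical exponential-decay guidance schedule.

    ASSUMPTION: t is a normalized time in [0, 1], and the weight decays
    from w_max at t = 0 toward w_min as t grows. The functional form,
    decay direction, and default values are illustrative, not taken
    from the paper.
    """
    t = np.clip(t, 0.0, 1.0)
    return w_min + (w_max - w_min) * np.exp(-decay_rate * t)


def guided_noise(eps_uncond, eps_cond, weight):
    """Standard CFG combination of unconditional and conditional predictions."""
    return eps_uncond + weight * (eps_cond - eps_uncond)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Stand-ins for the two noise predictions of a diffusion model.
    eps_u = rng.normal(size=(4,))
    eps_c = rng.normal(size=(4,))
    for t in (0.0, 0.5, 1.0):
        w = decayed_guidance_weight(t)
        print(f"t={t:.1f}  weight={w:.3f}  guided={guided_noise(eps_u, eps_c, w)}")
```

Because the schedule only changes the scalar passed to the combination rule, a sampler that already uses a fixed weight could swap it in without retraining, which is the sense in which such a method is training-free and plug-in.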