Search papers, labs, and topics across Lattice.
This paper introduces ZEUS, a training-free acceleration method for diffusion models that uses a second-order predictor to reduce the number of denoiser evaluations during inference. ZEUS leverages the observation that with aggressive step skipping, only the current output and its backward difference provide causally grounded information for prediction. By interleaving predictions and avoiding back-to-back extrapolations, ZEUS achieves up to 3.2x speedup in image and video generation without sacrificing perceptual quality or requiring architectural modifications.
Diffusion models can run over 3x faster without retraining or architectural changes, thanks to a clever second-order prediction scheme.
Denoising generative models deliver high-fidelity generation but remain bottlenecked by inference latency due to the many iterative denoiser calls required during sampling. Training-free acceleration methods reduce latency by either sparsifying the model architecture or shortening the sampling trajectory. Current training-free acceleration methods are more complex than necessary: higher-order predictors amplify error under aggressive speedups, and architectural modifications hinder deployment. Beyond 2x acceleration, step skipping creates structural scarcity -- at most one fresh evaluation per local window -- leaving the computed output and its backward difference as the only causally grounded information. Based on this, we propose ZEUS, an acceleration method that predicts reduced denoiser evaluations using a second-order predictor, and stabilizes aggressive consecutive skipping with an interleaved scheme that avoids back-to-back extrapolations. ZEUS adds essentially zero overhead, no feature caches, and no architectural modifications, and it is compatible with different backbones, prediction objectives, and solver choices. Across image and video generation, ZEUS consistently improves the speed-fidelity performance over recent training-free baselines, achieving up to 3.2x end-to-end speedup while maintaining perceptual quality. Our code is available at: https://github.com/Ting-Justin-Jiang/ZEUS.