Feb 18, 2026

arXiv:2602.16498

Fast and Scalable Analytical Diffusion

AI Summary

This paper addresses the scalability bottleneck of analytical diffusion models, which require a full-dataset scan at every timestep. It introduces the concept of Posterior Progressive Concentration: the effective support for denoising shrinks as the signal-to-noise ratio increases. Building on this, the authors propose Dynamic Time-Aware Golden Subset Diffusion (GoldDiff), a training-free framework that dynamically identifies a "Golden Subset" of the data for inference, decoupling inference complexity from dataset size. GoldDiff achieves significant speedups (e.g., 71× on AFHQ) and enables the first successful scaling of analytical diffusion to ImageNet-1K, while matching or improving on the performance of full-scan baselines.
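
For orientation, the full-scan cost can be read off the standard empirical-Bayes denoiser. The block below is a sketch in assumed notation: a Gaussian forward process $x_t = \alpha_t x_0 + \sigma_t \epsilon$, training points $x_1, \dots, x_N$, a candidate subset $S$, and a data radius $R$. None of these symbols are taken from the paper, and the final inequality is an elementary triangle-inequality estimate, not the paper's bound.

```latex
% Posterior mean over the full training set (one pass over all N points):
\hat{x}_0(x_t) = \sum_{i=1}^{N} w_i(x_t)\, x_i,
\qquad
w_i(x_t) = \frac{\exp\!\left(-\lVert x_t - \alpha_t x_i \rVert^2 / 2\sigma_t^2\right)}
                {\sum_{j=1}^{N} \exp\!\left(-\lVert x_t - \alpha_t x_j \rVert^2 / 2\sigma_t^2\right)}

% Restricting to a subset S and renormalizing the weights:
\hat{x}_0^{S}(x_t) = \frac{1}{W_S} \sum_{i \in S} w_i(x_t)\, x_i,
\qquad
W_S = \sum_{i \in S} w_i(x_t)

% If \lVert x_i \rVert \le R for every i, the truncation error satisfies:
\lVert \hat{x}_0(x_t) - \hat{x}_0^{S}(x_t) \rVert \le 2\,(1 - W_S)\,R
```

Evaluating all $N$ weights costs $O(Nd)$ per step for $d$-dimensional data. As $\sigma_t$ shrinks, the softmax weights concentrate on the points nearest to $x_t$, so $W_S$ approaches 1 even for a small $S$, which is consistent with the concentration effect the summary describes.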

Key Contribution

Analytical diffusion models can now scale to ImageNet-1K without training, thanks to a clever "Golden Subset" selection strategy that avoids full-dataset scans.

Abstract

Analytical diffusion models offer a mathematically transparent path to generative modeling by formulating the denoising score as an empirical-Bayes posterior mean. However, this interpretability comes at a prohibitive cost: the standard formulation necessitates a full-dataset scan at every timestep, scaling linearly with dataset size. In this work, we present the first systematic study addressing this scalability bottleneck. We challenge the prevailing assumption that the entire training data is necessary, uncovering the phenomenon of Posterior Progressive Concentration: the effective golden support of the denoising score is not static but shrinks asymptotically from the global manifold to a local neighborhood as the signal-to-noise ratio increases. Capitalizing on this, we propose Dynamic Time-Aware Golden Subset Diffusion (GoldDiff), a training-free framework that decouples inference complexity from dataset size. Instead of static retrieval, GoldDiff uses a coarse-to-fine mechanism to dynamically pinpoint the "Golden Subset" for inference. Theoretically, we derive rigorous bounds guaranteeing that our sparse approximation converges to the exact score. Empirically, GoldDiff achieves a $71\times$ speedup on AFHQ while matching or achieving even better performance than full-scan baselines. Most notably, we demonstrate the first successful scaling of analytical diffusion to ImageNet-1K, unlocking a scalable, training-free paradigm for large-scale generative modeling.
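
The concentration effect described in the abstract can be illustrated on a toy problem. The following is a minimal sketch, not the paper's implementation: it assumes a Gaussian forward process `x_t = alpha * x_0 + sigma * eps`, computes the posterior mean as a softmax-weighted average of training points, and compares a full scan against a small subset. The subset is chosen by a plain k-nearest-neighbor rule, which is a stand-in for illustration only; GoldDiff's coarse-to-fine mechanism is not specified here. All function names (`posterior_mean`, `top_k_subset`, and so on) are hypothetical.

```python
import math

def sq_dist(x_t, x_i, alpha):
    """Squared distance between the noisy point and a scaled training point."""
    return sum((a - alpha * b) ** 2 for a, b in zip(x_t, x_i))

def posterior_mean(x_t, data, alpha, sigma, indices=None):
    """Softmax-weighted mean of training points: an estimate of E[x_0 | x_t].

    indices=None scans the whole dataset; otherwise only the given subset.
    """
    idx = list(range(len(data))) if indices is None else list(indices)
    logits = [-sq_dist(x_t, data[i], alpha) / (2.0 * sigma ** 2) for i in idx]
    m = max(logits)  # subtract the max for numerical stability
    w = [math.exp(l - m) for l in logits]
    z = sum(w)
    return [sum(wi * data[i][d] for wi, i in zip(w, idx)) / z
            for d in range(len(x_t))]

def top_k_subset(x_t, data, alpha, k):
    """Hypothetical selection rule (not the paper's): k nearest scaled points."""
    return sorted(range(len(data)),
                  key=lambda i: sq_dist(x_t, data[i], alpha))[:k]

def max_abs_error(u, v):
    """Largest coordinate-wise difference between two vectors."""
    return max(abs(a - b) for a, b in zip(u, v))

# Toy dataset: a 10 x 10 grid of 2-D points.
data = [(float(i), float(j)) for i in range(10) for j in range(10)]
x_t, alpha, k = (3.1, 4.2), 1.0, 4
subset = top_k_subset(x_t, data, alpha, k)

# Low noise (high SNR): the posterior mass sits on the nearest neighbours.
err_low = max_abs_error(posterior_mean(x_t, data, alpha, 0.1),
                        posterior_mean(x_t, data, alpha, 0.1, subset))
# High noise (low SNR): the same small subset misses most of the mass.
err_high = max_abs_error(posterior_mean(x_t, data, alpha, 5.0),
                         posterior_mean(x_t, data, alpha, 5.0, subset))
print(f"k={k}: error at sigma=0.1 is {err_low:.2e}, at sigma=5.0 is {err_high:.2e}")
```

On this toy grid, four neighbors reproduce the full-scan estimate almost exactly at low noise, while the same four neighbors give a visibly different answer at high noise. That gap is one way to see why a subset whose size depends on the timestep could be preferable to a fixed-size retrieval, under the assumptions stated above.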

Architecture Design (Transformers, SSMs, MoE)

Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
