Progressive Distillation for Fast Sampling of Diffusion Models

Diffusion models have recently shown great promise for generative modeling, outperforming GANs on perceptual quality and autoregressive models at density estimation. A remaining downside is their slow sampling time: generating high-quality samples takes many hundreds or thousands of model evaluations. Here we make two contributions to help eliminate this downside: First, we present new parameterizations of diffusion models that provide increased stability when using few sampling steps. Second, we present a method to distill a trained deterministic diffusion sampler, using many steps, into a new diffusion model that takes half as many sampling steps. We then keep progressively applying this distillation procedure to our model, halving the number of required sampling steps each time. On standard image generation benchmarks like CIFAR-10, ImageNet, and LSUN, we start out with state-of-the-art samplers taking as many as 8192 steps, and are able to distill down to models taking as few as 4 steps without losing much perceptual quality; achieving, for example, a FID of 3.0 on CIFAR-10 in 4 steps. Finally, we show that the full progressive distillation procedure does not take more time than it takes to train the original model, thus representing an efficient solution for generative modeling using diffusion at both train and test time.
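The sketch below illustrates the outer loop of progressive distillation as described above: a deterministic teacher sampler with N steps is repeatedly distilled into a student that needs only N/2 steps, and the student becomes the next teacher. This is a minimal illustration, not the authors' implementation; the function names (`copy_model`, `train_student_to_match_teacher`) and the generic `Model` type are assumptions standing in for the actual network, training loop, and distillation loss.

```python
# Minimal sketch of the progressive distillation outer loop (assumed interface,
# not the authors' code). Each round trains a student so that one of its
# deterministic sampling steps matches two consecutive steps of the current
# teacher, then halves the step count.

from typing import Any, Callable, Tuple

Model = Any  # stand-in for whatever network/parameter type is used


def progressive_distillation(
    teacher: Model,
    n_steps: int,
    num_rounds: int,
    copy_model: Callable[[Model], Model],
    train_student_to_match_teacher: Callable[[Model, Model, int], Model],
) -> Tuple[Model, int]:
    """Halve the number of sampling steps `num_rounds` times."""
    for _ in range(num_rounds):
        # Initialize the student from the current teacher's weights.
        student = copy_model(teacher)
        # Train the student so one student step reproduces two teacher steps
        # of the teacher's n_steps-step deterministic sampler.
        student = train_student_to_match_teacher(student, teacher, n_steps)
        # The student becomes the teacher for the next, shorter sampler.
        teacher, n_steps = student, n_steps // 2
    return teacher, n_steps


# Example of the step arithmetic: starting from an 8192-step sampler,
# eleven halvings yield a 4-step sampler, since 8192 // 2**11 == 4.
```

This only shows the halving schedule; the per-round training objective (matching two teacher steps with one student step under the paper's parameterizations) is left abstract behind `train_student_to_match_teacher`.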
