ADIR: Adaptive Diffusion for Image Reconstruction

In recent years, denoising diffusion models have demonstrated outstanding image generation performance. The information on natural images captured by these models is useful for many image reconstruction applications, where the task is to restore a clean image from its degraded observations. In this work, we propose a conditional sampling scheme that exploits the prior learned by diffusion models while retaining agreement with the observations. We then combine it with a novel approach for adapting pre-trained diffusion denoising networks to their input. We examine two adaption strategies: the first uses only the degraded image, while the second, which we advocate, is performed using images that are “nearest neighbors” of the degraded image, retrieved from a diverse dataset using an off-the-shelf visual-language model. To evaluate our method, we test it on two state-of-the-art publicly available diffusion models, Stable Diffusion and Guided Diffusion. We show that our proposed ‘adaptive diffusion for image reconstruction’ (ADIR) approach achieves a significant improvement in the super-resolution, deblurring, and text-based editing tasks. Our code and additional results are available online in the project web page.

[1]  M. Irani,et al.  Imagic: Text-Based Real Image Editing with Diffusion Models , 2022, ArXiv.

[2]  Alexei A. Efros,et al.  Test-Time Training with Masked Autoencoders , 2022, NeurIPS.

[3]  Jonathan Ho Classifier-Free Diffusion Guidance , 2022, ArXiv.

[4]  Dani Lischinski,et al.  Blended Latent Diffusion , 2022, ArXiv.

[5]  J. C. Ye,et al.  Improving Diffusion Models for Inverse Problems using Manifold Constraints , 2022, Neural Information Processing Systems.

[6]  Ryan A. Rossi,et al.  CyCLIP: Cyclic Contrastive Language-Image Pretraining , 2022, NeurIPS.

[7]  David J. Fleet,et al.  Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding , 2022, NeurIPS.

[8]  Yaniv Taigman,et al.  KNN-Diffusion: Image Generation via Large-Scale Retrieval , 2022, ICLR.

[9]  D. Cohen-Or,et al.  MyStyle , 2022, ACM Trans. Graph..

[10]  Jong-Chul Ye,et al.  MR Image Denoising and Super-Resolution Using Regularized Reverse Diffusion , 2022, IEEE Transactions on Medical Imaging.

[11]  S. Hoi,et al.  BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation , 2022, ICML.

[12]  Michael Elad,et al.  Denoising Diffusion Restoration Models , 2022, NeurIPS.

[13]  L. Gool,et al.  RePaint: Inpainting using Denoising Diffusion Probabilistic Models , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  B. Ommer,et al.  High-Resolution Image Synthesis with Latent Diffusion Models , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Prafulla Dhariwal,et al.  GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models , 2021, ICML.

[16]  A. Dimakis,et al.  Deblurring via Stochastic Refinement , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  D. Lischinski,et al.  Blended Diffusion for Text-driven Editing of Natural Images , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  S. Ermon,et al.  Solving Inverse Problems in Medical Imaging with Score-Based Generative Models , 2021, ICLR.

[19]  Daniel Cohen-Or,et al.  Pivotal Tuning for Latent-based Editing of Real Images , 2021, ACM Trans. Graph..

[20]  David J. Fleet,et al.  Image Super-Resolution via Iterative Refinement , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Yonina C. Eldar,et al.  Image Restoration by Deep Projected GSURE , 2021, 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

[22]  Luc Van Gool,et al.  Plug-and-Play Image Restoration With Deep Denoiser Prior , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Bo Dai,et al.  Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Peyman Milanfar,et al.  MUSIQ: Multi-scale Image Quality Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[25]  Youngjune Gwon,et al.  ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[26]  Prafulla Dhariwal,et al.  Diffusion Models Beat GANs on Image Synthesis , 2021, NeurIPS.

[27]  Ilya Sutskever,et al.  Learning Transferable Visual Models From Natural Language Supervision , 2021, ICML.

[28]  Prafulla Dhariwal,et al.  Improved Denoising Diffusion Probabilistic Models , 2021, ICML.

[29]  Pieter Abbeel,et al.  Denoising Diffusion Probabilistic Models , 2020, NeurIPS.

[30]  Mark A. Anastasio,et al.  Medical image reconstruction with image-adaptive priors learned by use of generative adversarial networks , 2020, Medical Imaging 2020: Physics of Medical Imaging.

[31]  Shady Abu Hussein,et al.  Correction Filter for Single Image Super-Resolution: Robustifying Off-the-Shelf Deep Super-Resolvers , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Raja Giryes,et al.  Image-Adaptive GAN based Reconstruction , 2019, AAAI.

[33]  Yang Song,et al.  Generative Modeling by Estimating Gradients of the Data Distribution , 2019, NeurIPS.

[34]  Tom Tirer,et al.  Super-Resolution via Image-Adapted Denoising CNNs: Incorporating External and Internal Learning , 2018, IEEE Signal Processing Letters.

[35]  Raja Giryes,et al.  Image Restoration by Iterative Denoising and Backward Projections , 2017, IEEE Transactions on Image Processing.

[36]  Stefano Ermon,et al.  Modeling Sparse Deviations for Compressed Sensing using Generative Models , 2018, ICML.

[37]  Michal Irani,et al.  "Zero-Shot" Super-Resolution Using Deep Internal Learning , 2017, CVPR.

[38]  Andrea Vedaldi,et al.  Deep Image Prior , 2017, International Journal of Computer Vision.

[39]  Yochai Blau,et al.  The Perception-Distortion Tradeoff , 2017, CVPR.

[40]  Matthias Zwicker,et al.  Deep Mean-Shift Priors for Image Restoration , 2017, NIPS.

[41]  Eirikur Agustsson,et al.  NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[42]  Kyoung Mu Lee,et al.  Enhanced Deep Residual Networks for Single Image Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[43]  Wangmeng Zuo,et al.  Learning Deep CNN Denoiser Prior for Image Restoration , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Alexandros G. Dimakis,et al.  Compressed Sensing using Generative Models , 2017, ICML.

[45]  Lei Zhang,et al.  Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising , 2016, IEEE Transactions on Image Processing.

[46]  Xiaoou Tang,et al.  Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Surya Ganguli,et al.  Deep Unsupervised Learning using Nonequilibrium Thermodynamics , 2015, ICML.

[48]  Jean Ponce,et al.  Learning a convolutional neural network for non-uniform motion blur removal , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[50]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.