论文信息 - WaveDM: Wavelet-Based Diffusion Models for Image Restoration

WaveDM: Wavelet-Based Diffusion Models for Image Restoration

Latest diffusion-based methods for many image restoration tasks outperform traditional models, but they encounter the long-time inference problem. To tackle it, this paper proposes a Wavelet-Based Diffusion Model (WaveDM) with an Efficient Conditional Sampling (ECS) strategy. WaveDM learns the distribution of clean images in the wavelet domain conditioned on the wavelet spectrum of degraded images after wavelet transform, which is more time-saving in each step of sampling than modeling in the spatial domain. In addition, ECS follows the same procedure as the deterministic implicit sampling in the initial sampling period and then stops to predict clean images directly, which reduces the number of total sampling steps to around 5. Evaluations on four benchmark datasets including image raindrop removal, defocus deblurring, demoir\'eing, and denoising demonstrate that WaveDM achieves state-of-the-art performance with the efficiency that is comparable to traditional one-pass methods and over 100 times faster than existing image restoration methods using vanilla diffusion models.

[1] S. Ermon,et al. GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration , 2023, ArXiv.

[2] Vishal M. Patel,et al. AT-DDPM: Restoring Faces Degraded by Atmospheric Turbulence Using Denoising Diffusion Probabilistic Models , 2022, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

[3] Michael Elad,et al. Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance , 2022, Trans. Mach. Learn. Res..

[4] R. Legenstein,et al. Restoring Vision in Adverse Weather Conditions With Patch-Based Denoising Diffusion Models , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] Dong Huk Park,et al. More Control for Free! Image Synthesis with Semantic Diffusion Guidance , 2021, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

[6] David J. Fleet,et al. Image Super-Resolution via Iterative Refinement , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Syed Waqas Zamir,et al. Learning Enriched Features for Real Image Restoration and Enhancement , 2020, ECCV.

[8] Chen Change Loy,et al. DifFace: Blind Face Restoration with Diffused Error Contraction , 2022, IEEE transactions on pattern analysis and machine intelligence.

[9] Yinhuai Wang,et al. Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model , 2022, ICLR.

[10] A. Tran,et al. Wavelet Diffusion Models are fast and scalable Image Generators , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11] J. C. Ye,et al. Parallel Diffusion Models of Operator and Image for Blind Inverse Problems , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12] Cheng Lu,et al. DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models , 2022, ArXiv.

[13] Diederik P. Kingma,et al. On Distillation of Guided Diffusion Models , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14] Jong-Chul Ye,et al. Diffusion-based Image Translation using Disentangled Style and Content Representation , 2022, ICLR.

[15] Michael T. McCann,et al. Diffusion Posterior Sampling for General Noisy Inverse Problems , 2022, ICLR.

[16] Chi-Wing Fu,et al. Neural Wavelet-domain Diffusion for 3D Shape Generation , 2022, SIGGRAPH Asia.

[17] Valentin De Bortoli,et al. Wavelet Score-Based Generative Modeling , 2022, NeurIPS.

[18] Jonathan Ho. Classifier-Free Diffusion Guidance , 2022, ArXiv.

[19] Jia Li,et al. Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing , 2022, ECCV.

[20] Jong-Chul Ye,et al. Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis , 2022, ArXiv.

[21] Chongxuan Li,et al. EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations , 2022, NeurIPS.

[22] Xiatian Zhu,et al. Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling , 2022, ECCV.

[23] Zhengjun Zha,et al. Image De-raining Transformer. , 2022, IEEE transactions on pattern analysis and machine intelligence.

[24] J. C. Ye,et al. Improving Diffusion Models for Inverse Problems using Manifold Constraints , 2022, Neural Information Processing Systems.

[25] Cheng Lu,et al. DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps , 2022, NeurIPS.

[26] Nan Duan,et al. DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder , 2022, ArXiv.

[27] Fang Wen,et al. Pretraining is All You Need for Image-to-Image Translation , 2022, ArXiv.

[28] Dahua Lin,et al. Accelerating Diffusion Models via Early Stop of the Diffusion Process , 2022, ArXiv.

[29] David J. Fleet,et al. Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding , 2022, NeurIPS.

[30] Jian Sun,et al. Simple Baselines for Image Restoration , 2022, ECCV.

[31] Tim Salimans,et al. Progressive Distillation for Fast Sampling of Diffusion Models , 2022, ICLR.

[32] Michael Elad,et al. Denoising Diffusion Restoration Models , 2022, NeurIPS.

[33] L. Gool,et al. RePaint: Inpainting using Denoising Diffusion Probabilistic Models , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34] B. Ommer,et al. High-Resolution Image Synthesis with Latent Diffusion Models , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Prafulla Dhariwal,et al. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models , 2021, ICML.

[36] Jong-Chul Ye,et al. Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[37] A. Dimakis,et al. Deblurring via Stochastic Refinement , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Supasorn Suwajanakorn,et al. Diffusion Autoencoders: Toward a Meaningful and Decodable Representation , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39] Syed Waqas Zamir,et al. Restormer: Efficient Transformer for High-Resolution Image Restoration , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[40] David J. Fleet,et al. Palette: Image-to-Image Diffusion Models , 2021, SIGGRAPH.

[41] Syed Waqas Zamir,et al. Burst Image Restoration and Enhancement , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[42] A. Leonardis,et al. Learning Frequency Domain Priors for Image Demoireing , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43] S. Ermon,et al. SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations , 2021, ICLR.

[44] Jianmin Bao,et al. Uformer: A General U-Shaped Transformer for Image Restoration , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45] David J. Fleet,et al. Cascaded Diffusion Models for High Fidelity Image Generation , 2021, J. Mach. Learn. Res..

[46] Qi Li,et al. SRDiff: Single Image Super-Resolution with Diffusion Probabilistic Models , 2021, Neurocomputing.

[47] Jie Li,et al. Wavelet-Based Dual Recursive Network for Image Super-Resolution , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[48] Nick Barnes,et al. Densely Residual Laplacian Super-Resolution , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49] Haibin Huang,et al. Inversion-Based Creativity Transfer with Diffusion Models , 2022, ArXiv.

[50] Luc Van Gool,et al. SwinIR: Image Restoration Using Swin Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[51] Seungyong Lee,et al. Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[52] Tieyong Zeng,et al. Structure-Preserving Deraining with Residue Channel Prior Guidance , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[53] Sung-Jea Ko,et al. Rethinking Coarse-to-Fine Approach in Single Image Deblurring , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[54] Youngjune Gwon,et al. ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[55] Yi Yang,et al. Removing Raindrops and Rain Streaks in One Go , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[56] Seungyong Lee,et al. Iterative Filter Adaptive Network for Single Image Defocus Deblurring , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[57] Prafulla Dhariwal,et al. Diffusion Models Beat GANs on Image Synthesis , 2021, NeurIPS.

[58] Chris G. Willcocks,et al. UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models , 2021, ArXiv.

[59] Lin Ma,et al. Dual Attention-in-Attention Model for Joint Rain Streak and Raindrop Removal , 2021, IEEE Transactions on Image Processing.

[60] Prafulla Dhariwal,et al. Improved Denoising Diffusion Probabilistic Models , 2021, ICML.

[61] Nal Kalchbrenner,et al. Colorization Transformer , 2021, ICLR.

[62] Ling Shao,et al. Multi-Stage Progressive Image Restoration , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[63] Abhishek Kumar,et al. Score-Based Generative Modeling through Stochastic Differential Equations , 2020, ICLR.

[64] S. Gelly,et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[65] Jiaming Song,et al. Denoising Diffusion Implicit Models , 2020, ICLR.

[66] Stephen Lin,et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[67] S HrishikeshP.,et al. WDRN : A Wavelet Decomposed RelightNet for Image Relighting , 2020, ECCV Workshops.

[68] Shanxin Yuan,et al. Wavelet-Based Dual-Branch Network for Image Demoireing , 2020, ECCV.

[69] Pieter Abbeel,et al. Denoising Diffusion Probabilistic Models , 2020, NeurIPS.

[70] Stefano Ermon,et al. Improved Techniques for Training Score-Based Generative Models , 2020, NeurIPS.

[71] Baining Guo,et al. Learning Texture Transformer Network for Image Super-Resolution , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[72] M. S. Brown,et al. Defocus Deblurring Using Dual-Pixel Data , 2020, ECCV.

[73] Vishal M. Patel,et al. Image De-Raining Using a Conditional Generative Adversarial Network , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[74] Ling-Yu Duan,et al. FHDe2Net: Full High Definition Demoireing Network , 2020, ECCV.

[75] Yixin Chen,et al. Deep Learning for Seeing Through Window With Raindrops , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[76] Zhangyang Wang,et al. DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[77] Yang Song,et al. Generative Modeling by Estimating Gradients of the Data Distribution , 2019, NeurIPS.

[78] Sungkil Lee,et al. Deep Defocus Map Estimation Using Domain Adaptation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[79] Loong Fah Cheong,et al. Heavy Rain Image Restoration: Integrating Physics Model and Conditional Adversarial Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[80] Masanori Suganuma,et al. Dual Residual Networks Leveraging the Potential of Paired Operations for Image Restoration , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[81] Yun Fu,et al. Image Super-Resolution Using Very Deep Residual Channel Attention Networks , 2018, ECCV.

[82] Stephen Lin,et al. A High-Quality Denoising Dataset for Smartphone Cameras , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[83] Yizhou Yu,et al. Moiré Photo Restoration Using Multiresolution Convolutional Neural Networks , 2018, IEEE Transactions on Image Processing.

[84] Wenhan Yang,et al. Attentive Generative Adversarial Network for Raindrop Removal from A Single Image , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[85] Mohinder Malhotra. Single Image Haze Removal Using Dark Channel Prior , 2016 .

[86] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[87] Surya Ganguli,et al. Deep Unsupervised Learning using Nonequilibrium Thermodynamics , 2015, ICML.

[88] Luc Van Gool,et al. Anchored Neighborhood Regression for Fast Example-Based Super-Resolution , 2013, 2013 IEEE International Conference on Computer Vision.

[89] Michal Irani,et al. Nonparametric Blind Super-resolution , 2013, 2013 IEEE International Conference on Computer Vision.

[90] Pascal Vincent,et al. A Connection Between Score Matching and Denoising Autoencoders , 2011, Neural Computation.

[91] Michael F. Cohen,et al. Deep photo: model-based photograph enhancement and viewing , 2008, SIGGRAPH Asia '08.

[92] Xu Yang,et al. Real-time rendering of realistic rain , 2006, SIGGRAPH '06.

[93] Aapo Hyvärinen,et al. Estimation of Non-Normalized Statistical Models by Score Matching , 2005, J. Mach. Learn. Res..