论文信息 - SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring

SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring

The goal of dynamic scene deblurring is to remove the motion blur presented in a given image. To recover the details from the severe blurs, conventional convolutional neural networks (CNNs) based methods typically increase the number of convolution layers, kernel-size, or different scale images to enlarge the receptive field. However, these methods neglect the non-uniform nature of blurs, and cannot extract varied local and global information. Unlike the CNNs-based methods, we propose a Transformer-based model for image deblurring, named SharpFormer, that directly learns long-range dependencies via a novel Transformer module to overcome large blur variations. Transformer is good at learning global information but is poor at capturing local information. To overcome this issue, we design a novel Locality preserving Transformer (LTransformer) block to integrate sufficient local information into global features. In addition, to effectively apply LTransformer to the medium-resolution features, a hybrid block is introduced to capture intermediate mixed features. Furthermore, we use a dynamic convolution (DyConv) block, which aggregates multiple parallel convolution kernels to handle the non-uniform blur of inputs. We leverage a powerful two-stage attentive framework composed of the above blocks to learn the global, hybrid, and local features effectively. Extensive experiments on the GoPro and REDS datasets show that the proposed SharpFormer performs favourably against the state-of-the-art methods in blurred image restoration.

[1] Richang Hong,et al. CRNet: Unsupervised Color Retention Network for Blind Motion Deblurring , 2022, ACM Multimedia.

[2] Hok Shing Wong,et al. A robust non-blind deblurring method using deep denoiser prior , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3] Chia-Wen Lin,et al. Stripformer: Strip Transformer for Fast Image Deblurring , 2022, ECCV.

[4] Syed Waqas Zamir,et al. Restormer: Efficient Transformer for High-Resolution Image Restoration , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Luc Van Gool,et al. SwinIR: Image Restoration Using Swin Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[6] Sung-Jea Ko,et al. Rethinking Coarse-to-Fine Approach in Single Image Deblurring , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[7] Jianmin Bao,et al. Uformer: A General U-Shaped Transformer for Image Restoration , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8] J. Zhang,et al. HINet: Half Instance Normalization Network for Image Restoration , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[9] Radu Timofte,et al. NTIRE 2021 Challenge on Image Deblurring , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[10] Luc Van Gool,et al. LocalViT: Bringing Locality to Vision Transformers , 2021, ArXiv.

[11] Xiang Li,et al. Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[12] Ling Shao,et al. Multi-Stage Progressive Image Restoration , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Yen-Yu Lin,et al. BANet: A Blur-Aware Attention Network for Dynamic Scene Deblurring , 2021, IEEE Transactions on Image Processing.

[14] S. Gelly,et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[15] Nicolas Usunier,et al. End-to-End Object Detection with Transformers , 2020, ECCV.

[16] A. N. Rajagopalan,et al. Spatially-Attentive Patch-Hierarchical Network for Adaptive Motion Deblurring , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17] Lu Yuan,et al. Dynamic Convolution: Attention Over Convolution Kernels , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Se Young Chun,et al. Multi-Temporal Recurrent Neural Networks For Progressive Non-Uniform Single Image Deblurring With Incremental Temporal Training , 2019, ECCV.

[19] Radu Timofte,et al. NTIRE 2019 Challenge on Video Deblurring and Super-Resolution: Dataset and Study , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[20] Xiaoyong Shen,et al. Dynamic Scene Deblurring With Parameter Selective Sharing and Nested Skip Connections , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21] Hongdong Li,et al. Deep Stacked Hierarchical Multi-Patch Network for Image Deblurring , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[22] A. N. Rajagopalan,et al. Region-Adaptive Dense Network for Efficient Motion Deblurring , 2019, AAAI.

[23] Yanning Zhang,et al. MPTV: Matching Pursuit-Based Total Variation Minimization for Image Deconvolution , 2018, IEEE Transactions on Image Processing.

[24] Rynson W. H. Lau,et al. Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[25] Yi Wang,et al. Scale-Recurrent Network for Deep Image Deblurring , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[26] Jiri Matas,et al. DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[27] Mingkui Tan,et al. Self-Paced Kernel Estimation for Robust Blind Image Deblurring , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[28] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[29] Ian D. Reid,et al. From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Tae Hyun Kim,et al. Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31] Frank Hutter,et al. SGDR: Stochastic Gradient Descent with Warm Restarts , 2016, ICLR.

[32] Mingkui Tan,et al. Blind Image Deconvolution by Automatic Gradient Activation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33] Deqing Sun,et al. Blind Image Deblurring Using Dark Channel Prior , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Ayan Chakrabarti,et al. A Neural Approach to Blind Motion Deblurring , 2016, ECCV.

[35] Jian-Jiun Ding,et al. Blur kernel estimation using normalized color-line priors , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[37] Jean Ponce,et al. Learning a convolutional neural network for non-uniform motion blur removal , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Ce Liu,et al. Deep Convolutional Neural Network for Image Deconvolution , 2014, NIPS.

[39] Tae Hyun Kim,et al. Dynamic Scene Deblurring , 2013, 2013 IEEE International Conference on Computer Vision.

[40] Li Xu,et al. Unnatural L0 Sparse Representation for Natural Image Deblurring , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[41] Sunghyun Cho,et al. Edge-based blur kernel estimation using patch priors , 2013, IEEE International Conference on Computational Photography (ICCP).

[42] Bernhard Schölkopf,et al. Fast removal of non-uniform camera shake , 2011, 2011 International Conference on Computer Vision.

[43] Rob Fergus,et al. Blind deconvolution using a normalized sparsity measure , 2011, CVPR 2011.

[44] Jean Ponce,et al. Non-uniform Deblurring for Shaken Images , 2010, International Journal of Computer Vision.

[45] Rob Fergus,et al. Fast Image Deconvolution using Hyper-Laplacian Priors , 2009, NIPS.

[46] Frédo Durand,et al. Understanding and evaluating blind deconvolution algorithms , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[47] Jiaya Jia,et al. High-quality motion deblurring from a single image , 2008, ACM Trans. Graph..

[48] Tony F. Chan,et al. Total variation blind deconvolution , 1998, IEEE Trans. Image Process..

[49] Stephen Lin,et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[50] Sundaresh Ram,et al. Removing Camera Shake from a Single Photograph , 2009 .