Multi-scale attention network for image inpainting

Abstract Recently, deep learning based inpainting methods have shown promising performance, in which some multi-scale networks are introduced to characterize image content in both details and structures. However, few of these networks explore local spatial components under different receptive fields and internal connection between multi-scale feature maps. In this paper, we propose a novel multi-scale attention network (MSA-Net) to fill the irregular missing regions, in which a multi-scale attention group (MSAG) with several multi-scale attention units (MSAUs) is introduced for fully analysing the features from shallow details to high-level semantics. In each MSAU, an attention based spatial pyramid structure is designed to capture the deep features from different receptive fields. In this network, attention mechanisms are explored for boosting the representation power of MSAU, where spatial attention is combined with each scale to highlight the most probably attentive spatial components and the channel attention is used as a globally semantic detector to build the connection between the multiple scales. Furthermore, for better inpainting results, a max pooling based mask update method is utilized to predict the missing parts from the border regions to the inside. Finally, experiments on Places2 dataset and CelebA dataset demonstrate that the proposed method can achieve better results than the previous inpainting methods.

[1]  Tony F. Chan,et al.  Nontexture Inpainting by Curvature-Driven Diffusions , 2001, J. Vis. Commun. Image Represent..

[2]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[3]  Faceshop , 2018, ACM Transactions on Graphics.

[4]  Leon A. Gatys,et al.  A Neural Algorithm of Artistic Style , 2015, ArXiv.

[5]  Christine Guillemot,et al.  Image Inpainting : Overview and Recent Advances , 2014, IEEE Signal Processing Magazine.

[6]  Sen Liu,et al.  Progressive Image Inpainting with Full-Resolution Residual Network , 2019, ACM Multimedia.

[7]  Bolei Zhou,et al.  Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Xiao Xiao,et al.  Occlusion Removal Using Depth Mapping in Three-Dimensional Integral Imaging , 2012, Journal of Display Technology.

[9]  Yi Wang,et al.  Image Inpainting via Generative Multi-column Convolutional Neural Networks , 2018, NeurIPS.

[10]  Tony F. Chan,et al.  Mathematical Models for Local Nontexture Inpaintings , 2002, SIAM J. Appl. Math..

[11]  Aleksandra Pizurica,et al.  Context-Aware Patch-Based Image Inpainting Using Markov Random Field Modeling , 2015, IEEE Transactions on Image Processing.

[12]  Ugo Boscain,et al.  Highly Corrupted Image Inpainting Through Hypoelliptic Diffusion , 2018, Journal of Mathematical Imaging and Vision.

[13]  Adib Akl,et al.  A survey of exemplar-based texture synthesis methods , 2018, Comput. Vis. Image Underst..

[14]  Haimiao Zhang,et al.  Wavelet frame based Poisson noise removal and image deblurring , 2017, Signal Process..

[15]  Xiangzhi Bai,et al.  Patch-Sparsity-Based Image Inpainting Through a Facet Deduced Directional Derivative , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Geoffrey E. Hinton,et al.  Learning to combine foveal glimpses with a third-order Boltzmann machine , 2010, NIPS.

[17]  Patrick Pérez,et al.  Poisson image editing , 2003, ACM Trans. Graph..

[18]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  John Hannah,et al.  IEEE International Conference on Image Processing (ICIP) , 1997 .

[20]  Jian Zhang,et al.  Image Restoration Using Joint Statistical Modeling in a Space-Transform Domain , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[22]  Joachim Weickert,et al.  Diffusion-Based Inpainting for Coding Remote-Sensing Data , 2017, IEEE Geoscience and Remote Sensing Letters.

[23]  Tianqi Chen,et al.  Empirical Evaluation of Rectified Activations in Convolutional Network , 2015, ArXiv.