Capsules Encoder and Capsgan for Image Inpainting

Convolutional neural networks (CNN) have solved a lot of tasks in the field of computer vision because of its powerful processing capacity in recent years. Many different kinds of CNN are used to implement in the image restoration tasks and achieved outstanding results. Hinton propose a new network architecture with dynamic routing called Capsule networks which has shown remarkable results for image classification on MNIST data. Its advantage over CNN is that the networks can preserve more information from original image feature map with dynamic routing and compress the image into a vector, which CNN process the image as a scalar. We expand the capsule networks to the task of image restoration for the first time. Our idea is based on the structure of encoder-decoder by replacing convolutional neural networks with capsules networks in encoder aspect. Especially, we add Generative Adversarial Capsule Network (CapsuleGAN) in decoder to constraint parameters for expanding generative space. We evaluate our method on the ImageNet datasets and achieved acceptable results.

[1]  Alexei A. Efros,et al.  Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Yee Whye Teh,et al.  Learning to Parse Images , 1999, NIPS.

[3]  Ulas Bagci,et al.  Capsules for Object Segmentation , 2018, ArXiv.

[4]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[5]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[6]  Premkumar Natarajan,et al.  CapsuleGAN: Generative Adversarial Capsule Network , 2018, ECCV Workshops.

[7]  Guangyao Li,et al.  Realistic single-image super-resolution using autoencoding adversarial networks , 2018, J. Electronic Imaging.

[8]  Hao Li,et al.  High-Resolution Image Inpainting Using Multi-scale Neural Patch Synthesis , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Yang Jin,et al.  Capsule Network Performance on Complex Data , 2017, ArXiv.

[10]  Thomas S. Huang,et al.  Generative Image Inpainting with Contextual Attention , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[11]  Eli Shechtman,et al.  PatchMatch: a randomized correspondence algorithm for structural image editing , 2009, ACM Trans. Graph..

[12]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[13]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.

[14]  Thomas S. Huang,et al.  Free-Form Image Inpainting With Gated Convolution , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Bernhard Schölkopf,et al.  A Machine Learning Approach for Non-blind Image Deconvolution , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[17]  Kristen Grauman,et al.  On-demand Learning for Deep Image Restoration , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[18]  Hiroshi Ishikawa,et al.  Globally and locally consistent image completion , 2017, ACM Trans. Graph..

[19]  Zhi-Li Zhang,et al.  Graph Capsule Convolutional Neural Networks , 2018, ArXiv.