论文信息 - InGAN: Capturing and Retargeting the “DNA” of a Natural Image

InGAN: Capturing and Retargeting the “DNA” of a Natural Image

Generative Adversarial Networks (GANs) typically learn a distribution of images in a large image dataset, and are then able to generate new images from this distribution. However, each natural image has its own internal statistics, captured by its unique distribution of patches. In this paper we propose an ``Internal GAN'' (InGAN) -- an image-specific GAN -- which trains on a single input image and learns its internal distribution of patches. It is then able to synthesize a plethora of new natural images of significantly different sizes, shapes and aspect-ratios – all with the same internal patch-distribution (same ``DNA'') as the input image. In particular, despite large changes in global size/shape of the image, all elements inside the image maintain their local size/shape. InGAN is fully unsupervised, requiring no additional data other than the input image itself. Once trained on the input image, it can remap the input to any size or shape in a single feedforward pass, while preserving the same internal patch distribution. InGAN provides a unified framework for a variety of tasks, bridging the gap between textures and natural images.

[1] Jean-Michel Morel,et al. A non-local algorithm for image denoising , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2] Michael Elad,et al. Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[3] Alessandro Foi,et al. Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , 2007, IEEE Transactions on Image Processing.

[4] Daniel Cohen-Or,et al. Non-homogeneous Content-driven Video-retargeting , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[5] Ariel Shamir,et al. Seam Carving for Content-Aware Image Resizing , 2007, ACM Trans. Graph..

[6] Denis Simakov,et al. Summarizing visual data using bidirectional similarity , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7] Yael Pritch,et al. Shift-map image editing , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[8] Feng Liu,et al. Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries in Wavelet Domain , 2009, 2009 Fifth International Conference on Image and Graphics.

[9] Eli Shechtman,et al. PatchMatch: a randomized correspondence algorithm for structural image editing , 2009, ACM Trans. Graph..

[10] Michal Irani,et al. Super-resolution from a single image , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11] William T. Freeman,et al. The Patch Transform , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12] Michal Irani,et al. Internal statistics of a single natural image , 2011, CVPR 2011.

[13] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[14] Leon A. Gatys,et al. Texture Synthesis Using Convolutional Neural Networks , 2015, NIPS.

[15] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[16] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[17] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[18] Leon A. Gatys,et al. A Neural Algorithm of Artistic Style , 2015, ArXiv.

[19] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Roland Vollgraf,et al. Texture Synthesis with Spatial Generative Adversarial Networks , 2016, ArXiv.

[21] Vincent Dumoulin,et al. Deconvolution and Checkerboard Artifacts , 2016 .

[22] Andrea Vedaldi,et al. Texture Networks: Feed-forward Synthesis of Textures and Stylized Images , 2016, ICML.

[23] Francesco Visin,et al. A guide to convolution arithmetic for deep learning , 2016, ArXiv.

[24] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[25] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Tae-Hyun Oh,et al. Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[27] Alexei A. Efros,et al. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[28] Dani Lischinski,et al. Non-stationary texture synthesis by adversarial expansion , 2018, ACM Trans. Graph..

[29] Lior Wolf,et al. The Role of Minimal Complexity Functions in Unsupervised Learning of Semantic Mappings , 2017, ICLR.

[30] Michal Irani,et al. Internal Distribution Matching for Natural Image Retargeting , 2018, ArXiv.

[31] Yuichi Yoshida,et al. Spectral Normalization for Generative Adversarial Networks , 2018, ICLR.

[32] Michal Irani,et al. "Zero-Shot" Super-Resolution Using Deep Internal Learning , 2017, CVPR.

[33] Jan Kautz,et al. High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34] Andrea Vedaldi,et al. Deep Image Prior , 2017, International Journal of Computer Vision.

[35] Tali Dekel,et al. SinGAN: Learning a Generative Model From a Single Natural Image , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).