论文信息 - GP-GAN: Towards Realistic High-Resolution Image Blending

GP-GAN: Towards Realistic High-Resolution Image Blending

It is common but challenging to address high-resolution image blending in the automatic photo editing application. In this paper, we would like to focus on solving the problem of high-resolution image blending, where the composite images are provided. We propose a framework called Gaussian-Poisson Generative Adversarial Network (GP-GAN) to leverage the strengths of the classical gradient-based approach and Generative Adversarial Networks. To the best of our knowledge, it's the first work that explores the capability of GANs in high-resolution image blending task. Concretely, we propose Gaussian-Poisson Equation to formulate the high-resolution image blending problem, which is a joint optimization constrained by the gradient and color information. Inspired by the prior works, we obtain gradient information via applying gradient filters. To generate the color information, we propose a Blending GAN to learn the mapping between the composite images and the well-blended ones. Compared to the alternative methods, our approach can deliver high-resolution, realistic images with fewer bleedings and unpleasant artifacts. Experiments confirm that our approach achieves the state-of-the-art performance on Transient Attributes dataset. A user study on Amazon Mechanical Turk finds that the majority of workers are in favor of the proposed method. The source code is available in \urlhttps://github.com/wuhuikai/GP-GAN, and there's also an online demo in \urlhttp://wuhuikai.me/DeepJS.

[1] Shmuel Peleg,et al. Seamless Image Stitching in the Gradient Domain , 2004, ECCV.

[2] Alexei A. Efros,et al. Generative Visual Manipulation on the Natural Image Manifold , 2016, ECCV.

[3] Kenta Oono,et al. Chainer : a Next-Generation Open Source Framework for Deep Learning , 2015 .

[4] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[5] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[6] Jan Kautz,et al. Is L2 a Good Loss Function for Neural Networks for Image Processing , 2015 .

[7] Ming-Hsuan Yang,et al. Deep Image Harmonization , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Richard Szeliski,et al. Eliminating ghosting and exposure artifacts in image mosaics , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[10] Patrick Pérez,et al. Poisson image editing , 2003, ACM Trans. Graph..

[11] Rob Fergus,et al. Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks , 2015, NIPS.

[12] Jian Sun,et al. Drag-and-drop pasting , 2006, SIGGRAPH 2006.

[13] Jan Kautz,et al. Loss Functions for Image Restoration With Neural Networks , 2017, IEEE Transactions on Computational Imaging.

[14] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Hao Li,et al. High-Resolution Image Inpainting Using Multi-scale Neural Patch Synthesis , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Alex Graves,et al. DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.

[17] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.

[18] Jorge Nocedal,et al. A Limited Memory Algorithm for Bound Constrained Optimization , 1995, SIAM J. Sci. Comput..

[19] Chuan Li,et al. Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks , 2016, ECCV.

[20] Alexei A. Efros,et al. Learning a Discriminative Model for the Perception of Realism in Composite Images , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[21] Alexei A. Efros,et al. Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.

[23] Michael M. Kazhdan,et al. Streaming multigrid for gradient-domain operations on large images , 2008, ACM Trans. Graph..

[24] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[25] Ming-Hsuan Yang,et al. Sky is not the limit , 2016, ACM Trans. Graph..

[26] Antonio Torralba,et al. LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[27] Dimitris N. Metaxas,et al. StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[28] Mohammad H. Mahoor,et al. Fast image blending using watersheds and graph cuts , 2009, Image Vis. Comput..

[29] David Salesin,et al. Interactive digital photomontage , 2004, ACM Trans. Graph..

[30] Richard Szeliski,et al. Fast Poisson blending using multi-splines , 2011, 2011 IEEE International Conference on Computational Photography (ICCP).

[31] Edward H. Adelson,et al. The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[32] Rama Chellappa,et al. A Method for Enforcing Integrability in Shape from Shading Algorithms , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[33] Abhinav Gupta,et al. Generative Image Modeling Using Style and Structure Adversarial Networks , 2016, ECCV.

[34] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.

[35] Masatoshi Okutomi,et al. Seamless image cloning by a closed form solution of a modified Poisson problem , 2012, SA '12.

[36] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.

[37] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[38] Thomas Brox,et al. Generating Images with Perceptual Similarity Metrics based on Deep Networks , 2016, NIPS.

[39] Edward H. Adelson,et al. A multiresolution spline with application to image mosaics , 1983, TOGS.

[40] Dani Lischinski,et al. Gradient Domain High Dynamic Range Compression , 2023 .

[41] Xiaofeng Tao,et al. Transient attributes for high-level understanding and editing of outdoor scenes , 2014, ACM Trans. Graph..

[42] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[43] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[44] Julie Dorsey,et al. Understanding and improving the realism of image composites , 2012, ACM Trans. Graph..

[45] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.