Paper Information - MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network

We present the Mask-guided Generative Adversarial Network (MagGAN) for high-resolution face attribute editing, in which semantic facial masks from a pre-trained face parser guide the fine-grained image editing process. With the introduction of a mask-guided reconstruction loss, MagGAN learns to edit only the facial parts that are relevant to the desired attribute changes, while preserving the attribute-irrelevant regions (e.g., a hat or scarf for the modification "To Bald"). Further, a novel mask-guided conditioning strategy is introduced to incorporate the influence region of each attribute change into the generator. In addition, a multi-level patch-wise discriminator structure is proposed to scale the model to high-resolution ($1024 \times 1024$) face editing. Experiments on the CelebA benchmark show that the proposed method significantly outperforms prior state-of-the-art approaches in terms of both image quality and editing performance.
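The idea behind a mask-guided reconstruction loss can be illustrated with a small sketch. The abstract does not give the exact formulation, so the code below is only an assumption: it takes a binary influence mask (1 where the attribute change is allowed to alter pixels) and penalizes the mean absolute difference between the original and edited images everywhere else. The function name, the L1 penalty, and the normalization are hypothetical choices for illustration, not the authors' definition.

```python
import numpy as np


def mask_guided_reconstruction_loss(original, edited, influence_mask):
    """Mean absolute change over pixels outside the influence region.

    original, edited: float arrays of shape (H, W, C).
    influence_mask:   float array of shape (H, W); 1 where the attribute
                      change may alter pixels, 0 where the image should
                      be preserved (assumed convention, not from the paper).
    """
    # 1 where pixels are expected to stay unchanged.
    preserve = 1.0 - influence_mask
    # Absolute difference, zeroed inside the influence region.
    diff = np.abs(edited - original) * preserve[..., None]
    # Normalize by the number of preserved values; guard against an
    # empty preserved region to avoid division by zero.
    denom = max(preserve.sum() * original.shape[-1], 1.0)
    return float(diff.sum() / denom)


if __name__ == "__main__":
    image = np.zeros((4, 4, 3))
    edited = image.copy()
    edited[:2] += 1.0                  # the edit changes the top half only

    hair_mask = np.zeros((4, 4))
    hair_mask[:2] = 1.0                # influence region covers the top half

    # Changes inside the influence region are not penalized.
    print(mask_guided_reconstruction_loss(image, edited, hair_mask))
    # With an empty influence region, every changed pixel is penalized.
    print(mask_guided_reconstruction_loss(image, edited, np.zeros((4, 4))))
```

Under this assumed form, an edit confined to the influence region incurs no penalty, while any change to a region such as a hat or scarf raises the loss.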
