论文信息 - Unsupervised Facial Image Synthesis Using Two-Discriminator Adversarial Autoencoder Network

Unsupervised Facial Image Synthesis Using Two-Discriminator Adversarial Autoencoder Network

Recent years have witnessed the unprecedented success in single image synthesis by the means of convolutional neural networks (CNNs). High-level synthesis of facial image such as expression translation and attribute swap is still a challenging task due to high non-linearity. Previous methods suffer from the limitations that being unable to transfer multiple face attributes simultaneously, or incapability of transferring an attribute to another by a continuously changing way. To address this problem, we propose a two-discriminator adversarial autoencoder network (TAAN). The latent-discriminator is trained to disentangle an input image from its original facial attribute, while the pixel-discriminator is trained to make the output image attach to the target facial attribute. By controlling the attribute values, we can choose which and how much a specific attribute can be perceivable in the generated image. Quantitative and qualitative evaluations are conducted on the celebA and KDEF datasets, and the comparison with the state-of-the-art methods shows the competency of our proposed TAAN.

[1] Jung-Woo Ha,et al. StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2] Yunhong Wang,et al. Facial Expression Synthesis by U-Net Conditional Generative Adversarial Networks , 2018, ICMR.

[3] Lior Wolf,et al. Unsupervised Creation of Parameterized Avatars , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[4] Navdeep Jaitly,et al. Adversarial Autoencoders , 2015, ArXiv.

[5] Sanja Fidler,et al. Be Your Own Prada: Fashion Synthesis with Structural Coherence , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[6] Dong Guo,et al. Digital face makeup by example , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[7] Baining Guo,et al. Geometry-driven photorealistic facial expression synthesis , 2003, IEEE Transactions on Visualization and Computer Graphics.

[8] Justus Thies,et al. Face2Face: Real-Time Face Capture and Reenactment of RGB Videos , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Yuting Zhang,et al. Learning to Disentangle Factors of Variation with Manifold Interaction , 2014, ICML.

[10] Hyunsoo Kim,et al. Learning to Discover Cross-Domain Relations with Generative Adversarial Networks , 2017, ICML.

[11] Gang Hu,et al. Sharp and Real Image Super-Resolution Using Generative Adversarial Network , 2017, ICONIP.

[12] Geoffrey E. Hinton,et al. Generating Facial Expressions with Deep Belief Nets , 2008 .

[13] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[14] L. Leyman,et al. The Karolinska Directed Emotional Faces: A validation study , 2008 .

[15] Chi-Keung Tang,et al. Example-Based Cosmetic Transfer , 2007 .

[16] Alexei A. Efros,et al. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[17] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[18] Luc Van Gool,et al. Pose Guided Person Image Generation , 2017, NIPS.

[19] Tomaso A. Poggio,et al. Reanimating Faces in Images and Video , 2003, Comput. Graph. Forum.

[20] Ping Tan,et al. DualGAN: Unsupervised Dual Learning for Image-to-Image Translation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[21] Guillaume Lample,et al. Fader Networks: Manipulating Images by Sliding Attributes , 2017, NIPS.

[22] Zicheng Liu,et al. Expressive expression mapping with ratio images , 2001, SIGGRAPH.

[23] Fei Yang,et al. Expression flow for 3D-aware face component transfer , 2011, SIGGRAPH 2011.

[24] Jan Kautz,et al. Visio-lization: generating novel facial images , 2009, SIGGRAPH 2009.

[25] François Chollet,et al. Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[27] Tal Hassner,et al. Effective face frontalization in unconstrained images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[29] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[30] Rama Chellappa,et al. ExprGAN: Facial Expression Editing with Controllable Expression Intensity , 2017, AAAI.