Unpaired Domain Transfer for Data Augment in Face Recognition

Face recognition is one of the hottest issues in the field of computer vision and pattern recognition. Deep learning-based recognition models already have more exceptional recognition ability than the human being on open datasets, but still cannot fully undertake the identity recognition task in real scenarios without human assistance. In this paper, we mainly analyze two obstacles, i.e., domain gap and training data shortage. We propose the unpaired Domain Transfer Generative Adversarial Network (DT-GAN) to relieve these two obstacles. We improve the GAN baseline to bridge the domain gap among datasets by generating images conforming to the style of a target domain by learning the mapping between the source domain and target domain. The generator could synthesize face with an arbitrary viewpoint at the same time. The model is trained with a combination of style transfer loss, identity loss, and pose loss, which ensures the successive domain transfer and data augment. We conduct experiments to testify the effectiveness and reasonability of DT-GAN. Experimental results demonstrate the recognition performance is dramatically boosted after domain transfer and data augment.

[1]  Chuan Li,et al.  Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks , 2016, ECCV.

[2]  Luc Van Gool,et al.  Disentangled Person Image Generation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  David Salesin,et al.  Image Analogies , 2001, SIGGRAPH.

[5]  Abhinav Gupta,et al.  Generative Image Modeling Using Style and Structure Adversarial Networks , 2016, ECCV.

[6]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[7]  Tie-Yan Liu,et al.  Dual Learning for Machine Translation , 2016, NIPS.

[8]  Longhui Wei,et al.  Person Transfer GAN to Bridge Domain Gap for Person Re-identification , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Xiaogang Wang,et al.  FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[11]  Hyunsoo Kim,et al.  Learning to Discover Cross-Domain Relations with Generative Adversarial Networks , 2017, ICML.

[12]  Alexei A. Efros,et al.  Generative Visual Manipulation on the Natural Image Manifold , 2016, ECCV.

[13]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[14]  Ming-Yu Liu,et al.  Coupled Generative Adversarial Networks , 2016, NIPS.

[15]  Honglak Lee,et al.  Attribute2Image: Conditional Image Generation from Visual Attributes , 2015, ECCV.

[16]  Namil Kim,et al.  Pixel-Level Domain Transfer , 2016, ECCV.

[17]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[18]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[19]  拓海 杉山,et al.  “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[20]  Shengcai Liao,et al.  Learning Face Representation from Scratch , 2014, ArXiv.

[21]  Leon A. Gatys,et al.  Image Style Transfer Using Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Bernt Schiele,et al.  Generative Adversarial Text to Image Synthesis , 2016, ICML.

[23]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[24]  Shuicheng Yan,et al.  3D-Aided Dual-Agent GANs for Unconstrained Face Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[27]  Ping Tan,et al.  DualGAN: Unsupervised Dual Learning for Image-to-Image Translation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).