论文信息 - Asymmetric Joint GANs for Normalizing Face Illumination From a Single Image

Asymmetric Joint GANs for Normalizing Face Illumination From a Single Image

Illumination normalization for face recognition is very important when a face is captured under harsh lighting conditions. Instead of designing hand-crafted features, in this paper we formulate face illumination normalization as an image-to-image translation task. A great challenge of face normalization is that human facial structures are particularly sensitive to image structure distortion, which frequently occurs in traditional image-to-image translation tasks. Unfortunately, sometimes even slight facial structure distortions may prohibit human eyes and machine face recognition methods from identifying face identities. To address this issue, a novel GAN-based network architecture called the asymmetric joint generative adversarial network (AJGAN) is developed to normalize face images under arbitrary illumination conditions, without known face geometry and albedo information. In addition, an illumination normalization GAN <inline-formula><tex-math notation="LaTeX">$G_1$</tex-math></inline-formula> and an asymmetric relighting GAN <inline-formula><tex-math notation="LaTeX">$G_2$</tex-math></inline-formula> that maps a frontal-illuminated image to images with various lighting conditions are incorporated in AJGAN to maintain personalized facial structures. To avoid image blurring caused by the under-constrained relighting mapping, we introduce a scheme of one-hot lighting labels into <inline-formula><tex-math notation="LaTeX">$G_2$</tex-math></inline-formula> and enforce label classification loss. Furthermore, the number of training images starting from a very limited number of labels is dynamically extended by the combination of different lighting labels. Qualitative and quantitative experiments on three databases validate that AJGAN significantly outperforms the state-of-the-art methods.

Yanli Liu | Guanyu Xing | Hongyu Yang | Xianjun Han

[1] Yoshua Bengio,et al. Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Wen Gao,et al. Face recognition under generic illumination based on harmonic relighting , 2005, Int. J. Pattern Recognit. Artif. Intell..

[3] Jian-Huang Lai,et al. Normalization of Face Illumination Based on Large-and Small-Scale Features , 2011, IEEE Transactions on Image Processing.

[4] Mahadev Satyanarayanan,et al. OpenFace: A general-purpose face recognition library with mobile applications , 2016 .

[5] 拓海杉山,et al. “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[6] Alberto Del Bimbo,et al. A Dictionary Learning-Based 3D Morphable Shape Model , 2017, IEEE Transactions on Multimedia.

[7] David J. Kriegman,et al. From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[8] Takeo Kanade,et al. Multi-PIE , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[9] Qinping Zhao,et al. Face illumination transfer through edge-preserving filters , 2011, CVPR 2011.

[10] Xiaoyang Tan,et al. Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions , 2007, IEEE Transactions on Image Processing.

[11] Jian-Huang Lai,et al. Face Image Illumination Processing Based on Generative Adversarial Nets , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[12] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[13] Ronen Basri,et al. Lambertian reflectance and linear subspaces , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[14] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Takahiro Okabe,et al. Illumination normalization of face images with cast shadows , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[16] Paul E. Debevec,et al. Acquiring the reflectance field of a human face , 2000, SIGGRAPH.

[17] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.

[18] Fei Chen,et al. A Natural Visible and Infrared Facial Expression Database for Expression Recognition and Emotion Inference , 2010, IEEE Transactions on Multimedia.

[19] Lei Zhang,et al. Face synthesis and recognition from a single image under arbitrary unknown lighting using a spherical harmonic basis morphable model , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[20] Peiran REN,et al. Image based relighting using neural networks , 2015, ACM Trans. Graph..

[21] Haitao Wang,et al. Generalized quotient image , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[22] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[23] Yoshua Bengio,et al. Generative Adversarial Networks , 2014, ArXiv.

[24] Gang Hua,et al. Face Relighting from a Single Image under Arbitrary Unknown Lighting Conditions , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Chu-Song Chen,et al. Face Recognition and Retrieval Using Cross-Age Reference Coding With Cross-Age Celebrity Dataset , 2015, IEEE Transactions on Multimedia.

[26] Dacheng Tao,et al. Robust Face Recognition via Multimodal Deep Face Representation , 2015, IEEE Transactions on Multimedia.

[27] Changhui Hu,et al. IL-GAN: Illumination-invariant representation learning for single sample face recognition , 2019, J. Vis. Commun. Image Represent..

[28] Soo-Chang Pei,et al. Color Enhancement With Adaptive Illumination Estimation for Low-Backlighted Displays , 2017, IEEE Transactions on Multimedia.

[29] David J. Kriegman,et al. What Is the Set of Images of an Object Under All Possible Illumination Conditions? , 1998, International Journal of Computer Vision.

[30] Ying Li,et al. Robust Symbolic Dual-View Facial Expression Recognition With Skin Wrinkles: Local Versus Global Approach , 2010, IEEE Transactions on Multimedia.

[31] Alexei A. Efros,et al. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[32] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[33] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[34] Shimon Ullman,et al. Face Recognition: The Problem of Compensating for Changes in Illumination Direction , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[35] Wen Gao,et al. The CAS-PEAL Large-Scale Chinese Face Database and Baseline Evaluations , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[36] Jung-Woo Ha,et al. StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37] Hai-Miao Hu,et al. Naturalness Preserved Nonuniform Illumination Estimation for Image Enhancement Based on Retinex , 2017, IEEE Transactions on Multimedia.

[38] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[39] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[40] Jian Shi,et al. Efficient intrinsic image decomposition for RGBD images , 2015, VRST.