SADG: Self-Aligned Dual NIR-VIS Generation for Heterogeneous Face Recognition

Heterogeneous face recognition (HFR) has aroused significant interest in recent years, with some challenging tasks such as misalignment problems and limited HFR data. Misalignment occurs among different modalities’ images mainly because of misaligned semantics. Although recent methods have attempted to settle the low-shot problem, they suffer from the misalignment problem between paired near infrared (NIR) and visible (VIS) images. Misalignment can bring performance degradation to most image-to-image translation networks. In this work, we propose a self-aligned dual generation (SADG) architecture for generating semantics-aligned pairwise NIR-VIS images with the same identity, but without the additional guidance of external information learning. Specifically, we propose a self-aligned generator to align the data distributions between two modalities. Then, we present a multiscale patch discriminator to get high quality images. Furthermore, we raise the mean landmark distance (MLD) to test the alignment performance between NIR and VIS images with the same identity. Extensive experiments and an ablation study of SADG on three public datasets show significant alignment performance and recognition results. Specifically, the Rank1 accuracy achieved was close to 99.9% for the CASIA NIR-VIS 2.0, Oulu-CASIA NIR-VIS and BUAA VIS-NIR datasets, respectively.

[1]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[4]  Ning Cao,et al.  A New Face Recognition Method for Intelligent Security , 2020 .

[5]  Qiuqi Ruan,et al.  Multi-task clustering ELM for VIS-NIR cross-modal feature learning , 2017, Multidimens. Syst. Signal Process..

[6]  Tieniu Tan,et al.  Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Shengcai Liao,et al.  Coupled Discriminant Analysis for Heterogeneous Face Recognition , 2012, IEEE Transactions on Information Forensics and Security.

[8]  Shiguang Shan,et al.  Multi-View Discriminant Analysis , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Shiyu Chen,et al.  Multi-Pose Face Recognition Based on Deep Learning in Unconstrained Scene , 2020, Applied Sciences.

[10]  Tieniu Tan,et al.  A Light CNN for Deep Face Representation With Noisy Labels , 2015, IEEE Transactions on Information Forensics and Security.

[11]  Dong Yi,et al.  Face Matching Between Near Infrared and Visible Light Images , 2007, ICB.

[12]  Yinghuan Shi,et al.  Heterogeneous Face Recognition by Margin-Based Cross-Modality Metric Learning , 2018, IEEE Transactions on Cybernetics.

[13]  M. Saquib Sarfraz,et al.  Deep Perceptual Mapping for Cross-Modal Face Recognition , 2016, International Journal of Computer Vision.

[14]  Dahua Lin,et al.  Inter-modality Face Recognition , 2006, ECCV.

[15]  Vishal M. Patel,et al.  Synthesis of High-Quality Visible Faces from Polarimetric Thermal Faces using Generative Adversarial Networks , 2018, International Journal of Computer Vision.