Coupled Attribute Learning for Heterogeneous Face Recognition

Heterogeneous face recognition (HFR) is a challenging problem in face recognition and subject to large textural and spatial structure differences of face images. Different from conventional face recognition in homogeneous environments, there exist many face images taken from different sources (including different sensors or different mechanisms) in reality. In addition, limited training samples of cross-modality pairs make HFR more challenging due to the complex generation procedure of these images. Despite the great progress that has been achieved in recent years, existing works mainly focus on HFR from only cross-modality image matching. However, it is more practical to obtain both facial images and semantic descriptions about facial attributes in real-world situations, in which the semantic description clues are nearly always obtained during the process of image generation. Motivated by human cognitive mechanisms, we naturally utilize the explicit invariant semantic description, i.e., face attributes, to help address the gap among face images of different modalities. Existing facial attributes-related face recognition methods primarily regard attributes as the high-level features used to enhance recognition performance, ignoring the inherent relationship between face attributes and identities. In this article, we propose novel coupled attribute learning for the HFR (CAL-HFR) method without labeling the attributes manually. Deep convolutional networks are employed to directly map face images in heterogeneous scenarios to a compact common space where distances are taken as dissimilarities of pairs. Coupled attribute guided triplet loss (CAGTL) is designed to train an end-to-end HFR network that can effectively eliminate defects of incorrectly estimated attributes. Extensive experiments on multiple heterogeneous scenarios demonstrate that the proposed method achieves superior performance compared with that of state-of-the-art methods. Furthermore, we make publicly available our generated pairwise annotated heterogeneous facial attribute database for evaluation and promoting related research.

[1]  Arun Ross,et al.  Score normalization in multimodal biometric systems , 2005, Pattern Recognit..

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  Ming Shao,et al.  Cross-Modality Feature Learning Through Generic Hierarchical Hyperlingual-Words , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[4]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Zhengming Ding,et al.  Robust Multiview Data Analysis Through Collective Low-Rank Subspace , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Hyeonjoon Moon,et al.  The FERET evaluation methodology for face-recognition algorithms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Chunna Tian,et al.  Face Sketch Synthesis using E-HMM and Selective Ensemble , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[8]  Hanqing Lu,et al.  A nonlinear approach for face sketch synthesis and recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[9]  Anil K. Jain,et al.  Matching Composite Sketches to Face Photos: A Component-Based Approach , 2013, IEEE Transactions on Information Forensics and Security.

[10]  Anil K. Jain,et al.  Heterogeneous Face Recognition Using Kernel Prototype Similarities , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Tieniu Tan,et al.  Transferring deep representation for NIR-VIS heterogeneous face recognition , 2016, 2016 International Conference on Biometrics (ICB).

[12]  Xueming Li,et al.  ForgetMeNot: Memory-Aware Forensic Facial Sketch Matching , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Shiguang Shan,et al.  Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Xinbo Gao,et al.  A Deep Collaborative Framework for Face Photo–Sketch Synthesis , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Qingming Huang,et al.  Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[16]  Xuelong Li,et al.  Cascade Learning by Optimally Partitioning , 2015, IEEE Transactions on Cybernetics.

[17]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[18]  Xuelong Li,et al.  Triply Supervised Decoder Networks for Joint Detection and Segmentation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Xueming Li,et al.  Cross-Modal Face Matching: Beyond Viewed Sketches , 2014, ACCV.

[20]  Richa Singh,et al.  Recognizing composite sketches with digital face images via SSD dictionary , 2014, IEEE International Joint Conference on Biometrics.

[21]  Shiguang Shan,et al.  Two Birds, One Stone: Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[22]  Yinghuan Shi,et al.  Heterogeneous Face Recognition by Margin-Based Cross-Modality Metric Learning , 2018, IEEE Transactions on Cybernetics.

[23]  Richa Singh,et al.  Composite sketch recognition using saliency and attribute feedback , 2017, Inf. Fusion.

[24]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[25]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Stan Z. Li,et al.  Coupled Spectral Regression for matching heterogeneous faces , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Tieniu Tan,et al.  Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Jie Li,et al.  Deep Attribute Guided Representation for Heterogeneous Face Recognition , 2018, IJCAI.

[29]  Kiyoharu Aizawa,et al.  Category-Based Deep CCA for Fine-Grained Venue Discovery From Multimodal Data , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[30]  Yinghuan Shi,et al.  Cross-Modal Metric Learning for AUC Optimization , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[31]  Dahua Lin,et al.  Inter-modality Face Recognition , 2006, ECCV.

[32]  Aleix M. Martinez,et al.  The AR face database , 1998 .

[33]  Xinbo Gao,et al.  Sparse Graphical Representation based Discriminant Analysis for Heterogeneous Face Recognition , 2016, Signal Process..

[34]  Yongxin Yang,et al.  Attribute-Enhanced Face Recognition with Neural Tensor Fusion Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[35]  Jian Wang,et al.  JCS-Net: Joint Classification and Super-Resolution Network for Small-Scale Pedestrian Detection in Surveillance Images , 2019, IEEE Transactions on Information Forensics and Security.

[36]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[37]  Lei Chen,et al.  Deep Cross-Modal Correlation Learning for Audio and Lyrics in Music Retrieval , 2017, ACM Trans. Multim. Comput. Commun. Appl..

[38]  Jakob Verbeek,et al.  Heterogeneous Face Recognition with CNNs , 2016, ECCV Workshops.

[39]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[40]  Shiguang Shan,et al.  Multi-View Discriminant Analysis , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Amit R.Sharma,et al.  Face Photo-Sketch Synthesis and Recognition , 2012 .

[42]  Anil K. Jain,et al.  Suspect identification based on descriptive facial attributes , 2014, IEEE International Joint Conference on Biometrics.

[43]  Zhou Yu,et al.  Beyond Bilinear: Generalized Multimodal Factorized High-Order Pooling for Visual Question Answering , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[44]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[45]  Xinbo Gao,et al.  Composite components-based face sketch recognition , 2018, Neurocomputing.

[46]  Reuben A. Farrugia,et al.  Matching Software-Generated Sketches to Face Photographs With a Very Deep CNN, Morphed Faces, and Transfer Learning , 2018, IEEE Transactions on Information Forensics and Security.

[47]  Shree K. Nayar,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Describable Visual Attributes for Face Verification and Image Search , 2022 .

[48]  Tieniu Tan,et al.  Learning Invariant Deep Representation for NIR-VIS Face Recognition , 2017, AAAI.

[49]  Jun Yu,et al.  Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Tal Hassner,et al.  Effective Unconstrained Face Recognition by Combining Multiple Descriptors and Learned Background Statistics , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Yunsong Li,et al.  Deep Latent Low-Rank Representation for Face Sketch Synthesis , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[52]  Feiping Nie,et al.  Simultaneously Learning Neighborship and Projection Matrix for Supervised Dimensionality Reduction , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[53]  Marwan Mattar,et al.  Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .

[54]  Xuelong Li,et al.  Face Sketch Synthesis by Multidomain Adversarial Learning , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[55]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Xinbo Gao,et al.  Face Sketch Synthesis in the Wild via Deep Patch Representation-Based Probabilistic Graphical Model , 2020, IEEE Transactions on Information Forensics and Security.

[57]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[58]  Hao Zhou,et al.  Markov Weight Fields for face sketch synthesis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Xiaogang Wang,et al.  Face sketch synthesis and recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[60]  Jiri Matas,et al.  XM2VTSDB: The Extended M2VTS Database , 1999 .

[61]  Richa Singh,et al.  Composite sketch recognition via deep network - a transfer learning approach , 2015, 2015 International Conference on Biometrics (ICB).

[62]  Xinbo Gao,et al.  Graphical Representation for Heterogeneous Face Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[63]  Shengcai Liao,et al.  Heterogeneous Face Recognition from Local Structures of Normalized Appearance , 2009, ICB.

[64]  Shengcai Liao,et al.  The CASIA NIR-VIS 2.0 Face Database , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[65]  Wen Gao,et al.  The CAS-PEAL Large-Scale Chinese Face Database and Baseline Evaluations , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[66]  Xinbo Gao,et al.  Data Augmentation-Based Joint Learning for Heterogeneous Face Recognition , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[67]  Harry Wechsler,et al.  The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[68]  Jiwen Lu,et al.  Simultaneous Local Binary Feature Learning and Encoding for Homogeneous and Heterogeneous Face Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[69]  Xuelong Li,et al.  Learning Sampling Distributions for Efficient Object Detection , 2015, IEEE Transactions on Cybernetics.

[70]  Anil K. Jain,et al.  The FaceSketchID System: Matching Facial Composites to Mugshots , 2014, IEEE Transactions on Information Forensics and Security.

[71]  Xuelong Li,et al.  Multiple Representations-Based Face Sketch–Photo Synthesis , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[72]  Jie Li,et al.  DLFace: Deep local descriptor for cross-modality face recognition , 2019, Pattern Recognit..

[73]  Xuelong Li,et al.  Heterogeneous Face Recognition: A Common Encoding Feature Discriminant Approach , 2017, IEEE Transactions on Image Processing.