Benchmarking deep network architectures for ethnicity recognition using a new large face dataset

Although in recent years we have witnessed an explosion of the scientific research in the recognition of facial soft biometrics such as gender, age and expression with deep neural networks, the recognition of ethnicity has not received the same attention from the scientific community. The growth of this field is hindered by two related factors: on the one hand, the absence of a dataset sufficiently large and representative does not allow an effective training of convolutional neural networks for the recognition of ethnicity; on the other hand, the collection of new ethnicity datasets is far from simple and must be carried out manually by humans trained to recognize the basic ethnicity groups using the somatic facial features. To fill this gap in the facial soft biometrics analysis, we propose the VGGFace2 Mivia Ethnicity Recognition (VMER) dataset, composed by more than 3,000,000 face images annotated with 4 ethnicity categories, namely African American, East Asian, Caucasian Latin and Asian Indian. The final annotations are obtained with a protocol which requires the opinion of three people belonging to different ethnicities, in order to avoid the bias introduced by the well-known other race effect. In addition, we carry out a comprehensive performance analysis of popular deep network architectures, namely VGG-16, VGG-Face, ResNet-50 and MobileNet v2. Finally, we perform a cross-dataset evaluation to demonstrate that the deep network architectures trained with VMER generalize on different test sets better than the same models trained on the largest ethnicity dataset available so far. The ethnicity labels of the VMER dataset and the code used for the experiments are available upon request at https://mivia.unisa.it .

[1]  George Azzopardi,et al.  Fusion of Domain-Specific and Trainable Features for Gender Recognition From Face Images , 2018, IEEE Access.

[2]  C. Thomaz,et al.  A new ranking method for principal components analysis and its application to face image analysis , 2010, Image Vis. Comput..

[3]  Shan Li,et al.  Deep Facial Expression Recognition: A Survey , 2018, IEEE Transactions on Affective Computing.

[4]  Matti Pietikäinen,et al.  IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, TPAMI-2008-09-0620 1 WLD: A Robust Local Image Descriptor , 2022 .

[5]  Alberto Del Bimbo,et al.  Real-time demographic profiling from face imagery with Fisher vectors , 2018, Machine Vision and Applications.

[6]  Harry Wechsler,et al.  The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[7]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[8]  Zeng-Guang Hou,et al.  Spiking neural networks based cortex like mechanism: A case study for facial expression recognition , 2011, The 2011 International Joint Conference on Neural Networks.

[9]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[10]  Mohammad Mahdi Dehshibi,et al.  Iranian Face Database with age, pose and expression , 2007, 2007 International Conference on Machine Vision.

[11]  Mario Vento,et al.  Age from Faces in the Deep Learning Revolution , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Wen Gao,et al.  The CAS-PEAL Large-Scale Chinese Face Database and Baseline Evaluations , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[14]  Mario Vento,et al.  A system for gender recognition on mobile robots , 2019, APPIS.

[15]  Marios Savvides,et al.  A robust approach to facial ethnicity classification on large scale face databases , 2012, 2012 IEEE Fifth International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[16]  Haibo He,et al.  Learning Race from Face: A Survey , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Kimmo Kärkkäinen,et al.  FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age , 2019, ArXiv.

[18]  S. Md. Mansoor Roomi,et al.  Race Classification Based on Facial Features , 2011, 2011 Third National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics.

[19]  Huchuan Lu,et al.  A New Automatic Recognition System of Gender, Age and Ethnicity , 2006, 2006 6th World Congress on Intelligent Control and Automation.

[20]  Omkar M. Parkhi,et al.  VGGFace2: A Dataset for Recognising Faces across Pose and Age , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[21]  Rama Chellappa,et al.  HyperFace: A Deep Multi-Task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Naseer Al-Jawad,et al.  Fusing Local Binary Patterns with Wavelet Features for Ethnicity Identification , 2013 .

[23]  Karl Ricanek,et al.  MORPH: a longitudinal image database of normal adult age-progression , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[24]  Vanessa Lobue,et al.  The Child Affective Facial Expression (CAFE) set: validity and reliability from untrained adults , 2014, Front. Psychol..

[25]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[26]  Stan Z. Li,et al.  Age Estimation by Multi-scale Convolutional Network , 2014, ACCV.

[27]  Hossam M. Zawbaa,et al.  Hajj and Umrah Event Recognition Datasets , 2012, ArXiv.

[28]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[29]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[30]  Shree K. Nayar,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Describable Visual Attributes for Face Verification and Image Search , 2022 .

[31]  Bo Wu,et al.  Facial image retrieval based on demographic classification , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[32]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Guodong Guo,et al.  A framework for joint estimation of age, gender and ethnicity on a large database , 2014, Image Vis. Comput..

[34]  Dacheng Tao,et al.  A Comprehensive Survey on Pose-Invariant Face Recognition , 2015, ACM Trans. Intell. Syst. Technol..

[35]  Muhammad Ghulam,et al.  Race Classification from Face Images using Local Descriptors , 2012, Int. J. Artif. Intell. Tools.

[36]  Hujun Yin,et al.  Facial expression analysis and expression-invariant face recognition by manifold-based synthesis , 2018, Machine Vision and Applications.

[37]  Fadi Dornaika,et al.  Age estimation in facial images through transfer learning , 2018, Machine Vision and Applications.

[38]  Yihong Gong,et al.  Training Hierarchical Feed-Forward Visual Recognition Models Using Transfer Learning from Pseudo-Tasks , 2008, ECCV.

[39]  Naeem Ul Islam,et al.  Learned Features are Better for Ethnicity Classification , 2017, ArXiv.

[40]  Arun Ross,et al.  What Else Does Your Biometric Data Reveal? A Survey on Soft Biometrics , 2016, IEEE Transactions on Information Forensics and Security.

[41]  Michael J. Lyons,et al.  Automatic Classification of Single Facial Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  Alessia Saggese,et al.  An effective real time gender recognition system for smart cameras , 2020, J. Ambient Intell. Humaniz. Comput..

[43]  Genny Tortora,et al.  EGA — Ethnicity, gender and age, a pre-annotated face database , 2012, 2012 IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BIOMS) Proceedings.

[44]  Masato Kawade,et al.  Ethnicity estimation with facial images , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[45]  Shiguang Shan,et al.  Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Yang Song,et al.  Age Progression/Regression by Conditional Adversarial Autoencoder , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Sadiye Guler,et al.  Automated person categorization for video surveillance using soft biometrics , 2010, Defense + Commercial Sensing.