Deep and Shallow Covariance Feature Quantization for 3D Facial Expression Recognition

Facial expression recognition (FER) from 3D face scans has received significant attention in recent years. Most FER methods have been proposed mainly for 2D images, and they suffer from issues such as illumination changes and pose variations. Moreover, a 2D mapping of a 3D image may lose some geometric and topological characteristics of the face. To overcome these problems, we propose a multi-modal 2D + 3D feature-based method. We extract shallow features from the 3D images and deep features from the transformed 2D images using Convolutional Neural Networks (CNNs). To combine these features into a compact representation, we use covariance matrices as descriptors for both feature types instead of using the individual descriptors on their own. Covariance matrix learning is used as a manifold layer to reduce the size of the deep covariance matrices and enhance their discriminative power while preserving their manifold structure. We then use the Bag-of-Features (BoF) paradigm to quantize the covariance matrices after flattening, which yields two codebooks, one from the shallow features and one from the deep features. The resulting global codebook is then fed to an SVM classifier. The method achieves high classification performance on the BU-3DFE and Bosphorus datasets compared with state-of-the-art methods.
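The quantization step described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the random arrays stand in for real shallow and deep features, the patch counts, feature dimensions, and codebook size `k=4` are arbitrary, flattening via the upper triangle is one possible choice, and the helper names (`covariance_descriptor`, `flatten_spd`, `kmeans`, `bof_histogram`, `encode`) are hypothetical. The manifold learning layer and the SVM are omitted; the resulting histograms would be the input to any standard SVM implementation.

```python
import numpy as np

def covariance_descriptor(features, eps=1e-6):
    """Covariance of an (n_points, d) feature set, regularized to stay positive definite."""
    centered = features - features.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / max(len(features) - 1, 1)
    return cov + eps * np.eye(cov.shape[0])

def flatten_spd(cov):
    """Flatten a symmetric matrix to its upper triangle (assumed flattening scheme)."""
    rows, cols = np.triu_indices(cov.shape[0])
    return cov[rows, cols]

def kmeans(data, k, n_iter=20, seed=0):
    """Plain Euclidean k-means; returns the k cluster centers (the codebook)."""
    rng = np.random.default_rng(seed)
    centers = data[rng.choice(len(data), size=k, replace=False)]
    for _ in range(n_iter):
        dists = ((data[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = data[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return centers

def bof_histogram(descriptors, codebook):
    """Assign each descriptor to its nearest codeword and return a normalized histogram."""
    dists = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = dists.argmin(axis=1)
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / hist.sum()

def encode(modality, k):
    """Build one codebook for a modality and encode every face as a BoF histogram."""
    per_face = [
        np.stack([flatten_spd(covariance_descriptor(patch)) for patch in face])
        for face in modality
    ]
    codebook = kmeans(np.concatenate(per_face), k)
    return np.stack([bof_histogram(d, codebook) for d in per_face])

# Synthetic stand-ins: 6 faces, 12 patches per face, 50 points per patch.
rng = np.random.default_rng(0)
shallow = rng.normal(size=(6, 12, 50, 5))  # 5-D "shallow" features (assumed size)
deep = rng.normal(size=(6, 12, 50, 8))     # 8-D "deep" features (assumed size)

# One histogram per modality, concatenated into a single vector per face.
global_repr = np.hstack([encode(shallow, k=4), encode(deep, k=4)])
print(global_repr.shape)  # (6, 8)
```

Euclidean k-means on flattened matrices ignores the manifold geometry of covariance matrices; it is used here only because the abstract states that quantization is applied after flattening.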
