Facial Expression Recognition Robust to Occlusion and to Intra-Similarity Problem Using Relevant Subsampling

This paper proposes facial expression recognition (FER) with the wild data set. In particular, this paper chiefly deals with two issues, occlusion and intra-similarity problems. The attention mechanism enables one to use the most relevant areas of facial images for specific expressions, and the triplet loss function solves the intra-similarity problem that sometimes fails to aggregate the same expression from different faces and vice versa. The proposed approach for the FER is robust to occlusion, and it uses a spatial transformer network (STN) with an attention mechanism to utilize specific facial region that dominantly contributes (or that is the most relevant) to particular facial expressions, e.g., anger, contempt, disgust, fear, joy, sadness, and surprise. In addition, the STN model is connected to the triplet loss function to improve the recognition rate which outperforms the existing approaches that employ cross-entropy or other approaches using only deep neural networks or classical methods. The triplet loss module alleviates limitations of the intra-similarity problem, leading to further improvement of the classification. Experimental results are provided to substantiate the proposed approach for FER, and the result outperforms the recognition rate in more practical cases, e.g., occlusion. The quantitative result provides FER results with more than 2.09% higher accuracy compared to the existing FER results in CK+ data sets and 0.48% higher than the accuracy of the results with the modified ResNet model in the FER2013 data set.

[1]  A. Yazdani,et al.  Diagnosis of COVID-19 Disease in Chest CT-Scan Images Based on Combination of Low-Level Texture Analysis and MobileNetV2 Features , 2022, Computational intelligence and neuroscience.

[2]  L. V. Savchenko,et al.  Classifying Emotions and Engagement in Online Learning Based on a Single Facial Expression Recognition Neural Network , 2022, IEEE Transactions on Affective Computing.

[3]  A. Karpov,et al.  In search of a robust facial expressions recognition model: A large-scale visual cross-corpus study , 2022, Neurocomputing.

[4]  P. M. Arunkumar,et al.  Deep Learning for Forgery Face Detection Using Fuzzy Fisher Capsule Dual Graph , 2022, Inf. Technol. Control..

[5]  Valerio Basile,et al.  Local Multi-Head Channel Self-Attention for Facial Expression Recognition , 2021, Inf..

[6]  David Griol,et al.  Multimodal Emotion Recognition on RAVDESS Dataset Using Transfer Learning , 2021, Sensors.

[7]  Zhihui Wang,et al.  Facial Expression Recognition with Attention Mechanism , 2021, 2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI).

[8]  Fernando Fernández-Martínez,et al.  Guided Spatial Transformers for Facial Expression Recognition , 2021, Applied Sciences.

[9]  Kidiyo Kpalma,et al.  Learning Vision Transformer with Squeeze and Excitation for Facial Expression Recognition , 2021, ArXiv.

[10]  P. Maragos,et al.  Exploiting Emotional Dependencies with Graph Convolutional Networks for Facial Expression Recognition , 2021, 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021).

[11]  G. Montazer,et al.  Using Self-Supervised Auxiliary Tasks to Improve Fine-Grained Facial Representation , 2021, SSRN Electronic Journal.

[12]  Hui Yu,et al.  A review on the attention mechanism of deep learning , 2021, Neurocomputing.

[13]  Anis Yazidi,et al.  FER-net: facial expression recognition using deep neural net , 2021, Neural Computing and Applications.

[14]  Andy Kyung-yong Yoon,et al.  A Study on Overcoming Disturbance Light using Polarization Filter and Performance Improvement of Face Recognition System , 2020, Journal of Multimedia Information System.

[15]  Yue Zhao,et al.  A Convolutional Neural Network for Compound Micro-Expression Recognition , 2019, Sensors.

[16]  Shuyuan Yang,et al.  A Survey of Deep Learning-Based Object Detection , 2019, IEEE Access.

[17]  Yu Qiao,et al.  Frame Attention Networks for Facial Expression Recognition in Videos , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[18]  Jianfei Yang,et al.  Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition , 2019, IEEE Transactions on Image Processing.

[19]  Shiguang Shan,et al.  Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism , 2019, IEEE Transactions on Image Processing.

[20]  Shervin Minaee,et al.  Deep-Emotion: Facial Expression Recognition Using Attentional Convolutional Network , 2019, Sensors.

[21]  Changhu Wang,et al.  Jersey Number Recognition with Semi-Supervised Spatial Transformer Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[22]  Radu Tudor Ionescu,et al.  Local Learning With Deep and Handcrafted Features for Facial Expression Recognition , 2018, IEEE Access.

[23]  Shan Li,et al.  Deep Facial Expression Recognition: A Survey , 2018, IEEE Transactions on Affective Computing.

[24]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[25]  Mohammad H. Mahoor,et al.  Facial Expression Recognition Using Enhanced Deep 3D Convolutional Neural Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[26]  Zhigeng Pan,et al.  Facial Expression Recognition with CNN Ensemble , 2016, 2016 International Conference on Cyberworlds (CW).

[27]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Xiong Chen,et al.  Facial expression recognition from image sequences using twofold random forest classifier , 2015, Neurocomputing.

[29]  Mohammad H. Mahoor,et al.  Going deeper in facial expression recognition using deep neural networks , 2015, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[30]  Max Jaderberg,et al.  Spatial Transformer Networks , 2015, NIPS.

[31]  Wojciech Zaremba,et al.  Recurrent Neural Network Regularization , 2014, ArXiv.

[32]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[33]  Yoshua Bengio,et al.  Challenges in representation learning: A report on three machine learning contests , 2013, Neural Networks.

[34]  Takeo Kanade,et al.  The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[35]  Trevor Darrell,et al.  Hidden Conditional Random Fields for Gesture Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[36]  Masakazu Matsugu,et al.  Subject independent facial expression recognition with robust face detection using a convolutional neural network , 2003, Neural Networks.

[37]  Beat Fasel,et al.  Robust face analysis using convolutional neural networks , 2002, Object recognition supported by user interaction for service robots.

[38]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[39]  S. Eddy Hidden Markov models. , 1996, Current opinion in structural biology.

[40]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[41]  P. Ekman An argument for basic emotions , 1992 .

[42]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[43]  Qiang Ji,et al.  Knowledge Augmented Deep Neural Networks for Joint Facial Expression and Action Unit Recognition , 2020, NeurIPS.

[44]  Ioannis Hatzilygeroudis,et al.  Deep Learning Approaches for Facial Emotion Recognition: A Case Study on FER-2013 , 2018 .

[45]  M. Mahoor,et al.  Facial expression recognition using lp-norm MKL multiclass-SVM , 2015 .

[46]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[47]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[48]  Jeffrey T. Henrikson Completeness and Total Boundedness of the Hausdorff Metric , 1999 .

[49]  Yann LeCun,et al.  Generalization and network design strategies , 1989 .