Facial Emotion Recognition Using Transfer Learning in the Deep CNN

Human facial emotion recognition (FER) has attracted the attention of the research community for its promising applications. The main task in FER is mapping different facial expressions to their respective emotional states. Classical FER consists of two major steps: feature extraction and emotion recognition. Currently, deep neural networks, especially convolutional neural networks (CNNs), are widely used in FER by virtue of their inherent mechanism for extracting features from images. Several works have applied CNNs with only a few layers to FER problems. However, standard shallow CNNs with straightforward learning schemes have limited feature extraction capability to capture emotion information from high-resolution images. A notable drawback of most existing methods is that they consider only frontal images (i.e., they ignore profile views for convenience), although profile views taken from different angles are important for a practical FER system. To develop a highly accurate FER system, this study proposes very deep CNN (DCNN) modeling through the transfer learning (TL) technique: a pre-trained DCNN model is adopted by replacing its dense upper layer(s) with layer(s) compatible with FER, and the model is then fine-tuned with facial emotion data. A novel pipeline strategy is introduced in which the training of the dense layer(s) is followed by tuning each of the pre-trained DCNN blocks successively, which gradually improves FER accuracy. The proposed FER system is verified on eight different pre-trained DCNN models (VGG-16, VGG-19, ResNet-18, ResNet-34, ResNet-50, ResNet-152, Inception-v3 and DenseNet-161) and on the well-known KDEF and JAFFE facial image datasets. FER is very challenging even for frontal views alone. FER on the KDEF dataset poses further challenges because its images combine different profile views with frontal views. The proposed method achieved remarkable accuracy on both datasets with pre-trained models. Under 10-fold cross-validation, the best FER accuracies, achieved with DenseNet-161 on the test sets of KDEF and JAFFE, are 96.51% and 99.52%, respectively. The evaluation results show that the proposed FER system outperforms existing ones in emotion detection accuracy. Moreover, the performance achieved on the KDEF dataset with profile views is promising, as it demonstrates the proficiency required for real-life applications.
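
The pipeline strategy described in the abstract (train the new dense layers first, then tune the pre-trained blocks one after another) can be pictured as a schedule of fine-tuning stages. The following is a minimal, framework-free sketch rather than the authors' implementation. It assumes that blocks are unfrozen cumulatively, starting with the block closest to the dense head; the abstract only says the blocks are tuned "successively". The names `Stage`, `staged_unfreezing`, `trainable_flags`, `dense_head` and the block labels are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Stage:
    """One fine-tuning stage: the parts of the network that receive updates."""
    name: str
    trainable: List[str]


def staged_unfreezing(blocks: List[str], head: str = "dense_head") -> List[Stage]:
    """Build a fine-tuning schedule.

    Stage 0 trains only the newly added dense head. Each later stage also
    unfreezes one more pre-trained block, starting with the block nearest
    the head.

    ASSUMPTION: cumulative, top-down unfreezing. The source abstract does
    not specify the order or whether earlier blocks stay trainable.
    """
    stages = [Stage(name="head_only", trainable=[head])]
    unfrozen: List[str] = []
    for block in reversed(blocks):
        unfrozen.insert(0, block)  # keep blocks in input-to-output order
        stages.append(Stage(name=f"unfreeze_{block}", trainable=unfrozen + [head]))
    return stages


def trainable_flags(stage: Stage, blocks: List[str], head: str = "dense_head") -> Dict[str, bool]:
    """Map every part of the network to True (updated) or False (frozen)."""
    return {part: part in stage.trainable for part in blocks + [head]}


if __name__ == "__main__":
    # Hypothetical block names; a real backbone defines its own grouping.
    blocks = ["block1", "block2", "block3", "block4"]
    for stage in staged_unfreezing(blocks):
        print(stage.name, trainable_flags(stage, blocks))
```

In a deep learning framework, each stage would translate into toggling the corresponding parameters (for example `requires_grad` in PyTorch or `layer.trainable` in Keras) and then training for some epochs before moving on to the next stage.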
