Facial Sentiment Classification Based on Resnet-18 Model

Recently, human-computer interaction is become increasingly common in daily life, in which facial expression recognition has a wide application potential. At present, the state-of-the-art accuracy of facial expression classification based on the Kaggle database is only 71.20%, which still indicates some room for improvement. Based on the resnet-18 model, we find that a large number of parameters in the model are prone to over-fitting, which hinders the face recognition accuracy. In this paper, we propose a modified network model based on the resnet18-network, in which we change the original average pooling layer to a global average pooling layer with a double convolution layer. Our experiment result suggests that our model outperformed the state-of-the-art results showing a 1.49% increase in accuracy on the Kaggle database.

[1]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[2]  Yichuan Tang,et al.  Deep Learning using Support Vector Machines , 2013, ArXiv.

[3]  Jean Meunier,et al.  Emotion recognition using dynamic grid-based HoG features , 2011, Face and Gesture 2011.

[4]  Arman Savran,et al.  Temporal Bayesian Fusion for Affect Sensing: Combining Video, Audio, and Lexical Modalities , 2015, IEEE Transactions on Cybernetics.

[5]  Sergio Escalera,et al.  Survey on RGB, 3D, Thermal, and Multimodal Approaches for Facial Expression Recognition: History, Trends, and Affect-Related Applications , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  S. Demleitner [Communication without words]. , 1997, Pflege aktuell.

[7]  Soo-Young Lee,et al.  Fusing Aligned and Non-aligned Face Information for Automatic Affect Recognition in the Wild: A Deep Learning Approach , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[8]  Yoshua Bengio,et al.  Challenges in representation learning: A report on three machine learning contests , 2013, Neural Networks.

[9]  Yichuan Tang,et al.  Deep Learning using Linear Support Vector Machines , 2013, 1306.0239.

[10]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Jun-Cheol Park,et al.  A Real-time Facial Expression Recognizer using Deep Neural Network , 2016, IMCOM.