Enhancing CNN with Preprocessing Stage in Automatic Emotion Recognition

Emotion recognition from facial expression is the subfield of social signal processing which is applied in wide variety of areas, specifically for human and computer interaction. Many researches have been proposed for automatic emotion recognition, which is fundamentally using machine learning approach. However, recognizing basic emotions such as angry, happy, disgust, fear, sad, and surprise is still becoming a challenging problem in computer vision. Lately, deep learning has gained more attention to solve many real-world problems, including emotion recognition. In this research, we enhanced Convolutional Neural Network method to recognize 6 basic emotions and compared some preprocessing methods to show the influences of its in CNN performance. The compared data preprocessing methods are: resizing, face detection, cropping, adding noises, and data normalization consists of local normalization, global contrast normalization and histogram equalization. Face detection as single pre-processing phase achieved significant result with 86.08 % of accuracy, compared with another pre-processing phase and raw data. However, by combining those techniques can boost performance of CNN and achieved 97.06% of accuracy.

[1]  Anastasios Delopoulos,et al.  The MUG facial expression database , 2010, 11th International Workshop on Image Analysis for Multimedia Interactive Services WIAMIS 10.

[2]  Shaogang Gong,et al.  Facial expression recognition based on Local Binary Patterns: A comprehensive study , 2009, Image Vis. Comput..

[3]  Sébastien Ouellet,et al.  Real-time emotion recognition for gaming using deep convolutional network features , 2014, ArXiv.

[4]  Michael J. Lyons,et al.  Coding facial expressions with Gabor wavelets , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[5]  Radhika M. Pai,et al.  Automatic Facial Expression Recognition Using DCNN , 2016 .

[6]  Thomas S. Huang,et al.  Do Deep Neural Networks Learn Facial Action Units When Doing Expression Recognition? , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[7]  Xue-wen Chen,et al.  Big Data Deep Learning: Challenges and Perspectives , 2014, IEEE Access.

[8]  Edilson de Aguiar,et al.  Facial expression recognition with Convolutional Neural Networks: Coping with few data and the training sample order , 2017, Pattern Recognit..

[9]  M. C. Hanumantharaju,et al.  Review of Various Histogram Based Medical Image Enhancement Techniques , 2015, ICARCSET '15.

[10]  Mohammad H. Mahoor,et al.  Going deeper in facial expression recognition using deep neural networks , 2015, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[11]  Takeo Kanade,et al.  The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[12]  P. Ekman An argument for basic emotions , 1992 .

[13]  H. Emrah Tasli,et al.  Deep learning based FACS Action Unit occurrence and intensity estimation , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[14]  Takeo Kanade,et al.  Facial Expression Recognition , 2011, Handbook of Face Recognition.

[15]  Dewi Yanti Liliana,et al.  Human emotion recognition based on active appearance model and semi-supervised fuzzy C-means , 2016, 2016 International Conference on Advanced Computer Science and Information Systems (ICACSIS).

[16]  Yoshua Bengio,et al.  Maxout Networks , 2013, ICML.

[17]  Jun-Cheol Park,et al.  A Real-time Facial Expression Recognizer using Deep Neural Network , 2016, IMCOM.