Dynamic Facial Emotion Recognition Using Deep Spatial Feature and Handcrafted Spatiotemporal Feature on Spark

One important challenge of dynamic facial emotion recognition is to effectively obtain the spatial and dynamic change of face structure from videos. Besides, there is an increasing demand for distributed computing of videos, as a result of the speedy production of videos from numerous multimedia sources. To address the above issues, in this work, we propose a novel method for dynamic facial emotion recognition on top of Spark. Furthermore, we introduce an effective dynamic feature descriptor namely, Volume Symmetric Local Graph Structure (VSLGS), which extracts the spatiotemporal features. We also utilize the convolutional neural network (CNN) to obtain deep spatial features. Lastly, these obtained features are concatenated and fed to Spark MLlib Multilayer Perceptron (MLP) classifier to recognize the dynamic facial emotions. An extensive experimental investigation is performed to prove the effectiveness of our method over state-of-the-art methods. Furthermore, we also showed the scalability of the proposed method experimentally.

[1]  ByoungChul Ko,et al.  A Brief Review of Facial Emotion Recognition Based on Visual Information , 2018, Sensors.

[2]  Young-Koo Lee,et al.  Human Action Recognition Using Adaptive Local Motion Descriptor in Spark , 2017, IEEE Access.

[3]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[4]  Shaogang Gong,et al.  Facial expression recognition based on Local Binary Patterns: A comprehensive study , 2009, Image Vis. Comput..

[5]  Matti Pietikäinen,et al.  Facial expression recognition from near-infrared video sequences , 2008, 2008 19th International Conference on Pattern Recognition.

[6]  Mubarak Shah,et al.  A 3-dimensional sift descriptor and its application to action recognition , 2007, ACM Multimedia.

[7]  John See,et al.  Efficient Spatio-Temporal Local Binary Patterns for Spontaneous Facial Micro-Expression Recognition , 2015, PloS one.

[8]  Young-Koo Lee,et al.  Dynamic Scene Recognition Using Spatiotemporal Based DLTP on Spark , 2018, IEEE Access.

[9]  Shiguang Shan,et al.  Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis , 2014, ACCV.

[10]  Jingyu Yang,et al.  Facial expression recognition based on histogram sequence of local Gabor binary patterns , 2008, 2008 IEEE Conference on Cybernetics and Intelligent Systems.

[11]  John See,et al.  LBP with Six Intersection Points: Reducing Redundant Information in LBP-TOP for Micro-expression Recognition , 2014, ACCV.

[12]  Matti Pietikäinen,et al.  Facial expression recognition from near-infrared videos , 2011, Image Vis. Comput..

[13]  W. Marsden I and J , 2012 .

[14]  Junmo Kim,et al.  Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[15]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Matti Pietikäinen,et al.  Discriminative Spatiotemporal Local Binary Pattern with Revisited Integral Projection for Spontaneous Facial Micro-Expression Recognition , 2019, IEEE Transactions on Affective Computing.

[17]  Mohammed Bennamoun,et al.  A spatio-temporal RBM-based model for facial expression recognition , 2016, Pattern Recognit..

[18]  Shiguang Shan,et al.  Learning Expressionlets on Spatio-temporal Manifold for Dynamic Facial Expression Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Takeo Kanade,et al.  The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[20]  Cordelia Schmid,et al.  A Spatio-Temporal Descriptor Based on 3D-Gradients , 2008, BMVC.

[21]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Shiguang Shan,et al.  Learning Expressionlets via Universal Manifold Model for Dynamic Facial Expression Recognition , 2015, IEEE Transactions on Image Processing.

[23]  Yong Du,et al.  Facial Expression Recognition Based on Deep Evolutional Spatial-Temporal Networks , 2017, IEEE Transactions on Image Processing.

[24]  Jingjing Wang,et al.  Image retrieval and classification on deep convolutional SparkNet , 2016, 2016 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC).

[25]  Andreas E. Savakis,et al.  Manifold based Sparse Representation for robust expression recognition without neutral subtraction , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[26]  Maja Pantic,et al.  Facial action recognition for facial expression analysis from static face images , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[27]  Yang Wang,et al.  BigDL: A Distributed Deep Learning Framework for Big Data , 2018, SoCC.

[28]  Shuai Yang,et al.  Efficient large scale near-duplicate video detection base on spark , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[29]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Jürgen Schmidhuber,et al.  Facial Expression Recognition with Recurrent Neural Networks , 2008 .

[31]  Nicholas Costen,et al.  Micro-Facial Movements: An Investigation on Spatio-Temporal Descriptors , 2014, ECCV Workshops.