Pose-specific non-linear mappings in feature space towards multiview facial expression recognition

We introduce a novel approach to recognizing facial expressions over a large range of head poses. Like previous approaches, we map the features extracted from the input image to the corresponding features of the face with the same facial expression but seen in a frontal view. This allows us to collect all training data into a common referential and therefore benefit from more data to learn to recognize the expressions. However, by contrast with such previous work, our mapping depends on the pose of the input image: We first estimate the pose of the head in the input image, and then apply the mapping specifically learned for this pose. The features after mapping are therefore much more reliable for recognition purposes. In addition, we introduce a non-linear form for the mapping of the features, and we show that it is robust to occasional mistakes made by the pose estimation stage. We evaluate our approach with extensive experiments on two protocols of the BU3DFE and Multi-PIE datasets, and show that it outperforms the state-of-the-art on both datasets. We propose a novel approach to recognizing facial expressions over a large range of head poses.We introduce a non-linear form for the mapping of the features that depends on the pose of the input image.This mapping is robust to occasional mistakes made by the pose estimation stage.

[1]  Ioannis Pitas,et al.  Texture and shape information fusion for facial expression and facial action unit recognition , 2008, Pattern Recognit..

[2]  Takeo Kanade,et al.  Multi-PIE , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[3]  Lei Zhang,et al.  Sparse representation or collaborative representation: Which helps face recognition? , 2011, 2011 International Conference on Computer Vision.

[4]  Fernando De la Torre,et al.  Facial Action Unit Event Detection by Cascade of Tasks , 2013, 2013 IEEE International Conference on Computer Vision.

[5]  Hazim Kemal Ekenel,et al.  Multi-view facial expression recognition using local appearance features , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[6]  Hassen Drira,et al.  4-D Facial Expression Recognition by Learning Geometric Deformations , 2014, IEEE Transactions on Cybernetics.

[7]  Mohammed Bennamoun,et al.  Linear Regression for Face Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Mohammad Rahmati,et al.  Facial expression recognition using sparse coding , 2013, 2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP).

[9]  Vinod Chandran,et al.  Facial expression recognition experiments with data from television broadcasts and the World Wide Web , 2014, Image Vis. Comput..

[10]  Wenming Zheng,et al.  Multi-View Facial Expression Recognition Based on Group Sparse Reduced-Rank Regression , 2014, IEEE Transactions on Affective Computing.

[11]  Matti Pietikäinen,et al.  Dynamic Facial Expression Recognition Using Boosted Component-Based Spatiotemporal Features and Multi-classifier Fusion , 2010, ACIVS.

[12]  Gwen Littlewort,et al.  A discriminative parts based model approach for fiducial points free and shape constrained head pose normalisation in the wild , 2014, IEEE Winter Conference on Applications of Computer Vision.

[13]  Weifeng Liu,et al.  Facial expression recognition based on discriminative dictionary learning , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[15]  Thomas S. Huang,et al.  Non-frontal view facial expression recognition based on ergodic hidden Markov model supervectors , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[16]  Rama Chellappa,et al.  Towards view-invariant expression analysis using analytic shape manifolds , 2011, Face and Gesture 2011.

[17]  Jun Wang,et al.  A 3D facial expression database for facial behavior research , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[18]  Zhihong Zeng,et al.  A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions , 2009, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Luc Van Gool,et al.  Anchored Neighborhood Regression for Fast Example-Based Super-Resolution , 2013, 2013 IEEE International Conference on Computer Vision.

[20]  Eugene Santos,et al.  Infusing Social Networks With Culture , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[21]  Vincent Lepetit,et al.  Are sparse representations really relevant for image classification? , 2011, CVPR 2011.

[22]  Karim Faez,et al.  Unrestricted pose-invariant face recognition by sparse dictionary matrix , 2015, Image Vis. Comput..

[23]  Vladimir Pavlovic,et al.  Multi-output Laplacian dynamic ordinal regression for facial expression recognition and intensity estimation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Lijun Yin,et al.  Multi-view facial expression recognition , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[25]  Stefanos Zafeiriou,et al.  Incremental Face Alignment in the Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Matti Pietikäinen,et al.  Emotion recognition from facial images with arbitrary views , 2013, BMVC.

[27]  Thomas S. Huang,et al.  Multi-view Facial Expression Recognition Analysis with Generic Sparse Coding Feature , 2012, ECCV Workshops.

[28]  Richard Bowden,et al.  Local binary patterns for multi-view facial expression recognition , 2011 .

[29]  Thomas Mauthner,et al.  Pairwise linear regression: An efficient and fast multi-view facial expression recognition , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[30]  Joel A. Tropp,et al.  Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit , 2007, IEEE Transactions on Information Theory.

[31]  Maja Pantic,et al.  Coupled Gaussian processes for pose-invariant facial expression recognition , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Lijun Yin,et al.  A study of non-frontal-view facial expressions recognition , 2008, 2008 19th International Conference on Pattern Recognition.

[33]  Thomas S. Huang,et al.  Emotion Recognition from Arbitrary View Facial Images , 2010, ECCV.

[34]  Fernando De la Torre,et al.  Selective Transfer Machine for Personalized Facial Action Unit Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Hubert Konik,et al.  Framework for reliable, real-time facial expression recognition for low resolution images , 2013, Pattern Recognit. Lett..

[36]  Maja Pantic,et al.  Regression-Based Multi-view Facial Expression Recognition , 2010, 2010 20th International Conference on Pattern Recognition.

[37]  Simon Lucey,et al.  Regression-Based Image Alignment for General Object Categories , 2014, ArXiv.

[38]  P. Ekman,et al.  Facial action coding system: a technique for the measurement of facial movement , 1978 .

[39]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[40]  Thomas S. Huang,et al.  Supervised super-vector encoding for facial expression recognition , 2014, Pattern Recognit. Lett..

[41]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[42]  Fei Yang,et al.  Non-manual grammatical marker recognition based on multi-scale, spatio-temporal analysis of head pose and facial expressions , 2014, Image Vis. Comput..

[43]  Matti Pietikäinen,et al.  Performance evaluation of texture measures with classification based on Kullback discrimination of distributions , 1994, Proceedings of 12th International Conference on Pattern Recognition.