Boosting Hankel matrices for face emotion recognition and pain detection

HighligthsDynamics of face expression descriptors are modeled for emotion recognition.A set of Hankel matrices is built upon several multi-scale face representations.Boosting and random subspace projection are used for dynamics selection.Dynamics of Haar-like features and Gabor Energies are compared.Fine-grained dynamics of subtle expressions can be modeled at small spatial scales. Studies in psychology have shown that the dynamics of emotional expressions play an important role in face emotion recognition in humans. Motivated by these studies, in this paper the dynamics of face expressions are modeled and used for automatic emotion recognition and pain detection.Given a temporal sequence of face images, several appearance-based descriptors are computed at each frame. Over the sequence, the descriptors corresponding to the same feature type and spatial scale define a time series. The Hankel matrix built upon each time series is used to represent the dynamics of face expressions with respect to the used feature-scale pair.The set of Hankel matrices obtained by varying the feature type and the scale is used within a boosting approach to train a strong classifier. During training, random subspace projection is adopted for feature and scale selection.Experiments on two challenging publicly available datasets show that the dynamics of appearance-based face expression representations can be used to discriminate among different emotion classes and, within a boosting approach, attain state-of-the-art average accuracy values in classification.

[1]  Mats Viberg,et al.  Subspace-based methods for the identification of linear time-invariant systems , 1995, Autom..

[2]  Gwen Littlewort,et al.  Dynamics of Facial Expression Extracted Automatically from Video , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[3]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[4]  Gwen Littlewort,et al.  Recognizing facial expression: machine learning and application to spontaneous behavior , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Yi Lin,et al.  Random Forests and Adaptive Nearest Neighbors , 2006 .

[6]  Hongying Meng,et al.  Affective State Level Recognition in Naturalistic Facial and Vocal Expressions , 2014, IEEE Transactions on Cybernetics.

[7]  J. N. Bassili Emotion recognition: the role of facial movement and the relative importance of upper and lower areas of the face. , 1979, Journal of personality and social psychology.

[8]  Donald A. Adjeroh,et al.  Random KNN feature selection - a fast and stable alternative to Random Forests , 2011, BMC Bioinformatics.

[9]  Maja Pantic,et al.  Fully automatic facial feature point detection using Gabor feature based boosted classifiers , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.

[10]  Ki-Chung Chung,et al.  Face recognition using principal component analysis of Gabor filter responses , 1999, Proceedings International Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems. In Conjunction with ICCV'99 (Cat. No.PR00378).

[11]  Matti Pietikäinen,et al.  Boosted multi-resolution spatiotemporal descriptors for facial expression recognition , 2009, Pattern Recognit. Lett..

[12]  K. Scherer,et al.  Studying the dynamics of emotional expression using synthesized facial muscle movements. , 2000, Journal of personality and social psychology.

[13]  Fengqi Yu,et al.  Sequential Active Appearance Model Based on Online Instance Learning , 2013, IEEE Signal Processing Letters.

[14]  S. Gong,et al.  Conditional Mutual Information Based Boosting for Facial Expression Recognition , 2005 .

[15]  Qingshan Liu,et al.  Boosting encoded dynamic features for facial expression recognition , 2009, Pattern Recognit. Lett..

[16]  James P. Carson,et al.  Automated measurement of heterogeneity in CT images of healthy and diseased rat lungs using variogram analysis of an octree decomposition , 2014, BMC Medical Imaging.

[17]  Wen Gao,et al.  Face recognition using Ada-Boosted Gabor features , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[18]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Marco La Cascia,et al.  An on-line learning method for face association in personal photo collection , 2012, Image Vis. Comput..

[21]  Yang Wang,et al.  Boosting for Learning Multiple Classes with Imbalanced Class Distribution , 2006, Sixth International Conference on Data Mining (ICDM'06).

[22]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[24]  James M. Rehg,et al.  Decoding Children's Social Behavior , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Agata Rozga,et al.  Joint Alignment and Modeling of Correlated Behavior Streams , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[26]  Donato Cascio,et al.  Mammographic images segmentation based on chaotic map clustering algorithm , 2014, BMC Medical Imaging.

[27]  Marco La Cascia,et al.  Ensemble of Hankel Matrices for Face Emotion Recognition , 2015, ICIAP.

[28]  Gwen Littlewort,et al.  Real Time Face Detection and Facial Expression Recognition: Development and Applications to Human Computer Interaction. , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[29]  Fernando De la Torre,et al.  Continuous AU intensity estimation using localized, sparse facial feature space , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[30]  Fei Cheng,et al.  Facial Expression Recognition in JAFFE Dataset Based on Gaussian Process Classification , 2010, IEEE Transactions on Neural Networks.

[31]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[32]  Qiang Ji,et al.  Face and Facial Expression Recognition from Real World Videos , 2015, Lecture Notes in Computer Science.

[33]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Shaogang Gong,et al.  Facial expression recognition based on Local Binary Patterns: A comprehensive study , 2009, Image Vis. Comput..

[35]  Qingshan Liu,et al.  Facial Expression Recognition using Encoded Dynamic Features , 2007, ICME.

[36]  S. Baron-Cohen,et al.  Using Assistive Technology to Teach Emotion Recognition to Students With Asperger Syndrome , 2007 .

[37]  Trevor Hastie,et al.  Multi-class AdaBoost ∗ , 2009 .

[38]  Philip H. S. Torr,et al.  Struck: Structured output tracking with kernels , 2011, ICCV.

[39]  Luc Van Gool,et al.  Unsupervised face alignment by robust nonrigid mapping , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[40]  Massimiliano Pontil,et al.  Multilinear Multitask Learning , 2013, ICML.

[41]  Michael J. Lyons,et al.  Coding facial expressions with Gabor wavelets , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[42]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[43]  Marco La Cascia,et al.  Using Hankel matrices for dynamics-based facial emotion recognition and pain detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[44]  Matti Pietikäinen,et al.  Robust Facial Expression Recognition Using Revised Canonical Correlation , 2014, 2014 22nd International Conference on Pattern Recognition.

[45]  Yannan Zhao,et al.  FEATURES EXTRACTION USING A GABOR FILTER FAMILY , 2004 .

[46]  Beat Fasel,et al.  Automati Fa ial Expression Analysis: A Survey , 1999 .

[47]  Maja Pantic,et al.  Continuous Pain Intensity Estimation from Facial Expressions , 2012, ISVC.

[48]  Sridha Sridharan,et al.  Sparse Temporal Representations for Facial Expression Recognition , 2011, PSIVT.

[49]  Tsuhan Chen,et al.  The painful face - Pain expression recognition using active appearance models , 2009, Image Vis. Comput..

[50]  Maja Pantic,et al.  Facial landmarking for in-the-wild images with local inference based on global appearance , 2015, Image Vis. Comput..

[51]  Maja Pantic,et al.  Fully Automatic Facial Action Unit Detection and Temporal Analysis , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[52]  Jeffrey F. Cohn,et al.  Painful data: The UNBC-McMaster shoulder pain expression archive database , 2011, Face and Gesture 2011.

[53]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[54]  Marco La Cascia,et al.  Tracking your Detector Performance - How to Grow an Effective Training Set in Tracking-by-Detection Methods , 2015, VISAPP.

[55]  Binlong Li,et al.  Cross-view activity recognition using Hankelets , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[56]  Yixin Chen,et al.  MILES: Multiple-Instance Learning via Embedded Instance Selection , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57]  Rama Chellappa,et al.  Compressive Acquisition of Dynamic Scenes , 2010, ECCV.

[58]  Gwen Littlewort,et al.  Dynamics of Facial Expression Extracted Automatically from Video , 2004, CVPR Workshops.

[59]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[60]  Sridha Sridharan,et al.  Improved facial expression recognition via uni-hyperplane classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[61]  Shaogang Gong,et al.  Dynamic Facial Expression Recognition Using A Bayesian Temporal Manifold Model , 2006, BMVC.

[62]  Zhihong Zeng,et al.  A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions , 2009, IEEE Trans. Pattern Anal. Mach. Intell..

[63]  Tomoko Matsui,et al.  A Kernel for Time Series Based on Global Alignments , 2006, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[64]  Sridha Sridharan,et al.  Improving pain recognition through better utilisation of temporal information , 2008, AVSP.

[65]  Zakia Hammal,et al.  Pain monitoring: A dynamic and context-sensitive system , 2012, Pattern Recognit..

[66]  Aleix M. Martínez,et al.  Recognizing expression variant faces from a single sample image per class , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[67]  Andrea Cavallaro,et al.  Automatic Analysis of Facial Affect: A Survey of Registration, Representation, and Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[68]  Takeo Kanade,et al.  Automated facial expression recognition based on FACS action units , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[69]  B. Michaelis,et al.  Facial expression recognition based on Haar-like feature detection , 2008, Pattern Recognition and Image Analysis.

[70]  Sridha Sridharan,et al.  Person-independent facial expression detection using Constrained Local Models , 2011, Face and Gesture 2011.

[71]  Marco Cuturi,et al.  Fast Global Alignment Kernels , 2011, ICML.

[72]  Nicolás García-Pedrajas,et al.  Boosting random subspace method , 2008, Neural Networks.

[73]  Allen R. Hanson,et al.  Feature Selection Using Adaboost for Face Expression Recognition , 2005 .

[74]  Marian Stewart Bartlett,et al.  Classification and weakly supervised pain localization using multiple segment representation , 2014, Image Vis. Comput..

[75]  Marco La Cascia,et al.  Gesture Modeling by Hanklet-Based Hidden Markov Model , 2014, ACCV.

[76]  Marco La Cascia,et al.  Hankelet-based dynamical systems modeling for 3D action recognition , 2015, Image Vis. Comput..

[77]  N. Ranganathan,et al.  Gabor filter-based edge detection , 1992, Pattern Recognit..

[78]  Taghi M. Khoshgoftaar,et al.  RUSBoost: Improving classification performance when training data is skewed , 2008, 2008 19th International Conference on Pattern Recognition.

[79]  Bo Wu,et al.  Real time facial expression recognition with AdaBoost , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[80]  Shaogang Gong,et al.  Robust facial expression recognition using local binary patterns , 2005, IEEE International Conference on Image Processing 2005.

[81]  Oksam Chae,et al.  Local Directional Number Pattern for Face Analysis: Face and Expression Recognition , 2013, IEEE Transactions on Image Processing.

[82]  Takeo Kanade,et al.  Facial Expression Recognition , 2011, Handbook of Face Recognition.

[83]  Qiang Ji,et al.  Capturing Complex Spatio-temporal Relations among Facial Muscles for Facial Expression Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[84]  Takeo Kanade,et al.  Emotional Expression Classification Using Time-Series Kernels , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[85]  Takeo Kanade,et al.  The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[86]  Jeffrey F. Cohn,et al.  Automatic detection of pain intensity , 2012, ICMI '12.

[87]  Shaogang Gong,et al.  Conditional Mutual Infomation Based Boosting for Facial Expression Recognition , 2005, BMVC.

[88]  Erik G. Learned-Miller,et al.  Unsupervised Joint Alignment of Complex Images , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[89]  Vladimir Pavlovic,et al.  Automatic Pain Intensity Estimation with Heteroscedastic Conditional Ordinal Random Fields , 2013, ISVC.

[90]  Brian Scassellati,et al.  Integrating socially assistive robotics into mental healthcare interventions: applications and recommendations for expanded use. , 2015, Clinical Psychology Review.

[91]  Qiang Ji,et al.  A generative restricted Boltzmann machine based method for high-dimensional motion data modeling , 2015, Comput. Vis. Image Underst..

[92]  Sridha Sridharan,et al.  Least squares congealing for unsupervised alignment of images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[93]  Marian Stewart Bartlett,et al.  Face image analysis by unsupervised learning , 2001 .

[94]  Massimiliano Pontil,et al.  Exploiting Unrelated Tasks in Multi-Task Learning , 2012, AISTATS.