A Multiple Classifier System Approach for Facial Expressions in Image Sequences Utilizing GMM Supervectors

The Gaussian mixture model (GMM) super vector approach is a well known technique in the domain of speech processing, e.g. speaker verification and audio segmentation. In this paper we apply this approach to video data in order to recognize human facial expressions. Three different image feature types (optical ???ow histograms, orientation histograms and principal components) from four pre-selected regions of the human’s face image were extracted and GMM super-vectors of the feature channels per sequence were constructed. Support vector machines (SVM) were trained using these super vectors for every channel separately and its results were combined using classifier fusion techniques. Thus, the performance of the classifier could be improved compared to the best individual classifier.

[1]  William T. Freeman,et al.  Orientation Histograms for Hand Gesture Recognition , 1995 .

[2]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[3]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[4]  Takeo Kanade,et al.  Comprehensive database for facial expression analysis , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[5]  James C. Bezdek,et al.  Decision templates for multiple classifier fusion: an experimental comparison , 2001, Pattern Recognit..

[6]  Heiko Neumann,et al.  Disambiguating Visual Motion Through Contextual Feedback Modulation , 2004, Neural Computation.

[7]  Douglas E. Sturim,et al.  Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.

[8]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[10]  Wei Wu,et al.  GMM Supervector Based SVM with Spectral Features for Speech Emotion Recognition , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[11]  Enrique Muñoz,et al.  Recognising facial expressions in video sequences , 2007, Pattern Analysis and Applications.

[12]  Hagai Aronowitz,et al.  Segmental Modeling for Audio Segmentation , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[13]  Friedhelm Schwenker,et al.  Fuzzy-Input Fuzzy-Output One-Against-All Support Vector Machines , 2007, KES.