A hybrid approach of NN and HMM for facial emotion classification

Neural networks (NNs) are often combined with Hidden Markov Models (HMMs) in speech recognition for achieving superior performance. In this paper, this hybrid approach is employed in facial emotion classification. Gabor wavelets are employed to extract features from difference images obtained by subtracting the first frame showing a frontal face from the current frame. The NN, which takes the form of Multilayer perceptron (MLP), is used to classify the feature vector into different states of a HMM of a certain emotion sequence, i.e., neutral, intermediate and peak. In addition to using 1-0 as targets for the NN, a heuristic strategy of assigning variable targets 1-x-0 has also been applied. After training, we interpret the output values of the NN as the posterior of the HMM state and directly apply the Viterbi algorithm to these values to estimate the best state path. The experiments show that with variable targets for the NN, the HMM gives better results than that with 1-0 targets. The best HMM results are obtained for x = 0.8 in 1-x-0.

[1]  Kiyoharu Aizawa,et al.  Detection and Tracking of Facial Features by Using Edge Pixel Counting and Deformable Circular Template Matching , 1995, IEICE Trans. Inf. Syst..

[2]  Tsutomu Miyasato,et al.  Emotion Enhanced Face to Face Meetings Using the Concept of Virtual Space Teleconferencing (Special Issue on Multimedia Computing and Communications) , 1996 .

[3]  M. Rosenblum,et al.  Human emotion recognition from motion using a radial basis function network architecture , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[4]  Tsutomu Miyasato,et al.  Use of Multimodal Information in Facial Emotion Recognition , 1998 .

[5]  Hervé Bourlard,et al.  Neural networks for statistical recognition of continuous speech , 1995, Proc. IEEE.

[6]  Marian Stewart Bartlett,et al.  Classifying Facial Actions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Irfan A. Essa,et al.  Computers Seeing People , 1999, AI Mag..

[8]  J. Ohya,et al.  Recognition of facial expressions using HMM with continuous output probabilities , 1996, Proceedings 5th IEEE International Workshop on Robot and Human Communication. RO-MAN'96 TSUKUBA.

[9]  Zhengyou Zhang,et al.  Comparison between geometry-based and Gabor-wavelets-based facial expression recognition using multi-layer perceptron , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[10]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.