Effect of sensor fusion for recognition of emotional states using voice, face image and thermal image of face

A new integrated method is presented to recognize the emotional expressions of human using both voices and facial expressions. For voices, we use such prosodic parameters as pitch signals, energy, and their derivatives, which are trained by hidden Markov model for recognition. For facial expressions, we use feature parameters from thermal images in addition to visible images, which are trained by neural networks for recognition. The thermal images are observed by infrared ray which is not influenced by lighting conditions. The total recognition rates show better performance than that obtained from each single experiment. The results are compared with the recognition by human questionnaire.

[1]  Yasunari Yoshitomi,et al.  Facial expression recognition using thermal image processing and neural network , 1997, Proceedings 6th IEEE International Workshop on Robot and Human Communication. RO-MAN'97 SENDAI.

[2]  Yasunari Yoshitomi,et al.  A method for detecting transitions of emotional states using a thermal facial image based on a synthesis of facial expressions , 2000, Robotics Auton. Syst..

[3]  Yasunari Yoshitomi,et al.  Facial Expression Recognition Using Infrared Rays Image Processing , 1996 .

[4]  Alex Waibel,et al.  Prosody and speech recognition , 1988 .

[5]  Shigeyuki Tomita,et al.  Face identification using thermal image processing , 1997, Proceedings 6th IEEE International Workshop on Robot and Human Communication. RO-MAN'97 SENDAI.