Feature Subset Selection Based on Evolutionary Algorithms for Automatic Emotion Recognition in Spoken Spanish and Standard Basque Language

The study of emotions in human-computer interaction is a growing research area. Focusing on automatic emotion recognition, work is being performed in order to achieve good results particularly in speech and facial gesture recognition. In this paper we present a study performed to analyze different Machine Learning techniques validity in automatic speech emotion recognition area. Using a bilingual affective database, different speech parameters have been calculated for each audio recording. Then, several Machine Learning techniques have been applied to evaluate their usefulness in speech emotion recognition. In this particular case, techniques based on evolutive algorithms (EDA) have been used to select speech feature subsets that optimize automatic emotion recognition success rate. Achieved experimental results show a representative increase in the abovementioned success rate.

[1]  Elmar Nöth,et al.  Recognition of emotion in a realistic dialogue scenario , 2000, INTERSPEECH.

[2]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[3]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[4]  Léon J. M. Rothkrantz,et al.  Voice Stress Analysis , 2004, TSD.

[5]  Pierre Loonis,et al.  Combination, Cooperation And Selection Of Classifiers: A State Of The Art , 2003, Int. J. Pattern Recognit. Artif. Intell..

[6]  Petri Laukka Vocal Expression of Emotion Discrete-emotions and Dimensional Accounts , 2004 .

[7]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[8]  Xuejing Sun,et al.  Pitch determination and voice quality analysis using Subharmonic-to-Harmonic Ratio , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Frank Dellaert,et al.  Recognizing emotion in speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[10]  Tieniu Tan,et al.  Affective Computing: A Review , 2005, ACII.

[11]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[12]  P. Ekman Pictures of Facial Affect , 1976 .

[13]  Roddy Cowie,et al.  Emotion and brain: Understanding emotions and modelling their recognition , 2005, Neural Networks.

[14]  J. Bachorowski,et al.  Vocal Expression of Emotion: Acoustic Properties of Speech Are Associated With Emotional Intensity and Context , 1995 .

[15]  Ron Kohavi,et al.  Data Mining Using MLC a Machine Learning Library in C++ , 1996, Int. J. Artif. Intell. Tools.

[16]  Ángel Rodríguez Bravo,et al.  Modelización acústica de la expresión emocional en el español , 1999 .

[17]  Rosalind W. Picard,et al.  A computational model for the automatic recognition of affect in speech , 2004 .

[18]  Juan Manuel Montero-Martínez,et al.  Emotional speech synthesis: from speech database to TTS , 1998, ICSLP.

[19]  Marvin Minsky,et al.  Steps toward Artificial Intelligence , 1995, Proceedings of the IRE.

[20]  Rosalind W. Picard Affective Computing , 1997 .

[21]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[22]  Shrikanth Narayanan,et al.  Acoustic correlates of user response to error in human-computer dialogues , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[23]  K. Scherer,et al.  Vocal expression of emotion. , 2003 .

[24]  Thomas G. Dietterich,et al.  A study of distance-based machine learning algorithms , 1994 .

[25]  David E. Goldberg,et al.  A Survey of Optimization by Building and Using Probabilistic Models , 2002, Comput. Optim. Appl..

[26]  Pedro Larrañaga,et al.  Feature Subset Selection by Bayesian network-based optimization , 2000, Artif. Intell..

[27]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[28]  Inma Hernáez,et al.  Obtaining and Evaluating an Emotional Database for Prosody Modelling in Standard Basque , 2004, TSD.

[29]  J. Kent Martin,et al.  An Exact Probability Metric for Decision Tree Splitting and Stopping , 1997, Machine Learning.

[30]  Roddy Cowie,et al.  Beyond emotion archetypes: Databases for emotion modelling using neural networks , 2005, Neural Networks.

[31]  George N. Votsis,et al.  Emotion recognition in human-computer interaction , 2001, IEEE Signal Process. Mag..