A system for feature classification of emotions based on speech analysis; applications to human-robot interaction

A system for recognition of emotions based on speech analysis can have interesting applications in human robot interaction. Robot should make a proper mutual communication between sound recognition and perception for creating a desired emotional interaction with humans. Advanced research in this field will be based on sound analysis and recognition of emotions in spontaneous dialog. In this paper, we report the results obtained from an exploratory study on a methodology to automatically recognize and classify basic emotional states. The study attempted to investigate the appropriateness of using acoustic and phonetic properties of emotive speech with the minimal use of signal processing algorithms. The efficiency of the methodology was evaluated by experimental tests on adult European speakers. The speakers had to repeat six simple sentences in English language in order to emphasize features of the pitch (peak, value and range), the intensity of the speech, the formants and the speech rate. The proposed methodology using the freeware program (PRAAT) and consists of generating and analyzing a graph of pitch, formant and intensity of speech signals for classify basic emotion. Eventually, the proposed model provided successful recognition of the basic emotion in most of the cases.

[1]  Zhihong Zeng,et al.  A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions , 2009, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Li Zhang,et al.  Affect Sensing Using Linguistic, Semantic and Cognitive Cues in Multi-threaded Improvisational Dialogue , 2012, Cognitive Computation.

[3]  Christin Kirchhübel,et al.  Detecting suspicious behaviour using speech: acoustic correlates of deceptive speech -- an exploratory investigation. , 2013, Applied ergonomics.

[4]  Lukás Burget,et al.  Application of speaker- and language identification state-of-the-art techniques for emotion recognition , 2011, Speech Commun..

[5]  Shashidhar G. Koolagudi,et al.  Recognition of Emotions from Speech using Excitation Source Features , 2012 .

[6]  Shrikanth S. Narayanan,et al.  Primitives-based evaluation and estimation of emotions in speech , 2007, Speech Commun..

[7]  Björn W. Schuller,et al.  Paralinguistics in speech and language - State-of-the-art and the challenge , 2013, Comput. Speech Lang..

[8]  Constantine Kotropoulos,et al.  Emotional speech recognition: Resources, features, and methods , 2006, Speech Commun..

[9]  A. Tanju Erdem,et al.  Formant position based weighted spectral features for emotion recognition , 2011, Speech Commun..

[10]  Tim Polzehl,et al.  Anger recognition in speech using acoustic and linguistic cues , 2011, Speech Commun..

[11]  Andreas Wendemuth,et al.  Modeling phonetic pattern variability in favor of the creation of robust emotion classifiers for real-life applications , 2014, Comput. Speech Lang..

[12]  Ragini Verma,et al.  Class-level spectral features for emotion recognition , 2010, Speech Commun..

[14]  Rosalind W. Picard,et al.  Recognizing affect from speech prosody using hierarchical graphical models , 2011, Speech Commun..

[15]  Carlos Busso,et al.  Shape-based modeling of the fundamental frequency contour for emotion detection in speech , 2014, Comput. Speech Lang..

[16]  Bin Yang,et al.  Emotion recognition from speech signals using new harmony features , 2010, Signal Process..

[17]  Carlos A. Reyes-García,et al.  Acoustic feature selection and classification of emotions in speech using a 3D continuous emotion model , 2012 .

[18]  Tim Polzehl,et al.  Comparing Features for Acoustic Anger Classification in German and English IVR Portals , 2009 .

[19]  Nivja H. Jong,et al.  Praat script to detect syllable nuclei and measure speech rate automatically , 2009, Behavior research methods.

[20]  Fakhri Karray,et al.  Survey on speech emotion recognition: Features, classification schemes, and databases , 2011, Pattern Recognit..

[21]  Hiok Chai Quek,et al.  Cultural dependency analysis for understanding speech emotion , 2012, Expert Syst. Appl..