A New Fuzzy Cognitive Map Learning Algorithm for Speech Emotion Recognition

Selecting an appropriate recognition method is crucial in speech emotion recognition. However, current methods do not consider the relationships between emotions. This study therefore constructs a speech emotion recognition system based on the fuzzy cognitive map (FCM) approach and proposes a new FCM learning algorithm for the task. The algorithm uses the pleasure-arousal-dominance (PAD) emotion scale to calculate the weights between emotions and applies mathematical derivations to determine the network structure. It can handle a large number of concepts, whereas a typical FCM can handle only relatively simple networks (maps). Different acoustic features, including fundamental speech features and a new spectral feature, are extracted to evaluate the performance of the proposed method. Three experiments are conducted: a single-feature experiment, a feature-combination experiment, and a comparison between the proposed algorithm and typical networks. All experiments are performed on the TYUT2.0 and EMO-DB databases. The feature-combination experiments show that the recognition rates of combined features are 10%–20% higher than those of single features. The proposed FCM learning algorithm yields a 5%–20% performance improvement over traditional classification networks.
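
The idea of deriving inter-emotion weights from the pleasure-arousal-dominance scale and then running an FCM over them can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper: the PAD coordinates below are placeholder values, the distance-to-weight mapping in `pad_weight` is an assumed choice made only for illustration, and the update rule is the commonly used sigmoid FCM iteration with self-memory. All function names are hypothetical.

```python
import math

# Illustrative PAD (pleasure, arousal, dominance) coordinates in [-1, 1].
# Placeholder numbers for this sketch only; they are NOT taken from the paper.
PAD = {
    "happy": (0.8, 0.5, 0.4),
    "angry": (-0.5, 0.6, 0.3),
    "sad": (-0.6, -0.3, -0.3),
}

# Largest possible distance inside the PAD cube [-1, 1]^3.
MAX_DIST = math.dist((-1, -1, -1), (1, 1, 1))


def pad_weight(a, b):
    """Assumed mapping: PAD distance 0 -> weight +1, maximal distance -> -1."""
    return 1.0 - 2.0 * math.dist(a, b) / MAX_DIST


def build_weights(pad):
    """Return concept names and a weight matrix with a zero diagonal."""
    names = list(pad)
    weights = [
        [0.0 if i == j else pad_weight(pad[a], pad[b])
         for j, b in enumerate(names)]
        for i, a in enumerate(names)
    ]
    return names, weights


def sigmoid(x, lam=1.0):
    """Squash an activation into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-lam * x))


def fcm_step(state, weights):
    """One FCM update: A_i <- f(A_i + sum_j w[j][i] * A_j)."""
    n = len(state)
    return [
        sigmoid(state[i] + sum(weights[j][i] * state[j] for j in range(n)))
        for i in range(n)
    ]


def run_fcm(state, weights, steps=50, tol=1e-6):
    """Iterate until the state stops changing or the step budget is spent."""
    for _ in range(steps):
        new_state = fcm_step(state, weights)
        if max(abs(x - y) for x, y in zip(new_state, state)) < tol:
            return new_state
        state = new_state
    return state


names, W = build_weights(PAD)
# Initial activations would come from an acoustic front end; here they are
# hypothetical scores for (happy, angry, sad).
final_state = run_fcm([0.7, 0.2, 0.1], W)
predicted = names[final_state.index(max(final_state))]
```

In this sketch the concept with the highest final activation is taken as the predicted emotion. How the paper's learning algorithm actually sets the weights and the network structure is not specified in the abstract.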
