Nonlinear dynamics characterization of emotional speech

This paper proposes the application of complexity measures based on nonlinear dynamics for emotional speech characterization. Measures such as mutual information, dimension correlation, entropy correlation, Shannon entropy, Lempel-Ziv complexity and Hurst exponent are extracted from the samples of three databases of emotional speech. Then, statistics such as mean, standard deviation, skewness and kurtosis are applied on the extracted measures. Experiments were conducted on the Polish emotional speech database, on the Berlin emotional speech database and on the LCD emotional database for a three-class problem (neutral, fear and anger emotional states). A procedure for feature selection is proposed based on an affinity analysis of the features. This feature selection procedure is accomplished to select a reduced number of features over the Polish emotional database. Finally, the selected features are evaluated in the Berlin emotional speech database and in the LDC emotional database using a neural network classifier in order to assess the usefulness of the selected features. Global success rates of 72.28%, 75.4% and 80.75%, were obtained for the Polish emotional speech database, the Berlin emotional speech database and the LDC emotional speech database respectively.

[1]  H. Kantz,et al.  Nonlinear time series analysis , 1997 .

[2]  Farshad Almasganj,et al.  Pathological assessment of patients' speech signals using nonlinear dynamical analysis , 2010, Comput. Biol. Medicine.

[3]  Tiago H. Falk,et al.  Automatic speech emotion recognition using modulation spectral features , 2011, Speech Commun..

[4]  Björn W. Schuller,et al.  Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge , 2011, Speech Commun..

[5]  Wojciech Majewski,et al.  Polish Emotional Speech Database - Recording and Preliminary Validation , 2009, COST 2102 Conference.

[6]  Tim Polzehl,et al.  Detecting real life anger , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Abraham Lempel,et al.  On the Complexity of Finite Sequences , 1976, IEEE Trans. Inf. Theory.

[8]  Miguel Angel Ferrer-Ballester,et al.  Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  James Theiler,et al.  Lacunarity in a best estimator of fractal dimension , 1988 .

[10]  F. Almasganj,et al.  On the fractal self-similarity of laryngeal pathologies detection: The estimation of Hurst parameter , 2008, 2008 International Conference on Information Technology and Applications in Biomedicine.

[11]  Léon J. M. Rothkrantz,et al.  Automatic stress detection in emergency (telephone) calls , 2011, Int. J. Intell. Def. Support Syst..

[12]  David Ruelle,et al.  Deterministic chaos: the science and the fiction , 1995 .

[13]  Shrikanth S. Narayanan,et al.  Detecting emotional state of a child in a conversational computer game , 2011, Comput. Speech Lang..

[14]  Tiago H. Falk,et al.  Automatic recognition of speech emotion using long-term spectro-temporal features , 2009, 2009 16th International Conference on Digital Signal Processing.

[15]  D. Ruelle,et al.  The Claude Bernard Lecture, 1989 - Deterministic chaos: the science and the fiction , 1990, Proceedings of the Royal Society of London. A. Mathematical and Physical Sciences.

[16]  F. Takens Detecting strange attractors in turbulence , 1981 .

[17]  Tim Polzehl,et al.  Anger recognition in speech using acoustic and linguistic cues , 2011, Speech Commun..

[18]  H. Abarbanel,et al.  Determining embedding dimension for phase-space reconstruction using a geometrical construction. , 1992, Physical review. A, Atomic, molecular, and optical physics.

[19]  Miguel Angel Ferrer-Ballester,et al.  Application of Nonlinear Dynamics Characterization to Emotional Speech , 2011, NOLISP.

[20]  Fraser,et al.  Independent coordinates for strange attractors from mutual information. , 1986, Physical review. A, General physics.

[21]  Schuster,et al.  Easily calculable measure for the complexity of spatiotemporal patterns. , 1987, Physical review. A, General physics.

[22]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[23]  E. H. Lloyd,et al.  Long-Term Storage: An Experimental Study. , 1966 .

[24]  Sergios Theodoridis,et al.  A dimensional approach to emotion recognition of speech from movies , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.