论文信息 - Automatic Recognition of Emotionally Coloured Speech

Automatic Recognition of Emotionally Coloured Speech

Emotion in speech is an issue that has been attracting the interest of the speech community for many years, both in the context of speech synthesis as well as in automatic speech recognition (ASR). In spite of the remarkable recent progress in Large Vocabulary Recognition (LVR), it is still far behind the ultimate goal of recognising free conversational speech uttered by any speaker in any environment. Current experimental tests prove that using state of the art large vocabulary recognition systems the error rate increases substantially when applied to spontaneous/emotional speech. This paper shows that recognition rate for emotionally coloured speech can be improved by using a language model based on increased representation of emotional utterances. Keywords—Statistical language model, N-grams, emotionally coloured speech

[1] Thomas Polzin,et al. Pronunciation Variations In Emotional Speech , 1998 .

[2] John H. L. Hansen,et al. Speech under stress conditions: overview of the effect on speech production and on system performance , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[3] Julia Hirschberg,et al. Predicting Automatic Speech Recognition Performance Using Prosodic Cues , 2000, ANLP.

[4] Multimodal data in action and interaction : a library of recordings and labelling schemes , 2004 .

[5] Roddy Cowie,et al. Describing the emotional states that are expressed in speech , 2003, Speech Commun..

[6] Roddy Cowie,et al. ASR for emotional speech: Clarifying the issues and enhancing performance , 2005, Neural Networks.

[7] Cynthia Whissell,et al. THE DICTIONARY OF AFFECT IN LANGUAGE , 1989 .

[8] K. Stevens,et al. Emotions and speech: some acoustical correlates. , 1972, The Journal of the Acoustical Society of America.

[9] K E Cummings,et al. Analysis of the glottal excitation of emotionally styled and stressed speech. , 1995, The Journal of the Acoustical Society of America.

[10] Roddy Cowie,et al. FEELTRACE: an instrument for recording perceived emotion in real time , 2000 .