Amused speech components analysis and classification: Towards an amusement arousal level assessment system

Abstract In this paper, we present our work on analysis and classification of smiled vowels, chuckling (or shaking) vowels and laughter syllables. This work is part of a larger framework that aims at assessing the level of amusement in speech using the audio modality only. Indeed all of these three categories occur in amused speech and are considered to contribute in the expression of different levels of amusement. We first analyze these three amused speech components on the acoustic level. Then, we improve a classification system we previously developed. With a limited amount of data and features, we are able to obtain good classification results with different systems. Among the compared systems, the best one achieved 82.8% of accuracy, therefore outperforming chance.

[1]  Alan Costall,et al.  The vocal communication of different kinds of smile , 2008, Speech Commun..

[2]  J. Trouvain Phonetic Aspects of "Speech-Laughs" , 2001 .

[3]  Mireia Farrús,et al.  Using jitter and shimmer in speaker verification , 2009 .

[4]  Gerald M. Knapp,et al.  Affect Intensity Estimation Using Multiple Modalities , 2014, FLAIRS.

[5]  Roland Göcke,et al.  Group expression intensity estimation in videos via Gaussian Processes , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[6]  Xi Li,et al.  Stress and Emotion Classification using Jitter and Shimmer Features , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7]  Peter Robinson,et al.  Speech Emotion Classification and Public Speaking Skill Assessment , 2010, HBU.

[8]  Yoshiko Arimoto,et al.  An Estimation Method of Degree of Speaker's Anger Emotion with Acoustic and Linguistic Features , 2007 .

[9]  P. Ekman,et al.  The expressive pattern of laughter , 2001 .

[10]  Björn W. Schuller,et al.  Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.

[11]  Thierry Dutoit,et al.  Towards a level assessment system of amusement in speech signals: Amused speech components classification , 2015, 2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[12]  Thierry Dutoit,et al.  Shaking and speech-smile vowels classification: An attempt at amusement arousal estimation from speech signals , 2015, 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[13]  Kevin El Haddad,et al.  TOWARDS A SPEECH SYNTHESIS SYSTEM WITH CONTROLLABLE AMUSEMENT LEVELS , 2015 .

[14]  V. Tartter Happy talk: Perceptual and acoustic effects of smiling on speech , 1980, Perception & psychophysics.

[15]  Dong Yu,et al.  Speech emotion recognition using deep neural network and extreme learning machine , 2014, INTERSPEECH.

[16]  Enes Yuncu,et al.  Automatic Speech Emotion Recognition Using Auditory Models with Binary Decision Tree and SVM , 2014, 2014 22nd International Conference on Pattern Recognition.