Use of Vowels in Discriminating Speech-Laugh from Laughter and Neutral Speech

In natural conversations, significant part of laughter co-occurs with speech which is referred to as speech-laugh. Hence, speech-laugh will have characteristics of both laughter and neutral speech. But it is not clearly evident how acoustic properties of neutral speech are influenced by its co-occurring laughter. The objective of this study is to analyze the acoustic variations between vowel regions of laughter, speech-laugh and neutral speech. The features based on excitation source characteristics extracted at epochs are considered in this study. Features extracted in the vowel regions of speech-laugh exhibit deviations from that of laughter and neutral speech. These deviations in feature values are exploited to discriminate speech-laugh from laughter and neutral speech. Two different datasets consisting of conversational speech and meeting recordings are used in this analysis. Experimental results show that the discrimination between the three classes obtained by considering vowel regions is better than that of considering the complete utterance.

[1]  J. Bachorowski,et al.  The acoustic features of human laughter. , 2001, The Journal of the Acoustical Society of America.

[2]  Norihiro Hagita,et al.  Analysis of laughter events in real science classes by using multiple environment sensor data , 2014, INTERSPEECH.

[3]  Ge Wang,et al.  Laughter modulation: from speech to speech-laugh , 2013, INTERSPEECH.

[4]  Jean Carletta,et al.  The AMI meeting corpus , 2005 .

[5]  J. Trouvain Phonetic Aspects of "Speech-Laughs" , 2001 .

[6]  Bayya Yegnanarayana,et al.  Analysis of production characteristics of laughter , 2015, Comput. Speech Lang..

[7]  Jürgen Trouvain,et al.  Investigating prosodic relations between initiating and responding laughs , 2014, INTERSPEECH.

[8]  Yosuke Igarashi,et al.  The speech laugh spectrum , 2006 .

[9]  Bayya Yegnanarayana,et al.  Analysis of laugh signals for detecting in continuous speech , 2009, INTERSPEECH.

[10]  Daniel P. W. Ellis,et al.  Laughter Detection in Meetings , 2004 .

[11]  Malcolm Slaney,et al.  Characteristic contours of syllabic-level units in laughter , 2013, INTERSPEECH.

[12]  A. Fogel,et al.  The integration of laughter and speech in vocal communication: a dynamic systems perspective. , 1999, Journal of speech, language, and hearing research : JSLHR.

[13]  John R. Hershey,et al.  Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[14]  Bayya Yegnanarayana,et al.  Analysis of laughter and speech-laugh signals using excitation source information , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Bayya Yegnanarayana,et al.  Characterization of Glottal Activity From Speech Signals , 2009, IEEE Signal Processing Letters.

[16]  Bayya Yegnanarayana,et al.  Study of changes in glottal vibration characteristics during laughter , 2014, INTERSPEECH.

[17]  Khiet P. Truong,et al.  Detection of nonverbal vocalizations using Gaussian mixture models: looking for fillers and laughter in conversational speech , 2013, INTERSPEECH.