Finding emotionally involved speech using implicitly proximity-annotated laughter

Browsing through collections of audio recordings of conversation nominally relies on the processing of participants' lexical productions. The evolving verbal and non-verbal context of those productions, likely indicative of the degree of participant involvement, is often ignored. The present work explores the relevance of laughter to the retrieval of conversation intervals in which the speech of one or more participants is prosodically or pragmatically marked as involved. Experiments indicate that the relevance of laughter depends on its temporal distance to the laugher's speech. The results suggest that in order to be pertinent to downstream emotion recognition applications, laughter detection systems must first and foremost detect that laughter which is most temporally proximate to the laugher's speech.

[1]  Kornel Laskowski,et al.  Analysis of the occurrence of laughter in meetings , 2007, INTERSPEECH.

[2]  Kornel Laskowski,et al.  Contrasting emotion-bearing laughter types in multiparticipant vocal activity detection for meetings , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Nikki Mirghafori,et al.  Getting the last laugh: automatic laughter segmentation in meetings , 2008, INTERSPEECH.

[4]  Kornel Laskowski Modeling vocal interaction for text-independent detection of involvement hotspots in multi-party meetings , 2008, 2008 IEEE Spoken Language Technology Workshop.

[5]  Elizabeth Shriberg,et al.  The ICSI Meeting Recorder Dialog Act (MRDA) Corpus , 2004, SIGDIAL Workshop.

[6]  D. V. Leeuwen,et al.  Evaluating automatic laughter segmentation in meetings using acoustic and acoustic-phonetic features , 2007 .

[7]  Steve Whittaker,et al.  Accessing Multimodal Meeting Data: Systems, Problems and Possibilities , 2004, MLMI.

[8]  Andreas Stolcke,et al.  The ICSI Meeting Corpus , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9]  Elizabeth Shriberg,et al.  Spotting "hot spots" in meetings: human judgments and prosodic cues , 2003, INTERSPEECH.

[10]  S. Burger,et al.  On the Correlation between Perceptual and Contextual Aspects of Laughter in Meetings , 2007 .

[11]  A. Fogel,et al.  The integration of laughter and speech in vocal communication: a dynamic systems perspective. , 1999, Journal of speech, language, and hearing research : JSLHR.

[12]  Elizabeth Shriberg,et al.  Relationship between dialogue acts and hot spots in meetings , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).