论文信息 - RECOGNIZING EMOTIONS IN DIALOGUES WITH DISFLUENCIES AND NON-VERBAL VOCALISATIONS

RECOGNIZING EMOTIONS IN DIALOGUES WITH DISFLUENCIES AND NON-VERBAL VOCALISATIONS

We investigate the usefulness of DISfluencies and Non-verbal Vocalisations (DIS-NV) for recognizing human emotions in dialogues. The proposed features measure filled pauses, fillers, stutters, laughter, and breath in utterances. The predictiveness of DISNV features is compared with lexical features and state-of-the-art low-level acoustic features. Our experimental results show that using DIS-NV features alone is not as predictive as using lexical or acoustic features. However, adding them to lexical or acoustic feature set yields improvement compared to using lexical or acoustic features alone. This indicates that disfluencies and non-verbal vocalisations provide useful information overlooked by the other two types of features for emotion recognition.

Catherine Lai | Johanna D. Moore | Leimin Tian | Catherine Lai | Leimin Tian

[1] R. Lickley. Fluency and Disfluency , 2015 .

[2] Johanna D. Moore,et al. Word-Level Emotion Recognition Using High-Level Features , 2014, CICLing.

[3] Ian H. Witten,et al. The WEKA data mining software: an update , 2009, SKDD.

[4] Björn W. Schuller,et al. AVEC 2012: the continuous audio/visual emotion challenge , 2012, ICMI '12.

[5] P. Vuilleumier,et al. How brains beware: neural mechanisms of emotional attention , 2005, Trends in Cognitive Sciences.

[6] Björn Schuller,et al. Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[7] Carlos Busso,et al. IEMOCAP: interactive emotional dyadic motion capture database , 2008, Lang. Resour. Evaluation.

[8] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.