Let me finish: automatic conflict detection using speaker overlap

The automated detection of conflict will be a crucial feature of emerging speech-analysis technologies, whether the purpose is to assuage conflict in online applications or simply to mark its location for corpus analysis. In this study, we examine the predictive potential of overlapping speech in determining conflict, and we find that this feature alone is strongly correlated with high conflict levels as rated by human judges. In analyzing the SSPNET debate corpus, we effect a 2.3% improvement over baseline accuracy using speaker overlap ratio as a predicted value, suggesting that this feature is a reliable proxy for conflict level. In a follow-up experiment, we analyze the patterns of predicted conflict in the beginning, middle and end of an audio clip. Our findings show that the beginning and final segments are more predictive than the middle, which indicates that a primacyrecency effect is bearing on the perception of conflict. Since the beginning segment itself can be quite predictive, we also show that accurate predictions can be made dynamically, allowing for real-time classification during live debates.

[1]  H. Simon,et al.  A theory of the serial position effect. , 1962, British journal of psychology.

[2]  Fabio Valente,et al.  The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism , 2013, INTERSPEECH.

[3]  Maja Pantic,et al.  Spotting agreement and disagreement: A survey of nonverbal audiovisual cues and tools , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[4]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[5]  Björn Schuller,et al.  Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[6]  Futoshi Asano,et al.  Detection of overlapping speech in meeting recordings using the modified exponential fitting test , 2007, 2007 15th European Signal Processing Conference.

[7]  Elizabeth Shriberg,et al.  Spotting "hot spots" in meetings: human judgments and prosodic cues , 2003, INTERSPEECH.

[8]  Fabio Valente,et al.  Automatic detection of conflict escalation in spoken conversations , 2012, INTERSPEECH.

[9]  Björn W. Schuller,et al.  Speech overlap detection using convolutive non-negative sparse coding: New improvements and insights , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[10]  Dilek Z. Hakkani-Tür,et al.  Using context to improve emotion detection in spoken dialog systems , 2005, INTERSPEECH.

[11]  Takeshi Yamada,et al.  Detection of Overlapping Speech in Meetings Using Support Vector Machines and Support Vector Regression , 2006, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..