Recognition of stance strength and polarity in spontaneous speech

From activities as simple as scheduling a meeting to those as complex as balancing a national budget, people take stances in negotiations and decision making. While the related areas of subjectivity and sentiment analysis have received significant attention, work has focused almost exclusively on text, and much stance-taking activity is carried out verbally. This paper investigates automatic recognition of stance-taking in spontaneous speech. It first presents a new annotated corpus of spontaneous, conversational speech designed to elicit high densities of stance-taking at different strengths. Speaker spurts are annotated both for strength of stance-taking behavior and polarity of stance. Based on this annotated corpus, we develop classifiers for automatic recognition of stance-taking behavior in speech. We employ a range of lexical, speaking style, and prosodic features in a boosting framework. The classifiers achieve strong accuracies on both binary detection of stance and four-way recognition of stance strength, well above most common class assignment. Finally, we classify the polarity of stance-taking spurts, obtaining accuracies around 80%. The best classifiers rely primarily on word unigram features, with speaking style and prosodic features yielding lower accuracies but still well above common class assignment.

[1]  Theresa Wilson,et al.  Comparing word, character, and phoneme n-grams for subjective utterance recognition , 2008, INTERSPEECH.

[2]  Giuseppe Carenini,et al.  Detecting subjectivity in multiparty speech , 2009, INTERSPEECH.

[3]  Mari Ostendorf,et al.  Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data , 2003, NAACL.

[4]  Mark Liberman,et al.  Speaker identification on the SCOTUS corpus , 2008 .

[5]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[6]  Susanne Burger,et al.  The ISL meeting corpus: the impact of meeting type on speech style , 2002, INTERSPEECH.

[7]  Swapna Somasundaran,et al.  Manual Annotation of Opinion Categories in Meetings , 2006 .

[8]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Mari Ostendorf,et al.  Manipulating stance and involvement using collaborative tasks: an exploratory comparison , 2014, INTERSPEECH.

[10]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[11]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[12]  Swapna Somasundaran,et al.  Recognizing Stances in Ideological On-Line Debates , 2010, HLT-NAACL 2010.

[13]  Pentti Haddington,et al.  Stance Taking in News Interviews , 2004 .

[14]  Valerie Freeman,et al.  Hyperarticulation as a signal of stance , 2014, J. Phonetics.

[15]  Swapna Somasundaran,et al.  Recognizing Stances in Online Debates , 2009, ACL.

[16]  Matt Thomas,et al.  Get out the vote: Determining support or opposition from Congressional floor-debate transcripts , 2006, EMNLP.

[17]  Theresa Wilson,et al.  Multimodal Subjectivity Analysis of Multiparty Conversation , 2008, EMNLP.

[18]  Andreas Stolcke,et al.  The Meeting Project at ICSI , 2001, HLT.

[19]  Sanjeev Khudanpur,et al.  A pitch extraction algorithm tuned for automatic speech recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  Jean Carletta,et al.  The AMI meeting corpus , 2005 .

[21]  Theresa Wilson,et al.  Annotating Subjective Content in Meetings , 2008, LREC.

[22]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[23]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[24]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .

[25]  D. Biber,et al.  Longman Grammar of Spoken and Written English , 1999 .

[26]  Valerie D. Freeman,et al.  Using acoustic measures of hyperarticulation to quantify novelty and evaluation in a corpus of political talk shows , 2010 .