Automatic Decision Detection in Meeting Speech

Decision making is an important aspect of meetings in organisational settings, and archives of meeting recordings constitute a valuable source of information about the decisions made. However, standard utilities such as playback and keyword search are not sufficient for locating decision points from meeting archives. In this paper, we present the AMI DecisionDetector, a system that automatically detects and highlights where the decision-related conversations are. In this paper, we apply the models developed in our previous work [1], which detects decision-related dialogue acts (DAs) from parts of the transcripts that have been manually annotated as extract-worthy, to the task of detecting decision-related DAs and topic segments directly from complete transcripts. Results show that we need to combine features extracted from multiple knowledge sources (e.g., lexical, prosodic, DA-related, and topical class) in order to yield the model with the highest precision. We have provided a quantitative account of the feature class effects. As our ultimate goal is to operate AMI DecisionDetector in a fully automatic fashion, we also investigate the impacts of using automatically generated features, for example, the 5-class DA features obtained in [2].

[1]  Alexander I. Rudnicky,et al.  You Are What You Say: Using Meeting Participants’ Speech to Detect their Roles and Expertise , 2006, HLT-NAACL 2006.

[2]  Johanna D. Moore,et al.  Combining Multiple Knowledge Sources for Dialogue Segmentation in Multimedia Archives , 2007, ACL.

[3]  Stanley Peters,et al.  Ontology-Based Discourse Understanding for a Persistent Meeting Assistant , 2005, AAAI Spring Symposium: Persistent Assistants: Living and Working with AI.

[4]  Mari Ostendorf,et al.  Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data , 2003, NAACL.

[5]  Steve Renals,et al.  DBN Based Joint Dialogue Act Recognition of Multiparty Meetings , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[6]  Elizabeth Shriberg,et al.  Spotting "hot spots" in meetings: human judgments and prosodic cues , 2003, INTERSPEECH.

[7]  Violeta Seretan,et al.  User Requirements Analysis for Meeting Information Retrieval Based on Query Elicitation , 2007, ACL.

[8]  Anton Nijholt,et al.  Addressee Identification in Face-to-Face Meetings , 2006, EACL.

[9]  V. Pallotta Collaborative and Argumentative Models of Meeting Discussions , 2005 .

[10]  Andreas Stolcke,et al.  Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing , 2004 .

[11]  Elizabeth Shriberg,et al.  Relationship between dialogue acts and hot spots in meetings , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[12]  Jay F. Nunamaker,et al.  Meeting analysis: findings from research and practice , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[13]  Samy Bengio,et al.  Semi-supervised adapted HMMs for unusual event detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Johanna D. Moore,et al.  What Decisions Have You Made: Automatic Decision Detection in Conversational Speech , 2007 .

[15]  Andrei Popescu-Belis,et al.  Machine Learning for Multimodal Interaction , 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers , 2008, MLMI.

[16]  Steve Whittaker,et al.  Analysing Meeting Records: An Ethnographic Study and Technological Implications , 2005, MLMI.

[17]  Dirk Heylen,et al.  Argument Diagramming of Meeting Conversations , 2005 .

[18]  Samy Bengio,et al.  Automatic analysis of multimodal group actions in meetings , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Diane J. Litman,et al.  Recognizing student emotions and attitudes on the basis of utterances in spoken tutoring dialogues with both human and computer tutors , 2006, Speech Commun..

[20]  Wilfried Post,et al.  A research environment for meeting behavior , 2004 .

[21]  Julia Hirschberg,et al.  Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies , 2004, ACL.

[22]  Matthew Purver,et al.  Shallow Discourse Structure for Action Item Detection , 2006, HLT-NAACL 2006.

[23]  Andreas Stolcke,et al.  Combining Prosodic Lexical and Cepstral Systems for Deceptive Speech Detection , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[24]  Carolyn Penstein Rosé,et al.  The Necessity of a Meeting Recording and Playback System, and the Benefit of Topic-Level Annotations to Meeting Browsing , 2005, INTERACT.

[25]  Samy Bengio,et al.  Detecting group interest-level in meetings , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[26]  Jean Carletta,et al.  The AMI Meeting Corpus: A Pre-announcement , 2005, MLMI.

[27]  Maite Taboada,et al.  Prosodic Correlates of Rhetorical Relations , 2006, HLT-NAACL 2006.