Automatically Detecting Action Items in Audio Meeting Recordings

Identifying action items in meeting recordings can provide immediate access to salient information in a medium that is notoriously difficult to search and summarize. To this end, we use a maximum entropy model to automatically detect action item-related utterances in multi-party audio meeting recordings, and we compare the effect of lexical, temporal, syntactic, semantic, and prosodic features on system performance. On a corpus of action item annotations over the ICSI meeting recordings, which is characterized by high class imbalance and low inter-annotator agreement, the system achieves an F measure of 31.92%. While this is low compared with better-studied tasks on more mature corpora, the relative contribution of the features to this task indicates how useful they are likely to be given more consistent annotations, as well as for related tasks.
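
The paper does not include an implementation here, but a minimal sketch of the kind of classifier it describes might look like the following, assuming scikit-learn and purely hypothetical per-utterance features (a conditional maximum entropy model is equivalent to multinomial logistic regression); this is an illustration under those assumptions, not the authors' system.

```python
# Minimal sketch (not the authors' implementation): a maximum entropy
# (multinomial logistic regression) classifier over per-utterance features.
# All feature names and example values below are hypothetical stand-ins for
# the lexical, temporal, syntactic, semantic, and prosodic feature classes
# compared in the paper.
from sklearn.feature_extraction import DictVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Each utterance is represented as a dictionary of sparse features.
train_feats = [
    {"unigram=please": 1, "unigram=send": 1, "pos=VB": 1, "f0_mean": 180.0},
    {"unigram=okay": 1, "duration_sec": 0.4, "f0_mean": 140.0},
]
train_labels = [1, 0]  # 1 = action-item-related utterance, 0 = other

# DictVectorizer maps feature dictionaries to vectors; LogisticRegression
# fits the maxent weights.
model = make_pipeline(DictVectorizer(), LogisticRegression(max_iter=1000))
model.fit(train_feats, train_labels)

test_feat = {"unigram=send": 1, "unigram=report": 1, "pos=VB": 1, "f0_mean": 175.0}
print(model.predict([test_feat]))  # predicted label for a new utterance
```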
