Segmenting meetings into agenda items by extracting implicit supervision from human note-taking

Splitting a meeting into segments such that each segment contains the discussion of exactly one agenda item is useful for tasks such as retrieving and summarizing agenda item discussions. However, accurate topic segmentation of meetings is difficult. In this paper, we investigate the idea of acquiring implicit supervision from human meeting participants to solve the segmentation problem. Specifically, we have implemented and tested a note-taking interface that benefits users by helping them organize and retrieve their notes easily, and that also extracts a segmentation of the meeting from their note-taking behavior. We show that the segmentation so obtained achieves a Pk value of 0.212, which improves upon an unsupervised baseline by 45% relative and compares favorably with a current state-of-the-art algorithm. Most importantly, we achieve this performance without any features or algorithms in the classic sense.
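For readers unfamiliar with the Pk metric quoted above, the following is a minimal sketch of how it is commonly computed; it is not the evaluation code used in this work. Pk is an error rate (lower is better): a window of width k slides over the meeting, and each position counts as an error when the reference and the hypothesized segmentation disagree on whether the two window ends fall in the same segment. The function names, the per-utterance segment-id representation, and the default choice of k (half the mean reference segment length) are assumptions made for illustration.

```python
def pk(reference, hypothesis, k=None):
    """Pk segmentation error (lower is better).

    reference, hypothesis: equal-length lists giving a segment id for each
    unit (e.g. each utterance). Ids are assumed to be unique per contiguous
    segment. If k is None, it defaults to half the mean reference segment
    length, a common convention (an assumption here, not taken from the paper).
    """
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have the same length")
    n = len(reference)
    if k is None:
        # Count reference segments by counting id changes between neighbours.
        num_segments = 1 + sum(1 for a, b in zip(reference, reference[1:]) if a != b)
        k = max(1, round(n / (2 * num_segments)))
    if k >= n:
        raise ValueError("window size k must be smaller than the sequence length")

    errors = 0
    for i in range(n - k):
        same_in_ref = reference[i] == reference[i + k]
        same_in_hyp = hypothesis[i] == hypothesis[i + k]
        if same_in_ref != same_in_hyp:
            errors += 1
    return errors / (n - k)


def relative_improvement(baseline_error, system_error):
    """Fraction by which the system reduces the baseline error."""
    return (baseline_error - system_error) / baseline_error


# Toy example (made-up data): two agenda items of four utterances each.
reference = [0, 0, 0, 0, 1, 1, 1, 1]
perfect = [0, 0, 0, 0, 1, 1, 1, 1]
no_boundaries = [0] * 8

pk_perfect = pk(reference, perfect)
pk_no_boundaries = pk(reference, no_boundaries)
```

Under this reading, "45% relative" means that `relative_improvement(baseline_pk, system_pk)` is about 0.45, where `baseline_pk` stands for the unsupervised baseline's score, which the abstract does not state.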
