Generating automated meeting summaries

The thesis at hand introduces a novel approach for the generation of abstractive summaries of meetings. While the automatic generation of document summaries has been studied for some decades now, the novelty of this thesis is mainly the application to the meeting domain (instead of text documents) as well as the use of a lexicalized representation formalism on the basis of Frame Semantics. This allows us to generate summaries abstractively (instead of extractively). The thesis begins with an overall motivation of the research domain, and a description of the central research questions. After that, the notion of a “summary” is discussed in general, and different dimensions of summaries are compared, before we give a broad overview over related work in the field of automatic summarization. Then, we introduce the necessary theories for this approach and the data sets used. Following that, we discuss the architecture of the MEESU system which has been developed in the course of this work, as well as the theory and implementation of the contained components. The system has been evaluated using a novel extrinsic evaluation approach which is detailed next. The thesis concludes with a summary and a discussion of possible points for future work.

[1]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[2]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[3]  Tomek Strzalkowski,et al.  A Robust Practical Text Summarization , 1998 .

[4]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[5]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[6]  Norbert Reithinger,et al.  Summarizing Multilingual Spoken Negotiation Dialogues , 2000, ACL.

[7]  Robert Dale,et al.  Building Natural Language Generation Systems (Studies in Natural Language Processing) , 2006 .

[8]  Tilman Becker,et al.  Combining Multiple Information Layers for the Automatic Generation of Indicative Meeting Abstracts , 2007, ENLG.

[9]  Kathleen McKeown,et al.  Generating Concise Natural Language Summaries , 1995, Inf. Process. Manag..

[10]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[11]  Elizabeth Shriberg,et al.  Spotting "hot spots" in meetings: human judgments and prosodic cues , 2003, INTERSPEECH.

[12]  Harold Borko,et al.  Abstracting Concepts and Methods , 1975 .

[13]  Barry Z. Posner,et al.  The Project Manager , 1998 .

[14]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[15]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[16]  Christian Husodo-Schulz,et al.  Exploring Features and Classifiers for Dialogue Act Segmentation , 2008, MLMI.

[17]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[18]  Ion Androutsopoulos,et al.  A Survey of Paraphrasing and Textual Entailment Methods , 2009, J. Artif. Intell. Res..

[19]  Noah A. Smith,et al.  Semi-Supervised Frame-Semantic Parsing for Unknown Predicates , 2011, ACL.

[20]  Sandro Castronovo,et al.  A Generic Layout-Tool for Summaries of Meetings in a Constraint-Based Approach , 2008, MLMI.

[21]  Beatrice Santorini Part-of-speech tagging guidelines for the penn treebank project , 1990 .

[22]  Roger Levy,et al.  Tregex and Tsurgeon: tools for querying and manipulating tree data structures , 2006, LREC.

[23]  Jan Alexandersson,et al.  Overlay: The Basic Operation for Discourse Processing , 2006, SmartKom.

[24]  Douglas A. Reynolds,et al.  Measuring the readability of automatic speech-to-text transcripts , 2003, INTERSPEECH.

[25]  Roger C. Schank,et al.  Conceptual dependency: A theory of natural language understanding , 1972 .

[26]  Wolfgang Finkler Automatische Selbstkorrektur bei der inkrementellen Generierung gesprochener Sprache unter Realzeitbedingungen - ein empirisch-simulativer Ansatz unter Verwendung eines Begründungsverwaltungssystems , 1997, DISKI.

[27]  Wolfgang Wahlster,et al.  Verbmobil: Foundations of Speech-to-Speech Translation , 2000, Artificial Intelligence.

[28]  Gerald DeJong,et al.  Prediction and Substantiation: A New Approach to Natural Language Processing , 1979, Cogn. Sci..

[29]  Robin Valenza SUMMARISATION OF SPOKEN AUDIO THROUGH INFORMATION EXTRACTION , 1999 .

[30]  Chin-Yew Lin,et al.  Robust automated topic identification , 1997 .

[31]  Jun-ichi Fukumoto,et al.  Automated Summarization Evaluation with Basic Elements. , 2006, LREC.

[32]  M. Sanderson Book Reviews: Advances in Automatic Text Summarization , 2000, Computational Linguistics.

[33]  Richard Johansson,et al.  LTH: Semantic Structure Extraction using Nonprojective Dependency Trees , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[34]  Jan Alexandersson,et al.  Hybrid discourse modeling and summarization for a speech-to-speech translation system , 2003 .

[35]  Jason Eisner,et al.  Lexical Semantics , 2020, The Handbook of English Linguistics.

[36]  Johanna D. Moore,et al.  Evaluating Automatic Summaries of Meeting Recordings , 2005, IEEvaluation@ACL.

[37]  Michael Halliday,et al.  Cohesion in English , 1976 .

[38]  Giuseppe Carenini,et al.  Interpretation and Transformation for Abstracting Conversations , 2010, HLT-NAACL.

[39]  Mark T. Maybury,et al.  Generating Summaries from Event Data , 1995, Inf. Process. Manag..

[40]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[41]  Elisabeth Schriberg,et al.  Preliminaries to a Theory of Speech Disfluencies , 1994 .

[42]  M. Banerjee,et al.  Beyond kappa: A review of interrater agreement measures , 1999 .

[43]  Steve Whittaker,et al.  Temporal Compression Of Speech: An Evaluation , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[44]  Gerd Herzog,et al.  VIsual TRAnslator: Linking perceptions and natural language descriptions , 1994, Artificial Intelligence Review.

[45]  Simone Teufel,et al.  Examining the consensus between human summaries: initial experiments with factoid analysis , 2003, HLT-NAACL 2003.

[46]  Gabriel Murray,et al.  Using Speech-Specific Characteristics for Automatic Speech Summarization , 2008 .

[47]  Katrin Erk,et al.  The SALSA Corpus: a German Corpus Resource for Lexical Semantics , 2006, LREC.

[48]  Richard I. Kittredge,et al.  Using natural-language processing to produce weather forecasts , 1994, IEEE Expert.

[49]  Ani Nenkova,et al.  Entity-driven Rewrite for Multi-document Summarization , 2008, IJCNLP.

[50]  Ani Nenkova,et al.  Evaluating Content Selection in Summarization: The Pyramid Method , 2004, NAACL.

[51]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[52]  Mark Liberman,et al.  Transcriber: Development and use of a tool for assisting speech corpora production , 2001, Speech Commun..

[53]  Miriam R. L. Petruck FRAME SEMANTICS , 1996 .

[54]  W. Kintsch,et al.  Strategies of discourse comprehension , 1983 .

[55]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[56]  Ken Litkowski,et al.  The Preposition Project , 2021, ArXiv.

[57]  Andreas Stolcke,et al.  The ICSI Meeting Project: Resources and Research , 2004 .

[58]  Beatrice Santorini,et al.  Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd Revision) , 1990 .

[59]  Christian A. Müller,et al.  Automatic recognition of speakers' age and gender on the basis of empirical studies , 2006, INTERSPEECH.

[60]  Norbert Reithinger,et al.  Insights into the Dialogue Processing of VERBMOBIL , 1997, ANLP.

[61]  Collin F. Baker,et al.  A Frames Approach to Semantic Analysis , 2009 .

[62]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[63]  Miriam R. L. Petruck,et al.  Surprise: Spanish FrameNet! , 2003 .

[64]  Steve Renals,et al.  DBN Based Joint Dialogue Act Recognition of Multiparty Meetings , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[65]  Josef Ruppenhofer,et al.  FrameNet II: Extended theory and practice , 2006 .

[66]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[67]  Thomas Kleinbauer,et al.  ARKTiS - A Fast Tag Recommender System Based On Heuristics , 2009, DC@PKDD/ECML.

[68]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[69]  Wolfgang Wahlster,et al.  SmartKom: Foundations of Multimodal Dialogue Systems , 2006, SmartKom.

[70]  Andreas Stolcke,et al.  The ICSI Meeting Corpus , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[71]  Steve Renals,et al.  Term-Weighting for Summarization of Multi-party Spoken Dialogues , 2007, MLMI.

[72]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[73]  Andreas Stolcke,et al.  Switchboard Discourse Language Modeling Project (Final Report) , 1997 .

[74]  Andrei Popescu-Belis,et al.  Abstracting a Dialogue Act Tagset forMeeting Processing , 2004 .

[75]  Brigitte Endres-Niggemeyer,et al.  Summarizing information , 1998 .

[76]  Peter Poller,et al.  Extrinsic summarization evaluation: A decision audit task , 2008, TSLP.

[77]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[78]  Eduard H. Hovy,et al.  Summarization Evaluation Using Transformed Basic Elements , 2008, TAC.

[79]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies , 2000, ArXiv.

[80]  Christopher R. Johnson,et al.  Background to Framenet , 2003 .

[81]  E. Morris Giving a presentation. , 2005, British journal of hospital medicine.

[82]  Gerhard Rigoll,et al.  Action Recognition in Meeting Scenarios using Global Motion Features , 2003 .

[83]  Noah A. Smith,et al.  Probabilistic Frame-Semantic Parsing , 2010, NAACL.

[84]  Ingrid Zukerman,et al.  A Probabilistic Approach to the Interpretation of Spoken Utterances , 2008, PRICAI.

[85]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[86]  Ani Nenkova,et al.  Automatic Summarization , 2011, ACL.

[87]  Arthur Vogelsang What to Say , 1978 .

[88]  Lukás Burget,et al.  The AMI System for the Transcription of Speech in Meetings , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[89]  Eduard Hovy,et al.  Evaluating DUC 2005 using Basic Elements , 2005 .

[90]  Csr Young,et al.  How to Do Things With Words , 2009 .

[91]  Alon Lavie,et al.  Automatic Summarization of Spoken Dialogues in Unrestricted Domains , 2012 .

[92]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[93]  Alex Waibel,et al.  MEETING BROWSER: TRACKING AND SUMMARIZING MEETINGS , 2007 .

[94]  Elizabeth Shriberg,et al.  The ICSI Meeting Recorder Dialog Act (MRDA) Corpus , 2004, SIGDIAL Workshop.

[95]  Stanley Peters,et al.  Detecting and Summarizing Action Items in Multi-Party Dialogue , 2007, SIGDIAL.

[96]  Ehud Reiter,et al.  Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[97]  Elizabeth Shriberg,et al.  Automatic dialog act segmentation and classification in multiparty meetings , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[98]  Jun'ichi Tsujii,et al.  Comparative Parser Performance Analysis across Grammar Frameworks through Automatic Tree Conversion using Synchronous Grammars , 2008, COLING.

[99]  Daniel Marcu,et al.  The rhetorical parsing, summarization, and generation of natural language texts , 1998 .

[100]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[101]  Jan Alexandersson,et al.  Amigram-a general-purpose tool for multimodal corpus annotation , 2005 .

[102]  Mark T. Maybury,et al.  Automatic Summarization , 2002, Computational Linguistics.

[103]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[104]  Nicola Guarino,et al.  WonderWeb Deliverable D18 Ontology Library , 2003 .

[105]  Glenn Shafer,et al.  Perspectives on the theory and practice of belief functions , 1990, Int. J. Approx. Reason..

[106]  Andreas Stolcke,et al.  Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? , 1998, Language and speech.

[107]  Virginia A. Lingle,et al.  Indexing and Abstracting in Theory and Practice , 2005 .

[108]  C. Fillmore FRAME SEMANTICS AND THE NATURE OF LANGUAGE * , 1976 .

[109]  Andreas Stolcke,et al.  The Meeting Project at ICSI , 2001, HLT.

[110]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[111]  Jáchym Kolář,et al.  Automatic Segmentation of Speech into Sentence-like Units , 2008 .

[112]  Amanda Stent,et al.  Rhetorical Structure in Dialog , 2000, INLG.

[113]  Helen R. Tibbo The art of abstracting , 1997 .

[114]  Ralf Engel SPIN : A Semantic Parser for Spoken Dialog Systems , 2006 .

[115]  Daniel Jurafsky,et al.  Shallow Semantic Parsing using Support Vector Machines , 2004, NAACL.

[116]  C. Bazzanella,et al.  On Context and Dialogue , 1998 .

[117]  Harry Bunt,et al.  Context and Dialogue Control , 1994 .

[118]  Gustave J. Rath,et al.  The formation of abstracts by the selection of sentences , 1961 .

[119]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[120]  Norbert Reithinger,et al.  Dialogue act classification using language models , 1997, EUROSPEECH.

[121]  Roger C. Schank,et al.  SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .

[122]  Alexander H. Waibel,et al.  Minimizing Word Error Rate in Textual Summaries of Spoken Language , 2000, ANLP.

[123]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[124]  Jan Alexandersson,et al.  A new Metric for the Evaluation of Dialog Act Classification ∗ , 2005 .

[125]  Inge M. R. De Bleecker Towards an Optimal Lexicalization in a Natural-Sounding Portable Natural Language Generator for Dialog Systems , 2005, ACL.

[126]  Katrin Erk,et al.  HALMANESER – A Toolchain For Shallow Semantic Parsing , 2006 .

[127]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[128]  May How to Say , 2005 .

[129]  Marc Moens,et al.  Argumentative Classification of Extracted Sentences as a First Step Towards Flexible Abstracting , 1999 .

[130]  David A. van Leeuwen,et al.  The 2007 AMI(DA) System for Meeting Transcription , 2007, CLEAR.

[131]  Björn W. Schuller,et al.  Suspicious Behavior Detection in Public Transport by Fusion of Low-Level Video Descriptors , 2007, 2007 IEEE International Conference on Multimedia and Expo.