Generating natural language summaries from multiple on-line sources

We present a methodology for summarization of news about current events in the form of briefings that include appropriate background (historical) information. The system that we developed, SUMMONS, uses the output of systems developed for the DARPA Message Understanding Conferences to generate summaries of multiple documents on the same or related events, presenting similarities and differences, contradictions, and generalizations among sources of information. We describe the various components of the system, showing how information from multiple articles is combined, organized into a paragraph, and finally, realized as English sentences. A feature of our work is the extraction of descriptions of entities such as people and places for reuse to enhance a briefing.

[1]  Kathleen McKeown,et al.  Tailoring Lexical Choice to the User's Vocabulary in Multimedia Explanation Generation , 1993, ACL.

[2]  Nina Wacholder,et al.  Disambiguation of Proper Names in Text , 1997, ANLP.

[3]  David D. McDonald Internal and External Evidence in the Identification and Semantic Categorization of Proper Names , 1993 .

[4]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[5]  Lisa F. Rau,et al.  Conceptual Information Extraction and Retrieval from Natural Language Input , 1997, RIAO.

[6]  James Pustejovsky,et al.  CRL/NMSU and Brandeis: description of the MucBruce system as used for MUC-4 , 1992, MUC.

[7]  Karen Kukich,et al.  Design of a Knowledge-Based Report Generator , 1983, ACL.

[8]  Dragomir R. Radev,et al.  Generating summaries of multiple news articles , 1995, SIGIR '95.

[9]  Brigitte Endres-Niggemeyer An empirical process model of abstracting , 1992, ISI.

[10]  Michael Elhadad,et al.  Using argumentation to control lexical choice: a functional unification implementation , 1993 .

[11]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[12]  Daniel Marcu,et al.  From discourse structures to text summaries , 1997 .

[13]  Sheryl R. Young,et al.  Automatic Classification and Summarization of Banking Telexes , 1985, CAIA.

[14]  Eduard H. Hovy,et al.  Aggregation in Natural Language Generation , 1993, EWNLG.

[15]  John Tait,et al.  Automatic summarising of English texts , 1982 .

[16]  Kathleen McKeown,et al.  Text generation: using discourse strategies and focus constraints to generate natural language text , 1985 .

[17]  Alfred V. Aho,et al.  Columbia Digital News System. An environment for briefing and search over multimedia information , 1997, Proceedings of ADL '97 Forum on Research and Technology. Advances in Digital Libraries.

[18]  Kathleen R. McKeown,et al.  Software Re-Use and Evolution in Text Generation Applications , 1997 .

[19]  Eduard H. Hovy,et al.  Automated Text Summarization and the SUMMARIST System , 1998, TIPSTER.

[20]  James Shaw Conciseness through Aggregation in Text Generation , 1995, ACL.

[21]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[22]  Chris Buckley,et al.  Automatic Text Summarization by Paragraph Extraction , 1997 .

[23]  Inderjeet Mani,et al.  Identifying Unknown Proper Names in Newswire Text , 1996 .

[24]  Michael Elhadad,et al.  FUF: the Universal Unifier User Manual Version 5.2 , 1991 .

[25]  Kathleen McKeown,et al.  Corpus Analysis for Revision-Based Generation of Complex Sentences , 1993, AAAI.

[26]  Herbert Gish,et al.  BBN: Description of the PLUM System as Used for MUC-5 , 2005, MUC.

[27]  James Shaw,et al.  Practical Issues in Automatic Documentation Generation , 1994, ANLP.

[28]  John D. Burger,et al.  MITRE-Bedford: description of the ALEMBIC system as used for MUC-4 , 1992, MUC.

[29]  Dragomir R. Radev,et al.  Building a Generation Knowledge Source using Internet-Accessible Newswire , 1997, ANLP.

[30]  Harold Borko,et al.  Abstracting Concepts and Methods , 1975 .

[31]  David Fisher,et al.  Description of the UMass system as used for MUC-6 , 1995, MUC.

[32]  Dragomir R. Radev An Architecture For Distributed Natural Language Summarization , 1996, INLG.

[33]  Chris D. Paice,et al.  Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[34]  Annely Rothkegel Abstracting from the Perspective of Text Production , 1995, Inf. Process. Manag..

[35]  Ii Gerald Francis Dejong Skimming stories in real time: an experiment in integrated understanding. , 1979 .

[36]  Alain Polguère,et al.  Bilingual Generation of Weather Forecasts in an Operations Environment , 1990, COLING.

[37]  Claire Cardie,et al.  UMass/Hughes: Description of the CIRCUS System Used for MUC-51 , 1993, MUC.

[38]  Kathleen R. McKeown,et al.  Summarization Evaluation Methods: Experiments and Analysis , 1998 .

[39]  Michael R. Genesereth,et al.  Software agents , 1994, CACM.

[40]  Steven K. Feiner,et al.  Automating the generation of coordinated multimedia explanations , 1991, Computer.

[41]  Inderjeet Mani,et al.  Multi-Document Summarization by Graph Search and Matching , 1997, AAAI/IAAI.

[42]  Eduard H. Hovy,et al.  Planning Coherent Multisentential Text , 1988, ACL.

[43]  Alain Polguère,et al.  Generation of Extended Bilingual Statistical Reports , 1992, COLING.

[44]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[45]  Kathleen McKeown,et al.  Generating Concise Natural Language Summaries , 1995, Inf. Process. Manag..

[46]  Kathleen McKeown,et al.  Empirically Designing and Evaluating a New Revision-Based Model for Summary Generation , 1996, Artif. Intell..

[47]  Lisa F. Rau,et al.  Automatic Condensation of Electronic Publications by Sentence Selection , 1995, Inf. Process. Manag..

[48]  Julian Kupiec,et al.  MURAX: a robust linguistic approach for question answering using an on-line encyclopedia , 1993, SIGIR.

[49]  Jacques Robin,et al.  Revision-based generation of natural language summaries providing historical background: corpus-based analysis, design, implementation and evaluation , 1995 .

[50]  Darrin Duford,et al.  crep: a regular expression-matching textual corpus tool , 1993 .

[51]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[52]  Karen Spärck Jones What Might be in a Summary? , 1993, Information Retrieval.

[53]  Elizabeth D. Liddy,et al.  Interpretation of Proper Nouns for Information Retrieval , 1993, HLT.

[54]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[55]  Michael Halliday,et al.  Cohesion in English , 1976 .

[56]  James Pustejovsky,et al.  Description-directed Natural Language Generation , 1985, IJCAI.

[57]  Udo Hahn,et al.  Topic parsing: Accounting for text macro structures in full-text analysis , 1990, Inf. Process. Manag..