Improving summarization through rhetorical parsing tuning

We study the relationship between the structure of" discourse and a set of summarization heuristics that are employed by current systems. A tight coupling of the two enables us to learn genre-specific combinations of heuristics that can be used for disambiguation during discourse parsing. The same coupling enables us to construct discourse structures that yield summaries that contain textual units that are not only important according to a variety of position-, title-, and lexical-similarity-based heuristics, but also central to the main claims of texts. A careful analysis of our results enables us to shed some new light on issues related to summary evaluation and learning.

[1]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[2]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[3]  Phyllis B. Baxendale,et al.  Machine-Made Index for Technical Literature - An Experiment , 1958, IBM J. Res. Dev..

[4]  Hector J. Levesque,et al.  A New Method for Solving Hard Satisfiability Problems , 1992, AAAI.

[5]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[6]  Seiji Miike,et al.  Abstract Generation Based on Rhetorical Structure Extraction , 1994, COLING.

[7]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[8]  Daniel Marcu The rhetorical parsing of natural language texts , 1997 .

[9]  Simone Teufel,et al.  Sentence extraction as a classification task , 1997 .

[10]  E. F. Skorochod'ko Adaptive Method of Automatic Abstracting and Indexing , 1971, IFIP Congress.

[11]  Eduard H. Hovy,et al.  Identifying Topics by Position , 1997, ANLP.

[12]  Daniel Marcu,et al.  The rhetorical parsing, summarization, and generation of natural language texts , 1998 .

[13]  James Allan,et al.  Selective text utilization and text traversal , 1993, Int. J. Hum. Comput. Stud..

[14]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[15]  Inderjeet Mani,et al.  Using Cohesion and Coherence Models for Text Summarization , 1998 .

[16]  李幼升,et al.  Ph , 1989 .

[17]  Inderjeet Mani,et al.  Machine Learning of Generic and User-Focused Summarization , 1998, AAAI/IAAI.

[18]  Michael Hoey,et al.  Patterns of Lexis In Text , 1991 .

[19]  Kathleen R. McKeown,et al.  Summarization Evaluation Methods: Experiments and Analysis , 1998 .

[20]  Chin-Yew Lin Assembly of Topic Extraction Modules in SUMMARIST , 1998 .

[21]  Daniel Marcu,et al.  From discourse structures to text summaries , 1997 .

[22]  Daniel Marcu,et al.  Building Up Rhetorical Structure Trees , 1996, AAAI/IAAI, Vol. 2.

[23]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .