Query-Based Summarization using Rhetorical Structure Theory

Research on Question Answering is focused mainly on classifying the question type and finding the answer. Presenting the answer in a way that suits the user’s needs has received little attention. This paper shows how existing question answering systems—which aim at finding precise answers to questions—can be improved by exploiting summarization techniques to extract more than just the answer from the document in which the answer resides. This is done using a graph search algorithm which searches for relevant sentences in the discourse structure, which is represented as a graph. The Rhetorical Structure Theory (RST) is used to create a graph representation of a text document. The output is an extensive answer, which not only answers the question, but also gives the user an opportunity to assess the accuracy of the answer (is this what I am looking for?), and to find additional information that is related to the question, and which may satisfy an information need. This has been implemented in a working multimodal question answering system where it operates with two independently developed question answering modules.

[1]  Kalina Bontcheva,et al.  Robust Generic and Query-based Summarization , 2003, EACL.

[2]  William C. Mann,et al.  RHETORICAL STRUCTURE THEORY: A THEORY OF TEXT ORGANIZATION , 1987 .

[3]  Ellen M. Voorhees,et al.  Building a question answering test collection , 2000, SIGIR '00.

[4]  Daniel Marcu,et al.  From discourse structures to text summaries , 1997 .

[5]  Lynette Hirschman,et al.  Natural language question answering: the view from here , 2001, Natural Language Engineering.

[6]  Michael Halliday,et al.  Cohesion in English , 1976 .

[7]  Thomas Rist,et al.  Generating coherent presentations employing textual and visual material , 1995, Artificial Intelligence Review.

[8]  Elisabeth André,et al.  Ein planbasierter Ansatz zur Generierung multimedialer Präsentationen , 1995, DISKI.

[9]  Inderjeet Mani,et al.  Multi-Document Summarization by Graph Search and Matching , 1997, AAAI/IAAI.

[10]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[11]  Ellen M. Voorhees,et al.  Overview of the TREC 2002 Question Answering Track , 2003, TREC.

[12]  Jimmy J. Lin,et al.  What Makes a Good Answer? The Role of Context in Question Answering , 2003, INTERACT.

[13]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[14]  J. D. Ouden Prosodic realizations of text structure , 2004 .

[15]  G. Meade Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001 .

[16]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.

[17]  Inderjeet Mani,et al.  Using Cohesion and Coherence Models for Text Summarization , 1998 .

[18]  Daniel Marcu,et al.  Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001, SIGDIAL Workshop.

[19]  Yllias Chali Generic and Query-Based Text Summarization Using Lexical Cohesion , 2002, Canadian Conference on AI.

[20]  Daniel Marcu,et al.  To build text summaries of high quality, nuclearity is not sufficient , 1998 .

[21]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.