The Automated Estimation of Content-Terms for Query-Focused Multi-document Summarization

Query-focused multi-document summarization aims to produce a summary in response to a user query. We present an approach based on estimation of content-terms to address this task. In the process of estimating content-terms, we make full use of the relevant feature and the information richness feature for assigning importance to each of them. With summary content-terms being identified correctly, the candidate sentences are ranked and best sentences are selected to form the summary. Experiments on DUC 2005 and 2006 are performed and the ROUGH evaluation results show that the proposed approach is encouraging.

[1]  Ryen W. White,et al.  Finding relevant documents using top ranking sentences: an evaluation of two alternative schemes , 2002, SIGIR '02.

[2]  Jade Goldstein-Stewart,et al.  Summarizing text documents: sentence selection and evaluation metrics , 1999, SIGIR '99.

[3]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[4]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[5]  Inderjeet Mani,et al.  Multi-Document Summarization by Graph Search and Matching , 1997, AAAI/IAAI.

[6]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[7]  Sanda M. Harabagiu,et al.  Topic themes for multi-document summarization , 2005, SIGIR '05.

[8]  Atefeh Farzindar,et al.  CATS a topic-oriented multi-document summarization system at DUC 2005 , 2005 .

[9]  Xiaojun Wan,et al.  Improved Affinity Graph Based Multi-Document Summarization , 2006, NAACL.

[10]  Eduard Hovy,et al.  A BE-based Multi-document Summarizer with Query Interpretation , 2005 .

[11]  Dragomir R. Radev,et al.  LexPageRank: Prestige in Multi-Document Text Summarization , 2004, EMNLP.

[12]  Ani Nenkova,et al.  A compositional context sensitive multi-document summarizer: exploring the factors that influence summarization , 2006, SIGIR.

[13]  Andrew Hickl,et al.  LCC's GISTexter at DUC 2006: Multi-Strategy Multi-Document Summarization , 2006 .

[14]  Dianne P. O'Leary,et al.  Topic-Focused Multi-Document Summarization Using an Approximate Oracle Score , 2006, ACL.