Compressing Multi-document Summaries through Sentence Simplification

Multi-document summarization aims at creating a single summary based on the information conveyed by a collection of texts. After the candidate sentences have been identified and ordered, it is time to select which will be included in the summary. In this paper, we propose an approach that uses sentence simplification, both lexical and syntactic, to help improve the compression step in the summarization process. Simplification is performed by removing specific sentential constructions conveying information that can be considered to be less relevant to the general message of the summary. Thus, the rationale is that sentence simplification not only removes expendable information, but also makes room for further relevant data in a summary.

[1]  Katja Filippova,et al.  Multi-Sentence Compression: Finding Shortest Paths in Word Graphs , 2010, COLING.

[2]  Elena Lloret Pastor Text summarisation based on human language technologies and its applications , 2011 .

[3]  Kathleen McKeown,et al.  Cut and Paste Based Text Summarization , 2000, ANLP.

[4]  Ani Nenkova,et al.  Syntactic Simplification for Improving Content Selection in Multi-Document Summarization , 2004, COLING.

[5]  Jimmy J. Lin,et al.  Multi-candidate reduction: Sentence compression as a tool for document summarization tasks , 2007, Inf. Process. Manag..

[6]  Roger Levy,et al.  Tregex and Tsurgeon: tools for querying and manipulating tree data structures , 2006, LREC.

[7]  Maria das Graças Volpe Nunes,et al.  GistSumm: A Summarization Tool Based on a New Extractive Method , 2003, PROPOR.

[8]  Mirella Lapata,et al.  Sentence Compression as Tree Transduction , 2009, J. Artif. Intell. Res..

[9]  Hongyan Jing,et al.  Sentence Reduction for Automatic Text Summarization , 2000, ANLP.

[10]  António Branco,et al.  Out-of-the-Box Robust Parsing of Portuguese , 2010, PROPOR.

[11]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[12]  David Evans,et al.  Columbia University at DUC 2004 , 2004 .

[13]  António Branco,et al.  Combining a double clustering approach with sentence simplification to produce highly informative multi-document summaries , 2012, 2012 IEEE 13th International Conference on Information Reuse & Integration (IRI).

[14]  Emiel Krahmer,et al.  Sentence Simplification by Monolingual Machine Translation , 2012, ACL.

[15]  António Branco,et al.  Enhancing Multi-document Summaries with Sentence Simplification , 2012 .

[16]  Manabu Okumura,et al.  Sentence Compression with Semantic Role Constraints , 2012, ACL.

[17]  Raman Chandrasekar,et al.  Motivations and Methods for Text Simplification , 1996, COLING.