Extending a Single-Document Summarizer to Multi-Document: a Hierarchical Approach

The increasing amount of online content motivated the development of multi-document summarization methods. In this work, we explore straightforward approaches to extend single-document summarization methods to multi-document summarization. The proposed methods are based on the hierarchical combination of single-document summaries, and achieves state of the art results.

[1]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[2]  Oren Etzioni,et al.  Towards Coherent Multi-Document Summarization , 2013, NAACL.

[3]  Scott Sanner,et al.  Diverse retrieval via greedy optimization of expected 1-call@k in a latent subtopic relevance model , 2011, CIKM '11.

[4]  Jade Goldstein-Stewart,et al.  The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries , 1998, SIGIR Forum.

[5]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[6]  Eduard H. Hovy,et al.  The Automated Acquisition of Topic Signatures for Text Summarization , 2000, COLING.

[7]  Bhiksha Raj,et al.  Privacy-Preserving Important Passage Retrieval , 2014, PIR@SIGIR.

[8]  Ricardo Ribeiro,et al.  Centrality-as-Relevance: Support Sets and Similarity as Geometric Proximity , 2011, J. Artif. Intell. Res..

[9]  David Evans,et al.  Tracking and summarizing news on a daily basis with Columbia's Newsblaster , 2002 .

[10]  Jaime G. Carbonell,et al.  Self reinforcement for important passage retrieval , 2013, SIGIR.

[11]  André F. T. Martins,et al.  Fast and Robust Compressive Summarization with Dual Decomposition and Multi-Task Learning , 2013, ACL.

[12]  Thorsten Joachims,et al.  Temporal corpus summarization using submodular word coverage , 2012, CIKM '12.

[13]  Jaime G. Carbonell,et al.  Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization , 2012, LREC.

[14]  Chris H. Q. Ding,et al.  Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization , 2008, SIGIR '08.

[15]  Scott Sanner,et al.  Probabilistic latent maximal marginal relevance , 2010, SIGIR '10.

[16]  Dilek Z. Hakkani-Tür,et al.  The ICSI Summarization System at TAC 2008 , 2008, TAC.

[17]  Hui Lin,et al.  Multi-document Summarization via Budgeted Maximization of Submodular Functions , 2010, NAACL.

[18]  Dragomir R. Radev,et al.  LexRank: Graph-based Centrality as Salience in Text Summarization , 2004 .

[19]  Scott Sanner,et al.  On the mathematical relationship between expected n-call@k and the relevance vs. diversity trade-off , 2012, SIGIR '12.

[20]  Jun Wang,et al.  Portfolio theory of information retrieval , 2009, SIGIR.

[21]  Vasileios Hatzivassiloglou,et al.  Event-Based Extractive Summarization , 2004 .

[22]  Dragomir R. Radev,et al.  NewsInEssence: summarizing online news topics , 2005, Commun. ACM.