Improving Update Summarization via Supervised ILP and Sentence Reranking

Integer Linear Programming (ILP) based summarization methods have been widely adopted recently because of their state-of-the-art performance. This paper proposes two new modifications in this framework for update summarization. Our key idea is to use discriminative models with a set of features to measure both the salience and the novelty of words and sentences. First, these features are used in a supervised model to predict the weights of the concepts used in the ILP model. Second, we generate preliminary sentence candidates in the ILP model and then rerank them using sentence level features. We evaluate our method on different TAC update summarization data sets, and the results show that our system performs competitively compared to the best TAC systems based on the ROUGE evaluation metric.

[1]  Brian Roark,et al.  Query-focused Supervised Sentence Ranking for Update Summaries , 2008, TAC.

[2]  Juan-Manuel Torres-Moreno,et al.  The LIA Update Summarization Systems at TAC 2008 (DRAFT) , 2008, TAC.

[3]  Furu Wei,et al.  PNR2: Ranking Sentences with Positive and Negative Reinforcement for Query-Oriented Update Summarization , 2008, COLING.

[4]  Dilek Z. Hakkani-Tür,et al.  The ICSI Summarization System at TAC 2008 , 2008, TAC.

[5]  Chew Lim Tan,et al.  Exploiting Category-Specific Information for Multi-Document Summarization , 2012, COLING.

[6]  Dilek Z. Hakkani-Tür,et al.  The ICSI/UTD Summarization System at TAC 2009 , 2009, TAC.

[7]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[8]  Xun Wang,et al.  Update Summarization using a Multi-level Hierarchical Dirichlet Process Model , 2012, COLING.

[9]  Dan Klein,et al.  Jointly Learning to Extract and Compress , 2011, ACL.

[10]  Praveen Bysani,et al.  Detecting Novelty in the context of Progressive Summarization , 2010, NAACL.

[11]  Lin Zhao,et al.  Improving Multi-documents Summarization by Sentence Compression based on Expanded Constituent Parse Trees , 2014, EMNLP.

[12]  Michael Collins,et al.  Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[13]  Noah A. Smith,et al.  Summarization with a Joint Model for Sentence Extraction and Compression , 2009, ILP 2009.

[14]  Enrique Alfonseca,et al.  DualSum: a Topic-Model based approach for update summarization , 2012, EACL.

[15]  Fei Liu,et al.  Document Summarization via Guided Sentence Compression , 2013, EMNLP.

[16]  Xiaojun Wan,et al.  Summarizing the differences in multilingual news , 2011, SIGIR.

[17]  Yang Liu,et al.  Using Supervised Bigram-based ILP for Extractive Summarization , 2013, ACL.

[18]  Mirella Lapata,et al.  Multiple Aspect Summarization Using Integer Linear Programming , 2012, EMNLP.

[19]  Xiaojun Wan Update Summarization Based on Co-Ranking with Constraints , 2012, COLING.