Fast Joint Compression and Summarization via Graph Cuts

Extractive summarization typically uses sentences as summarization units. In contrast, joint compression and summarization can use smaller units such as words and phrases, resulting in summaries that contain more information. The goal of compressive summarization is to find a subset of words that maximizes the total score of concepts and of cutting dependency arcs under grammar constraints and a summary length constraint. We propose an efficient decoding algorithm for fast compressive summarization using graph cuts. Our approach first relaxes the length constraint using Lagrangian relaxation. We then bound the relaxed objective function by a supermodular binary quadratic programming problem, which can be solved efficiently using graph max-flow/min-cut. Since finding the tightest lower bound suffers from local optimality, we use convex relaxation for initialization. Experimental results on the TAC 2008 dataset demonstrate that our method achieves competitive ROUGE scores and good readability while being much faster than the integer linear programming (ILP) method.
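The two ingredients named in the abstract, Lagrangian relaxation of the length constraint and a supermodular binary quadratic program solved by min-cut, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`maximize_supermodular_qp`, `compress_with_budget`, `brute_force_max`), the plain Edmonds-Karp max-flow, and the bisection search over the multiplier are all assumptions chosen for illustration, and the sketch omits the grammar constraints, the bounding step, and the convex-relaxation initialization.

```python
from collections import deque
from itertools import product


def objective(x, unary, pair):
    """f(x) = sum_i unary[i]*x_i + sum_(i,j) pair[(i,j)]*x_i*x_j."""
    return (sum(u * xi for u, xi in zip(unary, x))
            + sum(b * x[i] * x[j] for (i, j), b in pair.items()))


def maximize_supermodular_qp(unary, pair):
    """Maximize f over x in {0,1}^n with an s-t min-cut.

    Assumes every pairwise weight is >= 0 (supermodularity) and i != j.
    Nodes left on the source side after max-flow get x_i = 1.
    """
    n = len(unary)
    s, t = n, n + 1
    cap = [[0.0] * (n + 2) for _ in range(n + 2)]
    # Minimize E = -f, using -b*x_i*x_j = -b*x_i + b*x_i*(1 - x_j).
    cost = [-u for u in unary]
    for (i, j), b in pair.items():
        assert b >= 0, "pairwise weights must be non-negative"
        cost[i] -= b
        cap[i][j] += b          # cut when x_i = 1 and x_j = 0
    for i, c in enumerate(cost):
        if c >= 0:
            cap[i][t] += c      # paid when x_i = 1
        else:
            cap[s][i] += -c     # paid when x_i = 0
    eps = 1e-12
    while True:                 # Edmonds-Karp: augment along shortest paths
        parent = {s: s}
        queue = deque([s])
        while queue and t not in parent:
            u = queue.popleft()
            for v in range(n + 2):
                if v not in parent and cap[u][v] > eps:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            break               # parent now holds the source side of a min cut
        flow, v = float("inf"), t
        while v != s:
            flow = min(flow, cap[parent[v]][v])
            v = parent[v]
        v = t
        while v != s:
            u = parent[v]
            cap[u][v] -= flow
            cap[v][u] += flow
            v = u
    return [1 if i in parent else 0 for i in range(n)]


def compress_with_budget(score, pair, length, budget, iters=50):
    """Heuristic: bisect on the multiplier lam of the relaxed length constraint.

    Assumes all lengths are positive and budget >= 0. Returns a feasible
    selection, which is not guaranteed to be optimal for the constrained problem.
    """
    def solve(lam):
        x = maximize_supermodular_qp(
            [a - lam * l for a, l in zip(score, length)], pair)
        return x, sum(l * xi for l, xi in zip(length, x))

    x, used = solve(0.0)
    if used <= budget:
        return x
    lo, hi = 0.0, 1.0
    while solve(hi)[1] > budget:    # grow lam until the selection fits
        hi *= 2
    for _ in range(iters):
        mid = (lo + hi) / 2
        if solve(mid)[1] > budget:
            lo = mid
        else:
            hi = mid
    return solve(hi)[0]


def brute_force_max(unary, pair):
    """Exhaustive check of the unconstrained maximum, for tiny n only."""
    n = len(unary)
    return max(objective(list(b), unary, pair) for b in product([0, 1], repeat=n))
```

Here `score[i]` stands in for a word's score, `pair[(i, j)]` for a non-negative bonus for keeping words `i` and `j` together, and `length[i]` for the word's contribution to summary length. For a fixed multiplier the relaxed problem has only unary and non-negative pairwise terms, so a single min-cut solves it exactly, and `brute_force_max` can be used to confirm this on small instances.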
