Sentence Compression with Joint Structural Inference

Sentence compression techniques often assemble output sentences from fragments of lexical sequences, such as n-grams, or from units of syntactic structure, such as edges from a dependency tree representation. We present a novel approach for discriminative sentence compression that unifies these notions and jointly produces sequential and syntactic representations for output text, leveraging a compact integer linear programming formulation to maintain structural integrity. Our supervised models permit rich features over heterogeneous linguistic structures and generalize over previous state-of-the-art approaches. Experiments on corpora featuring human-generated compressions demonstrate a 13-15% relative gain in 4-gram accuracy over a well-studied language model-based compression system.
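To make the idea of a joint objective concrete, the following is a minimal sketch and not the formulation used in this work. It assumes a toy sentence with hand-picked scores, replaces the integer linear program with brute-force enumeration of token subsets, and simplifies structural integrity to the requirement that the retained tokens form a single connected subtree of the input dependency tree. The function name `compress` and all score tables are hypothetical.

```python
from itertools import combinations

def compress(tokens, heads, tok_score, bigram_score, edge_score, max_len):
    """Toy joint compression: brute-force search instead of an ILP (assumption).

    heads[i] is the index of token i's head in the input tree, or -1 for the root.
    """
    best, best_score = None, float("-inf")
    for k in range(1, max_len + 1):
        for keep in combinations(range(len(tokens)), k):
            kept = set(keep)
            # Simplified structural constraint: exactly one kept token may lack
            # a kept head, so the retained edges form a single tree.
            roots = [i for i in keep if heads[i] not in kept]
            if len(roots) != 1:
                continue
            # Token-level scores.
            score = sum(tok_score[i] for i in keep)
            # Sequential structure: bigrams of adjacent tokens in the output.
            score += sum(bigram_score.get((tokens[a], tokens[b]), 0.0)
                         for a, b in zip(keep, keep[1:]))
            # Syntactic structure: dependency edges whose head is also kept.
            score += sum(edge_score.get((tokens[heads[i]], tokens[i]), 0.0)
                         for i in keep if heads[i] in kept)
            if score > best_score:
                best, best_score = keep, score
    return [tokens[i] for i in best], best_score

# Hypothetical example sentence and scores, chosen only for illustration.
tokens = ["the", "old", "cat", "sat", "quietly"]
heads = [2, 2, 3, -1, 3]
tok_score = [0.5, -1.0, 1.0, 1.0, -0.5]
bigram_score = {("the", "cat"): 0.5, ("cat", "sat"): 1.0}
edge_score = {("sat", "cat"): 1.0, ("cat", "the"): 0.5}

compressed, score = compress(tokens, heads, tok_score, bigram_score, edge_score, max_len=3)
print(compressed, score)
```

The point of the sketch is that a single score combines evidence from the output token sequence and from the output dependency structure, so neither representation is chosen in isolation. Exhaustive enumeration is exponential in sentence length and serves only to illustrate the objective.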
