Hybrid Simplification using Deep Semantics and Machine Translation

We present a hybrid approach to sentence simplification which combines deep semantics and monolingual machine translation to derive simple sentences from complex ones. The approach differs from previous work in two main ways. First, it is semantic based in that it takes as input a deep semantic representation rather than e.g., a sentence or a parse tree. Second, it combines a simplification model for splitting and deletion with a monolingual translation model for phrase substitution and reordering. When compared against current state of the art methods, our model yields significantly simpler output that is both grammatical and meaning preserving.

[1]  Anja Belz,et al.  LG-Eval: A Toolkit for Creating Online Language Evaluation Experiments , 2012, LREC.

[2]  Emiel Krahmer,et al.  Sentence Simplification by Monolingual Machine Translation , 2012, ACL.

[3]  Horacio Saggion,et al.  Text Simplification Tools for Spanish , 2012, LREC.

[4]  Advaith Siddharthan,et al.  Text Simplification using Typed Dependencies: A Comparision of the Robustness of Different Generation Strategies , 2011, ENLG.

[5]  Mirella Lapata,et al.  Learning to Simplify Sentences with Quasi-Synchronous Grammar and Integer Programming , 2011, EMNLP.

[6]  David Kauchak,et al.  Learning to Simplify Sentences Using Wikipedia , 2011, Monolingual@ACL.

[7]  Iryna Gurevych,et al.  A Monolingual Tree-based Translation Model for Sentence Simplification , 2010, COLING.

[8]  Advaith Siddharthan,et al.  Complex Lexico-syntactic Reformulation of Sentences Using Typed Dependency Representations , 2010, INLG.

[9]  Renata Pontin de Mattos Fortes,et al.  Facilita: reading assistance for low-literacy readers , 2009, SIGDOC '09.

[10]  Michael Strube,et al.  Dependency Tree Based Sentence Compression , 2008, INLG.

[11]  Daphne Koller,et al.  Sentence Simplification for Semantic Role Labeling , 2008, ACL.

[12]  H. Kamp A Theory of Truth and Semantic Representation , 2008 .

[13]  Johan Bos,et al.  Linguistically Motivated Large-Scale NLP with C&C and Boxer , 2007, ACL.

[14]  David A. Smith,et al.  Quasi-Synchronous Grammars: Alignment by Soft Projection of Syntactic Dependencies , 2006, WMT@HLT-NAACL.

[15]  Advaith Siddharthan,et al.  Syntactic Simplification and Text Cohesion , 2006 .

[16]  Chris Callison-Burch,et al.  Paraphrasing with Bilingual Parallel Corpora , 2005, ACL.

[17]  Ani Nenkova,et al.  Syntactic Simplification for Improving Content Selection in Multi-Document Summarization , 2004, COLING.

[18]  R. Barzilay,et al.  Sentence Alignment for Monolingual Comparable Corpora , 2003, EMNLP.

[19]  Advaith Siddharthan,et al.  An architecture for a text simplification system , 2002, Language Engineering Conference, 2002. Proceedings.

[20]  Kevin Knight,et al.  A Syntax-based Statistical Translation Model , 2001, ACL.

[21]  Daniel Marcu,et al.  Statistics-Based Summarization - Step One: Sentence Compression , 2000, AAAI/IAAI.

[22]  R. Chandrasekar,et al.  Automatic induction of rules for text simplification , 1997, Knowl. Based Syst..

[23]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[24]  Yvonne Margaret Canning,et al.  Syntactic simplification of text , 2002 .

[25]  Siobhan Devlin,et al.  Simplifying Text for Language-Impaired Readers , 1999, EACL.

[26]  Mark Dras,et al.  Tree adjoining grammar and the reluctant paraphrasing of text , 1999 .

[27]  cationR. Chandrasekar Automatic Induction of Rules for Text Simpli cation , 1997 .

[28]  S. Bangalore,et al.  Motivations and Methods for Text Simplification , 1996, COLING.