Translating from Original to Simplified Sentences using Moses: When does it Actually Work?

In recent years, several studies have approached the Text Simplification (TS) task as a machine translation (MT) problem. They report promising results in learning how to translate from ‘original’ to ‘simplified’ language using the standard phrasebased translation model. However, our results indicate that this approach works well only when the training dataset consists mostly of those sentence pairs in which the simplified sentence is already very similar to its original. Our findings suggest that the standard phrase-based approach might not be appropriate to learn strong simplifications which are needed for certain target populations.

[1]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.

[2]  Sanja Stajner,et al.  Making It Simplext , 2015, ACM Trans. Access. Comput..

[3]  Advaith Siddharthan,et al.  Hybrid text simplification using synchronous dependency grammars with hand-written and automatically harvested rules , 2014, EACL.

[4]  Mirella Lapata,et al.  Learning to Simplify Sentences with Quasi-Synchronous Grammar and Integer Programming , 2011, EMNLP.

[5]  David Kauchak,et al.  Simple English Wikipedia: A New Text Simplification Task , 2011, ACL.

[6]  Lucia Specia,et al.  Building a Brazilian Portuguese Parallel Corpus of Original and Simplified Texts , 2009 .

[7]  Sanja Štajner,et al.  New data-driven approaches to text simplification , 2015 .

[8]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[9]  Caroline Gasperin,et al.  Fostering Digital Inclusion and Accessibility: The PorSimples project for Simplification of Portuguese Texts , 2010, NAACL.

[10]  Gregg C. Vanderheiden,et al.  Web Content Accessibility Guidelines (WCAG) 2.0 , 2008 .

[11]  Sanja Stajner,et al.  A Deeper Exploration of the Standard PB-SMT Approach to Text Simplification and its Evaluation , 2015, ACL.

[12]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[13]  Lucia Specia,et al.  Learning When to Simplify Sentences for Natural Text Simplification , 2009 .

[14]  Sanja Stajner,et al.  Translating sentences from 'original' to 'simplified' Spanish , 2014, Proces. del Leng. Natural.

[15]  Delphine Bernhard,et al.  Syntactic Sentence Simplification for French , 2014, PITR@EACL.

[16]  Lucia Specia Translating from Complex to Simplified Sentences , 2010, PROPOR.

[17]  David Kauchak,et al.  Learning to Simplify Sentences Using Wikipedia , 2011, Monolingual@ACL.

[18]  Iryna Gurevych,et al.  A Monolingual Tree-based Translation Model for Sentence Simplification , 2010, COLING.

[19]  Horacio Saggion,et al.  Corpus-based Sentence Deletion and Split Decisions for Spanish Text Simplification , 2013 .