Simplification-induced transformations: typology and some characteristics

The purpose of automatic text simplification is to transform technical or difficult to understand texts into a more friendly version. The semantics must be preserved during this transformation. Automatic text simplification can be done at different levels (lexical, syntactic, semantic, stylistic...) and relies on the corresponding knowledge and resources (lexicon, rules...). Our objective is to propose methods and material for the creation of transformation rules from a small set of parallel sentences differentiated by their technicity. We also propose a typology of transformations and quantify them. We work with French-language data related to the medical domain, although we assume that the method can be exploited on texts in any language and from any domain.

[1]  Robert L. Goldstone,et al.  ' s personal copy Simplicity and generalization : Short-cutting abstraction in children ’ s object categorizations , 1997 .

[2]  Delphine Bernhard,et al.  Syntactic Sentence Simplification for French , 2014, PITR@EACL.

[3]  P. David Pearson,et al.  The effects of syntactic and lexical complexity on the comprehension of elementary science texts , 2011 .

[4]  Rico Sennrich,et al.  Improving Neural Machine Translation Models with Monolingual Data , 2015, ACL.

[5]  Jörg Tiedemann,et al.  Statistical Machine Translation with Readability Constraints , 2013, NODALIDA.

[6]  Pablo Gervás,et al.  Feasibility Analysis for SemiAutomatic Conversion of Text to Improve Readability , 2009, ICTA.

[7]  Mirella Lapata,et al.  Sentence Simplification with Deep Reinforcement Learning , 2017, EMNLP.

[8]  Marie-Francine Moens,et al.  Text simplification for children , 2010, SIGIR 2010.

[9]  Daniel Marcu,et al.  Text Simplification for Information-Seeking Applications , 2004, CoopIS/DOA/ODBASE.

[10]  Horacio Rodríguez,et al.  Paraphrase Concept and Typology. A Linguistically Based and Computationally Oriented Approach , 2011, Proces. del Leng. Natural.

[11]  Qian Yu,et al.  Revisiting sentence alignment algorithms for alignment visualization and evaluation , 2012 .

[12]  David West,et al.  UNC-CH at DUC 2007: Query Expansion, Lexical Simplification and Sentence Selection Strategies for Multi-Document Summarization , 2007 .

[13]  Chris Callison-Burch,et al.  Optimizing Statistical Machine Translation for Text Simplification , 2016, TACL.

[14]  Zhiyong Lu,et al.  SimConcept: a hybrid approach for simplifying composite named entities in biomedicine , 2014, BCB.

[15]  Ulrich Germann,et al.  Yawat: Yet Another Word Alignment Tool , 2008, ACL.

[16]  Sergiu Nisioi,et al.  Exploring Neural Text Simplification Models , 2017, ACL.

[17]  Goran Glavas,et al.  Simplifying Lexical Simplification: Do We Need Simplified Corpora? , 2015, ACL.

[18]  Daphne Koller,et al.  Sentence Simplification for Semantic Role Labeling , 2008, ACL.

[19]  David Kauchak,et al.  A user-study measuring the effects of lexical simplification and coherence enhancement on perceived and actual text difficulty , 2013, Int. J. Medical Informatics.

[20]  David N. Kennedy,et al.  Automatic Text Simplification for People with Intellectual Disabilities , 2017 .

[21]  Haifeng Wang,et al.  Leveraging Multiple MT Engines for Paraphrase Generation , 2010, COLING.

[22]  Raman Chandrasekar,et al.  Automatic induction of rules for text simplification , 1997, Knowl. Based Syst..

[23]  Lucia Specia,et al.  Benchmarking Lexical Simplification Systems , 2016, LREC.

[24]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[25]  Matthew Shardlow,et al.  A Survey of Automated Text Simplification , 2014 .

[26]  Ping Chen,et al.  Text Simplification Using Neural Machine Translation , 2016, AAAI.

[27]  Ping Chen,et al.  An Experimental Study of LSTM Encoder-Decoder Model for Text Simplification , 2016, ArXiv.

[28]  Emiel Krahmer,et al.  Sentence Simplification by Monolingual Machine Translation , 2012, ACL.

[29]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[30]  Eytan Adar,et al.  SimpleScience: Lexical Simplification of Scientific Terminology , 2016, EMNLP.

[31]  Horacio Saggion,et al.  Reporting simply: A lexical simplification strategy for enhancing text accessibility , 2012 .

[32]  G. Moruzzi Defining an annotation scheme with a view to automatic text simplification , 2014 .

[33]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[34]  Iryna Gurevych,et al.  A Monolingual Tree-based Translation Model for Sentence Simplification , 2010, COLING.