Defining an annotation scheme with a view to automatic text simplification

This paper presents the preliminary steps of ongoing research on automatic text simplification. In line with current approaches, we propose a new annotation scheme designed to identify the types of changes an original sentence undergoes when it is manually simplified. The scheme has been tested on a parallel corpus available for Italian, which we first aligned at sentence level and then annotated with simplification rules.
