SVALA: Annotation of Second-Language Learner Text Based on Mostly Automatic Alignment of Parallel Corpora

Annotation of second-language learner text is a cumbersome manual task which in turn requires interpretation to postulate the intended meaning of the learner’s language. This paper describes SVALA, ...
