On the Automatic Analysis of Learner Language: Introduction to the Special Issue

Natural language processing (NLP) has long been used to automatically analyze language produced by language learners, typically aimed at providing individualized feedback and learner modeling in intelligent computer-assisted language learning systems (see Heift & Schulze, 2007). While much interesting research has been reported, it is difficult to determine the state of the art for the automatic analysis of learner language. Which error types and other learner language properties can be detected and diagnosed automatically? How reliably can this be done, for which kind of learner language, resulting from which types of tasks? For sustainable progress on the automatic analysis of learner language it is arguably crucial to answer these questions, to discuss and compare the performance of different analysis methods on real-life learner data sets.

[1]  Bertus van Rooy,et al.  The effect of learner errors on POS tag errors during automatic POS tagging , 2002 .

[2]  Sylviane Granger-Legrand,et al.  Learner English on computer , 1998 .

[3]  Ulla Connor,et al.  Applied corpus linguistics : a multidimensional perspective , 2004 .

[4]  P.J.M. de Haan,et al.  Tagging non-native English with the TOSCA-ICLE tagger , 2000, Corpus Linguistics and Linguistic Theory.

[5]  Bertus van Rooy,et al.  An evaluation of three POS taggers for the tagging of the Tswana Learner English Corpus , 2003 .

[6]  Alon Lavie,et al.  High-accuracy Annotation and Parsing of CHILDES Transcripts , 2007 .

[7]  Hossein Vahid Dastjerdi,et al.  Annotating speaker stance in discourse , 2017 .

[8]  Florence Myles,et al.  Using information technology to support empirical SLA research , 2007 .

[9]  Xiaofei Lu,et al.  Automatic measurement of syntactic complexity in child language acquisition , 2009 .

[10]  Trude Heift,et al.  Heift Trude Schulze Mathias. Errors and Intelligence in Computer-Assisted Language Learning: Parsers and Pedagogues Routledge (Routledge series in computer-assisted language learning), 2007. xviii+283 Pages. ISBN: 978-0-415-36191-0. Price: $115 , 2009, ReCALL.

[11]  Eileen Fitzpatrick,et al.  The Montclair Electronic Language Database Project , 2004 .

[12]  Ana Díaz-Negrillo,et al.  ERROR TAGGING SYSTEMS FOR LEARNER CORPORA , 2006 .

[13]  Brian MacWhinney,et al.  Enriching CHILDES for Morphosyntactic Analysis , 2008 .