Learner corpora: the case of the NOSE corpus

This paper provides a brief overview of the scope of learner corpus research and describes a learner corpus by Spanish university students of English, the NOn-native Spanish corpus of English (NOSE). It presents the corpus data, its annotation and how it can be retrieved and exploited for research purposes in the areas of interlanguage studies and automatic recognition of learner-specific features. It also reviews the various research topics that have been investigated in the corpus.

[1]  Susanne Tayfoor Common Mistakes at First Certificate... and How to Avoid Them , 2004 .

[2]  Focus on French as a Foreign Language: Multidisciplinary Approaches , 2005 .

[3]  F. Myles Chapter 5. The Emergence of Morpho-syntactic Structure in French L2 , 2005 .

[4]  Christian Chiarcos,et al.  ANNIS: A Search Tool for Multi-Layer Annotated Corpora , 2009 .

[5]  Peter H. Ragan,et al.  8. Classroom use of a systemic functional small learner corpus , 2001 .

[6]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[7]  Julie Moore Common Mistakes at Proficiency...and How to Avoid Them , 2005 .

[8]  Walt Detmar Meurers,et al.  Towards interlanguage POS annotation for effective learner corpora in SLA and FLT , 2009 .

[9]  Barbara Seidlhofer,et al.  Pedagogy and local learner corpora: Working with learning-driven data , 2002 .

[10]  Nadja Nesselhauf,et al.  Collocations in a Learner Corpus , 2005 .

[11]  Pauline Cullen Common Mistakes at IELTS Intermediate: And How to Avoid Them , 2007 .

[12]  Thorsten Brants,et al.  TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.

[13]  Norma A. Pravec Survey of learner corpora , 2002 .

[14]  Ron Cowan,et al.  Four Questions for Error Diagnosis and Correction in CALL. , 2003 .

[15]  Ana Díaz-Negrillo,et al.  ERROR TAGGING SYSTEMS FOR LEARNER CORPORA , 2006 .

[16]  Ana Díaz-Negrillo,et al.  A tagging tool for error analysis on learner corpora , 2007 .

[17]  Ana Díaz-Negrillo,et al.  A learner corpus-based study on error associations1☆ , 2010 .

[18]  Liz Driscoll Common Mistakes at PET...and How to Avoid Them , 2005 .

[19]  Patrick Gillard Cambridge Advanced Learner's Dictionary , 2013 .

[20]  Debra Powell Common Mistakes at CAE...and How to Avoid Them , 2005 .

[21]  Joseph Collentine,et al.  A Multidimensional Analysis of a Written L2 Spanish Corpus , 2011 .

[22]  Nadja Nesselhauf,et al.  Learner Corpora and their Potential for Language Teaching , 2004 .

[23]  Michael Rundell,et al.  Macmillan English Dictionary for Advanced Learners , 2002 .

[24]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.