CEDEL2: Design, compilation and web interface of an online corpus for L2 Spanish acquisition research

This article presents and reviews a new methodological resource for research in second language acquisition (SLA), CEDEL2 (Corpus Escrito del Español L2, 'L2 Spanish Written Corpus'), and its free online search-engine interface (cedel2.learnercorpora.com). CEDEL2 is a multi-first-language corpus (Spanish, English, German, Dutch, Portuguese, Italian, French, Greek, Russian, Japanese, Chinese, and Arabic) of L2 Spanish learners at all proficiency levels. It also contains several native control subcorpora (English, Portuguese, Greek, Japanese, and Arabic). Its latest release (version 2) holds material from around 4,400 speakers, amounting to over 1,100,000 words. CEDEL2 follows strict corpus-design criteria (Sinclair, 2005) and L2 corpus-design recommendations (Tracy-Ventura and Paquot, 2021), and all subcorpora share the same design so that they are fully contrastable, as recommended by Contrastive Interlanguage Analysis (Granger, 2015). Thanks to its design and web interface, CEDEL2 supports complex searches that can be narrowed down by its SLA-motivated variables, e.g. first language (L1), proficiency level, self-reported proficiency level, age of onset of the L2, length of exposure to the L2, length of residence in a Spanish-speaking country, knowledge of other foreign languages, and type of task. These features allow L2 researchers to address SLA questions and hypotheses.
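The idea of narrowing a search by learner variables can be illustrated with a minimal Python sketch. It does not reproduce the CEDEL2 web interface or its data format: the `Text` record, its field names, the `search` helper, and the sample sentences are all hypothetical and invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Text:
    """One corpus text plus a few learner variables (field names are hypothetical)."""
    l1: str
    proficiency: str
    task: str
    content: str


def search(texts, query, **filters):
    """Return the texts that contain `query` and match every metadata filter."""
    query = query.lower()
    return [
        t for t in texts
        if query in t.content.lower()
        and all(getattr(t, key) == value for key, value in filters.items())
    ]


# Invented toy records, not real CEDEL2 data.
SAMPLE = [
    Text("English", "advanced", "narrative", "Ayer fui al cine con mi hermana."),
    Text("English", "beginner", "description", "Yo soy estudiante y yo vivo en Leeds."),
    Text("Greek", "advanced", "narrative", "Cuando era pequeña, fui a Madrid."),
]

# Search for a word form, then narrow by L1 and proficiency level.
hits = search(SAMPLE, "fui", l1="English", proficiency="advanced")
print(len(hits))
```

Each keyword argument acts as one filter, so adding a variable such as `task="narrative"` narrows the result set further without changing the search itself.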

[1] Cristóbal Lozano et al. Interface conditions on postverbal subjects: A corpus study of L2 English, 2010.

[2] Ignacio M. Palacios Martínez et al. Learner Spanish on computer, 2016.

[3] Yukio Tono. What is missing in learner corpus design, 2016.

[4] Magali Paquot et al. Learner Corpus Research: An interdisciplinary field on the move, 2015.

[5] Cristóbal Lozano et al. From Corpora to Experiments: Methodological Triangulation in the Study of Word Order at the Interfaces in Adult Late Bilinguals (L2 learners), 2018, Journal of Psycholinguistic Research.

[6] Sylviane Granger et al. The contribution of learner corpora to second language acquisition and foreign language teaching: A critical evaluation, 2009.

[7] Amanda Huensch et al. Understanding second language fluency behavior: The effects of individual differences in first language fluency, cross-linguistic differences, and proficiency over time, 2016, Applied Psycholinguistics.

[8] G. Rojo et al. Introducción a la lingüística de corpus en español, 2021.

[9] Cristóbal Lozano et al. Learner corpora and second language acquisition, 2013.

[10] Magali Paquot et al. Quantitative research methods and study quality in learner corpus research, 2015.

[11] Nicole Tracy-Ventura et al. The importance of task variability in the design of learner corpora for SLA research, 2015.

[12] Nicole Tracy-Ventura et al. The LANGSNAP longitudinal learner corpus: design and use, 2016.

[13] Cristóbal Lozano. Pragmatic principles in anaphora resolution at the syntax-discourse interface: Advanced English learners of Spanish in the CEDEL2 corpus, 2016.

[14] Nancy Vázquez Veiga et al. Discourse markers in CEDEL2 and SPLLOC corpora of learner Spanish: Analysis of some lexical-pragmatic failures, 2016.

[15] Florence Myles. Second language acquisition theory and learner corpus research, 2015.

[16] Margarita Alonso-Ramos. Spanish Learner Corpus Research: current trends and future perspectives, 2016.

[17] Sylviane Granger et al. The Cambridge Handbook of Learner Corpus Research, 2015.

[18] Florence Myles et al. Interlanguage corpora and second language acquisition research, 2005.

[19] Florence Myles. SPLLOC: A new database for Spanish second language acquisition, 2008.

[20] Gaëtanelle Gilquin. Learner Corpora, 2020, A Practical Handbook of Corpus Linguistics.

[21] Gaëtanelle Gilquin et al. From design to collection of learner corpora, 2015.

[22] Sylviane Granger et al. How to use foreign and second language learner corpora, 2012.

[23] Gaëtanelle Gilquin. One norm to rule them all? Corpus-derived norms in learner corpus research and foreign language teaching, 2021, Language Teaching.

[24] Jennifer Yusun Kang et al. Telling a coherent story in a foreign language: analysis of Korean EFL learners' referential strategies in oral narrative discourse, 2004.

[25] S. Granger et al. Learner corpus methodology, 2015.

[26] Cristóbal Lozano et al. Selective deficits at the syntax-discourse interface: evidence from the CEDEL2 corpus, 2009.

[27] Sylviane Granger et al. Contrastive interlanguage analysis: A reappraisal, 2015.

[28] Nicole Tracy-Ventura et al. The role of dynamic contrasts in the L2 acquisition of Spanish past tense morphology, 2012, Bilingualism: Language and Cognition.