Modèles adaptatifs pour prédire automatiquement la compétence lexicale d'un apprenant de français langue étrangère (Adaptive models for automatically predicting the lexical competence of French as a foreign language learners)

This study examines the use of supervised incremental machine learning techniques to automatically predict the lexical competence of French as a foreign language learners (FFL). The targeted learners are native speakers of Dutch having attained the A2/B1 proficiency level according to the Common European Framework of Reference for Languages (CEFR). Following recent work on lexical proficiency prediction using complexity indices, we elaborate two types of models that adapt to feedback disclosing the learners’ knowledge. In particular, we define (i) a model that predicts the lexical competence of learners having the same proficiency level and (ii) a model that predicts the lexical competence of one particular learner. The obtained models are then evaluated with respect to a baseline model, which predicts the lexical competence based on a specialised lexicon for FFL, and appear to gain significantly in accuracy (9%-17%).

[1]  Norbert Schmitt,et al.  Tracking the Incremental Acquisition of Second Language Vocabulary: A Longitudinal Study , 1998 .

[2]  P. Nation,et al.  Unknown vocabulary density and reading comprehension , 2020 .

[3]  Glenn Stockwell,et al.  Vocabulary on the move: Investigating an intelligent mobile phone-based vocabulary tutor , 2007 .

[4]  Nitesh V. Chawla,et al.  Data Mining for Imbalanced Datasets: An Overview , 2005, The Data Mining and Knowledge Discovery Handbook.

[5]  Simone Paolo Ponzetto,et al.  BabelNet: Building a Very Large Multilingual Semantic Network , 2010, ACL.

[6]  Matthew Shardlow,et al.  A Comparison of Techniques to Automatically Identify Complex Words. , 2013, ACL.

[7]  Chih-Ming Chen,et al.  Personalized mobile English vocabulary learning system based on item response theory and learning memory cycle , 2008, Comput. Educ..

[8]  Cédrick Fairon,et al.  Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource , 2016, LREC.

[9]  Viktor Pekar,et al.  Methods for extracting and classifying pairs of cognates and false friends , 2008, Machine Translation.

[10]  Cédrick Fairon,et al.  Les apports du TAL à la lisibilité du français langue étrangère [Contributions of NLP to the readability of French as a foreign language] , 2013, TAL.

[11]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[12]  Maxine Eskénazi,et al.  Combining Lexical and Grammatical Features to Improve Readability Measures for First and Second Language Texts , 2007, NAACL.

[13]  Richard Johansson,et al.  Rule-based and machine learning approaches for second language sentence-level readability , 2014, BEA@ACL.

[14]  S. Krashen We Acquire Vocabulary and Spelling by Reading: Additional Evidence for the Input Hypothesis , 1989 .

[15]  Delphine Bernhard,et al.  Syntactic Sentence Simplification for French , 2014, PITR@EACL.

[16]  B. Laufer Second Language Vocabulary Acquisition: The lexical plight in second language reading: Words you don't know, words you think you know, and words you can't guess , 1996 .

[17]  M. Brysbaert,et al.  The use of film subtitles to estimate word frequencies , 2007, Applied Psycholinguistics.

[18]  Stefano Ferilli,et al.  Machine Learning Approaches for Inducing Student Models , 2004, IEA/AIE.

[19]  Michel Fayol,et al.  Psychologie cognitive de la lecture , 1992 .

[20]  Cédrick Fairon,et al.  Un modèle pour prédire la complexité lexicale et graduer les mots , 2014 .

[21]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[22]  Cédrick Fairon,et al.  FLELex: a graded lexical resource for French foreign learners , 2014, LREC.

[23]  Bernard Lété,et al.  MANULEX: A grade-level lexical database from French elementary school readers , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[24]  M. Wesche,et al.  Assessing Second Language Vocabulary Knowledge: Depth Versus Breadth. , 1996 .

[25]  B. Laufer,et al.  Lexical threshold revisited: Lexical text coverage, learners' vocabulary size and reading comprehension , 2010 .

[26]  L. Eon Bottou Online Learning and Stochastic Approximations , 1998 .

[27]  Anna Papst,et al.  Learning Vocabulary In Another Language , 2016 .