An Interactive Recommendation System for 2nd Language Vocabulary Learning - Vocabulometer 2.0

According to linguistics, one of the major challenges for language learners is to find materials adapted to their skill. By continuously analyzing the reading activity of the user, the proposed system estimates the user's vocabulary and recommends documents adapted to her/his skill. In this article we improve the Vocabulometer platform by taking into account the user's interest for some topics and the document complexity. We implemented a text classification method and complex word identification analysis that refine the recommendation process and guarantees a level of difficulty of recommended texts in accordance with the level of vocabulary of each user.

[1]  Derek Greene,et al.  Practical solutions to the problem of diagonal dominance in kernel document clustering , 2006, ICML.

[2]  Murat Can Ganiz,et al.  Semantic text classification: A survey of past and recent advances , 2018, Inf. Process. Manag..

[3]  Lucia Specia,et al.  Complex Word Identification: Challenges in Data Annotation and System Performance , 2017, NLP-TEA@IJCNLP.

[4]  Nicholas Journet,et al.  Vocabulometer: A Web Platform for Document and Reader Mutual Analysis , 2018, 2018 13th IAPR International Workshop on Document Analysis Systems (DAS).

[5]  Amy Beth Warriner,et al.  Concreteness ratings for 40 thousand generally known English word lemmas , 2014, Behavior research methods.

[6]  Lucia Specia,et al.  Text Simplification as Tree Transduction , 2013, STIL.

[7]  Julie Medero,et al.  HMC at SemEval-2016 Task 11: Identifying Complex Words Using Depth-limited Decision Trees , 2016, *SEMEVAL.

[8]  I. S. P. Nation,et al.  Learning Vocabulary in Another Language: Appendixes , 2001 .

[9]  Paul Nation,et al.  How much input do you need to learn the most frequent 9,000 words? , 2014 .

[10]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[11]  Marcin Mironczuk,et al.  A recent overview of the state-of-the-art elements of text classification , 2018, Expert Syst. Appl..

[12]  Stefanos D. Kollias,et al.  An intelligent e-learning system based on learner profiling and learning resources adaptation , 2008, Comput. Educ..

[13]  Paul Nation,et al.  A vocabulary size test , 2020 .

[14]  Lucia Specia,et al.  SemEval 2016 Task 11: Complex Word Identification , 2016, *SEMEVAL.

[15]  Gwo-Jen Hwang,et al.  Development of an adaptive learning system with two sources of personalization information , 2008, Comput. Educ..

[16]  Jeff McQuillan What Can Readers Read after Graded Readers , 2016 .

[17]  Matthew Shardlow,et al.  Out in the Open: Finding and Categorising Errors in the Lexical Simplification Pipeline , 2014, LREC.

[18]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL 2006.

[19]  Koichi Kise,et al.  Mental State Analysis on Eyewear , 2018, UbiComp/ISWC Adjunct.

[20]  Thomas H. Huckin Second language reading and vocabulary learning , 1995 .

[21]  Koichi Kise,et al.  A proposal of a document image reading-life log based on document image retrieval and eyetracking , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[22]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[23]  P. Nation,et al.  What vocabulary size is needed toread unsimplified texts for pleasure? , 2020 .

[24]  N. Schmitt,et al.  Incidental vocabulary acquisition from an authentic novel : Do Things Fall Apart ? , 2010 .

[25]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[26]  Hiroshi Nakagawa,et al.  Mining Words in the Minds of Second Language Learners: Learner-Specific Word Difficulty , 2012, COLING.

[27]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[28]  M. Brysbaert,et al.  Age-of-acquisition ratings for 30,000 English words , 2012, Behavior research methods.

[29]  Chih-Kai Chang,et al.  A personalized recommendation-based mobile learning approach to improving the reading performance of EFL students , 2013, Comput. Educ..

[30]  Motoi Iwata,et al.  Estimation of Student's Engagement Using a Smart Chair , 2018, UbiComp/ISWC Adjunct.

[31]  Chih-Ming Chen,et al.  Personalized Intelligent M-learning System for Supporting Effective English Learning , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[32]  Charu C. Aggarwal,et al.  A Survey of Text Classification Algorithms , 2012, Mining Text Data.