An Intelligent Testing Strategy for Vocabulary Assessment of Chinese Second Language Learners

Vocabulary is one of the most important components of language competence, and testing vocabulary knowledge is central to research on reading and language. However, building an item bank and testing a large number of students usually costs a great deal of time and human labor. In this paper, we propose a novel testing strategy that combines automatic item generation (AIG) and computerized adaptive testing (CAT) for the vocabulary assessment of Chinese L2 learners. First, we generate three types of vocabulary questions by modeling both vocabulary knowledge and learners' writing error data. After evaluation and calibration, we construct a balanced item pool from the automatically generated items and implement a three-parameter computerized adaptive test. In the experiments, we conduct manual item evaluation and online student tests. The results show that combining AIG and CAT makes it possible to construct test items efficiently and to reduce test cost significantly. In addition, the CAT test results provide valuable feedback to the AIG algorithms.
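To make the "three-parameter computerized adaptive test" concrete, the sketch below shows one common way such a test can be organized. It is not the procedure used in this paper, whose details are not given in the abstract. It assumes the standard three-parameter logistic (3PL) item response model without a scaling constant, maximum-information item selection, and a coarse grid-search ability estimate. All function names and the example item parameters are hypothetical.

```python
import math

# Each item is a tuple (a, b, c): discrimination, difficulty, guessing.
# These names and the example values below are illustrative assumptions.

def p_correct(theta, a, b, c):
    """3PL probability that a learner of ability theta answers correctly."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b, c):
    """Fisher information of a 3PL item at ability theta."""
    p = p_correct(theta, a, b, c)
    q = 1.0 - p
    return (a ** 2) * (q / p) * ((p - c) / (1.0 - c)) ** 2

def select_next_item(theta, pool, administered):
    """Pick the unused item with the highest information at theta."""
    candidates = [i for i in range(len(pool)) if i not in administered]
    return max(candidates, key=lambda i: item_information(theta, *pool[i]))

def estimate_theta(responses, pool, grid=None):
    """Grid-search maximum-likelihood ability estimate.

    responses is a list of (item_index, is_correct) pairs.
    """
    if grid is None:
        grid = [x / 10.0 for x in range(-40, 41)]  # -4.0 .. 4.0

    def log_likelihood(theta):
        total = 0.0
        for idx, correct in responses:
            p = p_correct(theta, *pool[idx])
            total += math.log(p if correct else 1.0 - p)
        return total

    return max(grid, key=log_likelihood)

# Hypothetical three-item pool: easy, medium, and hard four-option items.
pool = [(1.0, -2.0, 0.25), (1.0, 0.0, 0.25), (1.0, 2.0, 0.25)]
first = select_next_item(0.0, pool, administered=set())
```

In an adaptive loop, the test would alternate between `select_next_item` and `estimate_theta` until a stopping rule is met, such as a fixed test length. A production system would typically replace the grid search with a Bayesian or Newton-type estimator and add constraints such as content balancing across the three item types.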
