Investigating the Application of Automated Writing Evaluation to Chinese Undergraduate English Majors: A Case Study of WriteToLearn

This study investigated the application of WriteToLearn to Chinese undergraduate English majors’ essays, examining both its scoring ability and the accuracy of its error feedback. Participants were 163 second-year English majors at a university in Sichuan province, who wrote 326 essays in response to two writing prompts. Each essay was marked by four trained human raters as well as by WriteToLearn. Many-facet Rasch measurement (MFRM) was conducted to calibrate WriteToLearn’s rating performance across the whole set of essays against that of the four human raters, and the accuracy of its feedback on 60 randomly selected essays was compared with the feedback provided by the human raters. Two main findings emerged for scoring: WriteToLearn was more consistent but considerably more stringent than the four human raters, and it failed to score seven essays. For error feedback, WriteToLearn achieved an overall precision of 49% and recall of 18.7%, falling well short of the minimum threshold of 90% precision that Burstein, Chodorow, and Leacock (2003) set for a reliable error-detection tool. It also had particular difficulty identifying errors in article use, preposition use, word choice, and expression.
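For reference, the precision and recall figures above follow the standard definitions for error-detection systems; the notation below is ours rather than the study’s. Taking the human raters’ annotations as the gold standard, let TP be the errors WriteToLearn flagged that the raters confirmed, FP its false alarms, and FN the rater-identified errors it missed:

\[ \text{precision} = \frac{TP}{TP + FP}, \qquad \text{recall} = \frac{TP}{TP + FN} \]

Read against the reported figures, a precision of 49% means roughly half of the system’s flags were genuine errors, and a recall of 18.7% means it detected fewer than one in five of the errors the human raters found. The MFRM calibration, in its standard many-facet formulation (a sketch; the study’s exact facet specification may differ), models the log-odds of adjacent rating categories as

\[ \log \frac{P_{nijk}}{P_{nij(k-1)}} = B_n - D_i - C_j - F_k \]

where B_n is the ability of writer n, D_i the difficulty of prompt i, C_j the severity of rater j (the four humans and WriteToLearn each entering as a rater facet), and F_k the threshold of rating category k. Rater severity and consistency findings such as those reported above derive from the C_j estimates and their associated fit statistics.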

[1] Vahid Aryadoust et al. Predicting EFL writing ability from levels of mental representation measured by Coh-Metrix: A structural equation modeling study. 2015.

[2] Semire Dikli et al. Automated Essay Scoring feedback for second language writers: How does it compare to instructor feedback? 2014.

[3] Martin Chodorow et al. Criterion SM Online Essay Evaluation: An Application for Automated Evaluation of Student Essays. IAAI, 2003.

[4] Claudia Leacock et al. Automated Grammatical Error Correction for Language Learners. COLING, 2010.

[5] Laura K. Allen et al. A Hierarchical Classification Approach to Automated Essay Scoring. 2015.

[6] Dana R. Ferris et al. Written corrective feedback for individual L2 writers. 2013.

[7] Na-Rae Han et al. Detecting errors in English article usage by non-native speakers. Natural Language Engineering, 2006.

[8] Aek Phakiti et al. The effects of computer-generated feedback on the quality of writing. 2014.

[9] Sara Cushing Weigle et al. English language learners and automated scoring of essays: Critical considerations. 2013.

[10] Andrea Everard et al. Does spell-checking software need a warning label? CACM, 2005.

[11] Sara Cushing Weigle. English as a Second Language Writing and Automated Essay Evaluation. 2013.

[12] T. Landauer. Automatic Essay Assessment. 2003.

[13] Martin Chodorow et al. Native Judgments of Non-Native Usage: Experiments in Preposition Error Detection. COLING, 2008.

[14] Chi-Fen Emily Chen et al. Beyond the Design of Automated Writing Evaluation: Pedagogical Practices and Perceived Learning Effectiveness in EFL Writing Classes. 2008.

[15] Volker Hegelheimer et al. Rethinking the role of automated writing evaluation (AWE) feedback in ESL writing instruction. 2015.

[16] Mark Warschauer et al. Automated writing evaluation: defining the classroom research agenda. 2006.

[17] Jill Burstein et al. Automated Essay Scoring: A Cross-disciplinary Perspective. 2003.

[18] Mark D. Shermis et al. State-of-the-art automated essay scoring: Competition, results, and future directions from a United States demonstration. 2014.

[19] Donald E. Powers et al. Stumping E-rater: Challenging the validity of automated essay scoring. 2001.

[20] Brent Bridgeman. Human Ratings and Automated Essay Evaluation. 2013.

[21] Volker Hegelheimer et al. The role of automated writing evaluation holistic scores in the ESL classroom. 2014.

[22] M. Warschauer et al. Automated Writing Assessment in the Classroom. 2008.

[23] Martin Chodorow et al. Automated Scoring Using a Hybrid Feature Identification Technique. ACL, 1998.

[24] Jill Burstein et al. The E-rater® scoring engine: Automated essay scoring with natural language processing. 2003.

[25] Peter W. Foltz et al. The intelligent essay assessor: Applications to educational technology. 1999.

[26] Yigal Attali et al. Validity and Reliability of Automated Essay Scoring. 2013.

[27] Jill Burstein et al. Automated Essay Scoring with E-rater® V.2.0. 2004.

[28] Les Perelman et al. When “the state of the art” is counting words. 2014.

[29] Martin Chodorow et al. The Ups and Downs of Preposition Error Detection in ESL Writing. COLING, 2008.