Evaluating Children's Composition Based on Chinese Linguistic Features with Machine Learning

Compositions are traditionally evaluated by human raters, a process that is time-consuming, laborious, and easily affected by subjectivity. In recent years, automated essay scoring (AES) has become an active topic in natural language processing, but few studies focus on Chinese AES. Hence, this study designed a Chinese AES system and collected 4,566 compositions from students in the first through sixth grades. We extracted 43 linguistic features based on the characteristics of Chinese and analyzed the compositions with three models, using stepwise multiple regression and support vector machines. Results showed that classification accuracy ranged between 70% and 80%.
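To make the idea of linguistic feature extraction concrete, the following is a minimal sketch of how a few surface features might be computed from a Chinese composition. The function name `surface_features` and the five features shown are illustrative assumptions; they are not the 43 features used in the study, which are not listed in this abstract.

```python
import re

# Hypothetical surface features for illustration only;
# these are NOT the 43 features extracted in the study.
SENTENCE_END = re.compile(r"[。！？]")    # common Chinese sentence-final punctuation
CJK_CHAR = re.compile(r"[\u4e00-\u9fff]")  # CJK Unified Ideographs (basic block)


def surface_features(text: str) -> dict:
    """Compute a few simple character- and sentence-level features."""
    chars = CJK_CHAR.findall(text)
    sentences = [s for s in SENTENCE_END.split(text) if s.strip()]
    n_chars = len(chars)
    n_sent = len(sentences)
    return {
        "char_count": n_chars,
        "unique_char_count": len(set(chars)),
        "char_type_token_ratio": len(set(chars)) / n_chars if n_chars else 0.0,
        "sentence_count": n_sent,
        "mean_sentence_length": n_chars / n_sent if n_sent else 0.0,
    }


sample = "我喜欢春天。春天很美！"
features = surface_features(sample)
print(features)
```

Feature vectors of this kind could then be passed to a regression model or a support vector machine to predict a grade level or score; that step is omitted here because the abstract does not specify the model configuration.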
