Effect of Application of Ensemble Method on Machine Learning with Insufficient Training Set in Developing Automated English Essay Scoring System

In order to train a supervised machine learning algorithm, it is necessary to have non-biased labels and a sufficient amount of training data. However, it is difficult to collect the required non-biased labels and a sufficient amount of training data to develop an automatic English Composition scoring system. In addition, an English writing assessment is carried out using a multi-faceted evaluation of the overall level of the answer. Therefore, it is difficult to choose an appropriate machine learning algorithm for such work. In this paper, we show that it is possible to alleviate these problems through ensemble learning. The results of the experiment indicate that the ensemble technique exhibited an overall performance that was better than that of other algorithms.

[1]  Pavel Brazdil,et al.  Comparison of SVM and Some Older Classification Algorithms in Text Classification Tasks , 2006, IFIP AI.

[2]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[3]  Brian D. Fisher,et al.  University of British Columbia , 2002, INTR.

[4]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[5]  James Parker,et al.  on Knowledge and Data Engineering, , 1990 .

[6]  Kong-Joo Lee,et al.  Implementing Automated English Error Detecting and Scoring System for Junior High School Students , 2007 .

[7]  Semire Dikli,et al.  An Overview of Automated Scoring of Essays. , 2006 .

[8]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[9]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[10]  郑肇葆,et al.  基于Naive Bayes Classifiers的航空影像纹理分类 , 2006 .

[11]  Hung Hum,et al.  Is Naïve Bayes a Good Classifier for Document Classification , 2011 .

[12]  Kong Joo Lee,et al.  Developing an Automated English Sentence Scoring System for Middle-school Level Writing Test by Using Machine Learning Techniques , 2014 .

[13]  Yao Jian-Min,et al.  Automated Essay Scoring Using Multi-classifier Fusion , 2011 .

[14]  Jill Burstein,et al.  AUTOMATED ESSAY SCORING WITH E‐RATER® V.2.0 , 2004 .

[15]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .