Paper information - Reducing Annotation Effort in Automatic Essay Evaluation Using Locality Sensitive Hashing

Reducing Annotation Effort in Automatic Essay Evaluation Using Locality Sensitive Hashing

Automated essay evaluation systems use machine learning models to predict the score of an essay. This requires a training set of essays, which is usually created by human annotators and takes time-consuming effort. A popular choice for scoring is a nearest neighbor model, which requires on-line computation of the nearest neighbors of a given essay. This is, however, a time-consuming task. In this work, we propose to use locality sensitive hashing, which helps to select a small subset of a large set of essays such that it will likely contain the nearest neighbors of a given essay. We conducted experiments on real-world data sets provided by Kaggle. According to the experimental results, it is possible to achieve good scoring performance with the proposed approach. The proposed approach is efficient with regard to time complexity. It also works well when only a small number of training essays are labeled by humans, giving results comparable to the case when a large essay set is used.
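The idea in the abstract, hashing essays so that only a small candidate subset is searched for nearest neighbors, can be illustrated with a minimal sketch. The abstract does not state the essay representation or the hash family, so everything here is an assumption made for illustration: essays are taken to be already converted to numeric feature vectors, the hash is random-hyperplane (sign of random projection) LSH for cosine similarity, and all function names (`make_hyperplanes`, `signature`, `build_index`, `predict_score`) are hypothetical rather than taken from the paper.

```python
import math
import random
from collections import defaultdict


def make_hyperplanes(dim, n_bits, seed=0):
    """Draw n_bits random hyperplanes (assumed hash family, not the paper's)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def signature(vec, planes):
    """One bit per hyperplane: which side of the plane the vector lies on."""
    return tuple(
        1 if sum(p * v for p, v in zip(plane, vec)) >= 0 else 0
        for plane in planes
    )


def build_index(vectors, planes):
    """Group the indices of the training essays by their hash signature."""
    buckets = defaultdict(list)
    for i, vec in enumerate(vectors):
        buckets[signature(vec, planes)].append(i)
    return buckets


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def predict_score(query, vectors, scores, planes, buckets, k=3):
    """Average the scores of the k most similar essays in the query's bucket."""
    candidates = buckets.get(signature(query, planes), [])
    if not candidates:
        # Empty bucket: fall back to an exhaustive search over all essays.
        candidates = range(len(vectors))
    nearest = sorted(
        candidates, key=lambda i: cosine(query, vectors[i]), reverse=True
    )[:k]
    return sum(scores[i] for i in nearest) / len(nearest)
```

In this sketch only the essays that share the query's signature are compared with it, so the cost of a prediction depends on the bucket size rather than on the size of the whole training set. Practical LSH setups usually use several hash tables to reduce the chance of missing true neighbors; that refinement is omitted here for brevity.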

Tomás Horváth | Dávid Szabó | Tsegaye Misikir Tashu
