论文信息 - Noise reduction in LSA-based essay assessment

Noise reduction in LSA-based essay assessment

With the Latent Semantic Analysis (LSA), it is possible to automatically grade essays, i.e., free-text responses to examinations, by comparing them to a corpus of available learning materials. In order to get grades that correspond to those given by human assessors, it is crucial to train the system with essays that have already been graded. Noise reduction refers to a process in which individual words used for comparing essays with learning materials are given weight according to their significance. To find out the optimal parameters for noise reduction, the system is trained with different parameters, and the corresponding grades for essays are predicted by each of these models. Three standard validation methods, holdout, bootstrap, and k-fold cross-validation, were applied for noise reduction. In an experiment that consisted of 283 essays from three examinations, each of a different subject, the holdout validation method turned out to give the best predictions, and hence, reduce most of the noise.

Erkki Sutinen | Tuomo Kakkonen | Jari Timonen

[1] Heikki Mannila,et al. Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[2] Ian H. Witten,et al. Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[3] M. Kenward,et al. An Introduction to the Bootstrap , 2007 .

[4] E. Sutinen,et al. Automatic assessment of the content of essays based on course materials , 2004, ITRE 2004. 2nd International Conference Information Technology: Research and Education.

[5] Fred Karlsson,et al. Constraint Grammar as a Framework for Parsing Running Text , 1990, COLING.

[6] Peter W. Foltz,et al. An introduction to latent semantic analysis , 1998 .

[7] Ron Kohavi,et al. A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[8] E. B. Page,et al. The Computer Moves into Essay Grading: Updating the Ancient Test. , 1995 .

[9] Bob Rehder,et al. How Well Can Passage Meaning be Derived without Using Word Order? A Comparison of Latent Semantic Analysis and Humans , 1997 .

[10] Richard A. Harshman,et al. Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[11] Erkki Sutinen,et al. Semi-Automatic Evaluation Features in Computer-assisted Essay Assessment , 2004, CATE.

[12] R. Dennis Cook,et al. Cross-Validation of Regression Models , 1984 .

[13] Naftali Tishby,et al. Sufficient Dimensionality Reduction , 2003, J. Mach. Learn. Res..

[14] Kimmo Koskenniemi,et al. A General Computational Model for Word-Form Recognition and Production , 1984 .