Automatic Error Detection Method for Japanese Particles

In this article, I propose an approach for detecting appropriate usage models of case particles in the writings of Japanese Second Language learners (JSL) in order to create a Japanese automatic error detection system. As learner corpora are receiving special attention as an invaluable source for the educational feedback to improve teaching material and methodology, automatic methods of error analysis have become necessary to facilitate the development of learner corpora. Particle errors account for a substantial proportion of all grammatical errors by JSL learners and discourage the readers from understanding the meaning of a sentence. To address this issue, I trained Support Vector Machines (SVMs) to learn correct patterns of case particle usages from a Japanese newspaper text corpus. The result differs according to the kind of the particle. The object marker “wo (を)” has the best score of 81.4%. Applying the “wo (を)” model to detect wrong use of the particle, the result shows 92.6% for precision and 34.3% for recall with the 100 instance test set. The result shows 95.2% for precision and 37.6% for recall with the 200 instance test set. Although this is a pilot study, this experiment shows a promising result for Japanese particle error detection. Key terms: Automatic Error Detection, Learner Corpora, Support Vector Machines, N-gram, Case Particle Detection

[1]  J. R. Firth,et al.  A Synopsis of Linguistic Theory, 1930-1955 , 1957 .

[2]  Mirella Lapata,et al.  Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008) , 2008 .

[3]  Martin Chodorow,et al.  The Ups and Downs of Preposition Error Detection in ESL Writing , 2008, COLING.

[4]  Michael Gamon,et al.  Correcting ESL Errors Using Phrasal SMT Techniques , 2006, ACL.

[5]  Yuji Matsumoto,et al.  Japanese Dependency Analysis using Cascaded Chunking , 2002, CoNLL.

[6]  Martin Chodorow,et al.  An Unsupervised Method for Detecting Grammatical Errors , 2000, ANLP.

[7]  Stephen G. Pulman,et al.  Automatically Acquiring Models of Preposition Use , 2007, ACL 2007.

[8]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[9]  Na-Rae Han,et al.  Detection of Grammatical Errors Involving Prepositions , 2007, ACL 2007.

[10]  Kristina Toutanova,et al.  Learning to Predict Case Markers in Japanese , 2006, ACL.

[11]  Graeme Hirst,et al.  Real-Word Spelling Correction with Trigrams: A Reconsideration of the Mays, Damerau, and Mercer Model , 2008, CICLing.

[12]  Rachele De Felice,et al.  A Classifier-Based Approach to Preposition and Determiner Error Correction in L2 English , 2008, COLING.

[13]  Kenji Araki,et al.  Automatic Error Detection and Correction of Japanese Particles Using Features within Bunsetsu , 2007 .

[14]  Robert L. Mercer,et al.  Context based spelling correction , 1991, Inf. Process. Manag..