Modeling ESL Word Choice Similarities By Representing Word Intensions and Extensions

Automatic error correction systems for English as a Second Language(ESL) speakers often rely on the use of a confusion set to limit the choices of possible correction candidates. Typically, the confusion sets are either manually constructed or extracted from a corpus of manually corrected ESL writings. Both options require the involvement of English teachers. This paper proposes a method to automatically construct confusion sets for commonly used prepositions from non-ESL corpus without manual intervention. The proposed method simulates how ESL learners learn both the intensions and extensions of English words from standard English text. Our experimental results suggest that the automatically constructed confusion sets based on the similarities between the learned words’ intensions is competitive with those directly learned from an ESL corpus containing about 150K preposition usages.

[1]  Hwee Tou Ng,et al.  Grammatical Error Correction with Alternating Structure Optimization , 2011, ACL.

[2]  Hermann Ney,et al.  The Alignment Template Approach to Statistical Machine Translation , 2004, CL.

[3]  Tomer Hertz,et al.  Learning a Mahalanobis Metric from Equivalence Constraints , 2005, J. Mach. Learn. Res..

[4]  Michael Gamon,et al.  Correcting ESL Errors Using Phrasal SMT Techniques , 2006, ACL.

[5]  Lillian Lee,et al.  Measures of Distributional Similarity , 1999, ACL.

[6]  Rachele De Felice,et al.  Automatic error detection in non-native English , 2008 .

[7]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[8]  Hwee Tou Ng,et al.  Correcting Semantic Collocation Errors with L1-induced Paraphrases , 2011, EMNLP.

[9]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[10]  Dan Roth,et al.  Generating Confusion Sets for Context-Sensitive Error Correction , 2010, EMNLP.

[11]  Patrick Pantel,et al.  Discovering word senses from text , 2002, KDD.

[12]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.

[13]  Ming Zhou,et al.  SRL-Based Verb Selection for ESL , 2010, EMNLP.

[14]  Dan Roth,et al.  Algorithm Selection and Model Adaptation for ESL Correction Tasks , 2011, ACL.

[15]  Na-Rae Han,et al.  Detection of Grammatical Errors Involving Prepositions , 2007, ACL 2007.

[16]  David J. Chalmers,et al.  On Sense And Intension , 2002, Noûs.

[17]  Dan Roth,et al.  Training Paradigms for Correcting Errors in Grammar and Usage , 2010, NAACL.

[18]  Jennifer Foster,et al.  Using Parse Features for Preposition Selection and Error Detection , 2010, ACL.

[19]  Claudia Leacock,et al.  Automated Grammatical Error Correction for Language Learners , 2010, COLING.

[20]  Jianfeng Gao,et al.  Using Contextual Speller Techniques and Language Modeling for ESL Error Correction , 2008, IJCNLP.

[21]  Martin Chodorow,et al.  The Ups and Downs of Preposition Error Detection in ESL Writing , 2008, COLING.