Using Selectional Restrictions for Real Word Error Correction

Spell Checking is an integral part of modern word-processing applications. Current spellcheckers can only detect and correct non-word errors. They cannot effectively deal with real-word errors; misspelled words that result in valid English words. Current techniques for detecting real-word errors require huge volume of training corpus and the learned knowledge is represented by opaque set of features that are not apparent. This paper proposes a new method for dealing with real-word errors using selectional preferences of predicates for arguments in a case slot. The method requires very little in terms of resources and can use existing lexicons slightly modified to suit the above task.

[1]  Andrew R. Golding,et al.  A Bayesian Hybrid Method for Context-sensitive Spelling Correction , 1996, VLC@ACL.

[2]  Eric Brill,et al.  Automatic Rule Acquisition for Spelling Correction , 1997, ICML.

[3]  Eric Atwell,et al.  Dealing with ill-formed English text , 1987 .

[4]  Philip Resnik,et al.  Selectional Preference and Sense Disambiguation , 1997 .

[5]  Graeme Hirst,et al.  Lexical chains as representations of context for the detection and correction of malapropisms , 1995 .

[6]  Hang Li,et al.  Generalizing Case Frames Using a Thesaurus and the MDL Principle , 1995, CL.

[7]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[8]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[9]  Robert L. Mercer,et al.  Context based spelling correction , 1991, Inf. Process. Manag..

[10]  Yves Schabes,et al.  Combining Trigram-based and Feature-based Methods for Context-Sensitive Spelling Correction , 1996, ACL.

[11]  Marc Light,et al.  Hiding a Semantic Hierarchy in a Markov Model , 1999, ACL 1999.

[12]  Karen Kukich,et al.  Techniques for automatically correcting words in text , 1992, CSUR.

[13]  Robert A. Wagner,et al.  Order-n correction for regular languages , 1974, CACM.

[14]  Stephen Clark,et al.  An Iterative Approach to Estimating Frequencies over a Semantic Hierarchy , 1999, EMNLP.

[15]  Dan Roth,et al.  Applying Winnow to Context-Sensitive Spelling Correction , 1996, ICML.

[16]  Massimiliano Ciaramita,et al.  Explaining away ambiguity: Learning verb selectional preference with Bayesian networks , 2000, COLING.

[17]  Kenneth Ward Church,et al.  Probability scoring for spelling correction , 1991 .