Detection and correction of non word spelling errors in Hindi language

Hindi, the majority language of India, is still in its infancy stage concerning to natural language processing applications and research. Spelling detection and correction for Hindi language is an important process which has received inadequate attention. Today, there is lots of work available for English Spelling detection and correction but for Hindi, not much work has been done. Unlike English which has only two types of letters i.e. 21 consonants and 5 vowels, Hindi language in addition to 40 consonants, 10 vowels, consists of various types of symbols (ten vowel sign (matras), half letters & halant etc.). So the methods available for English cannot be directly applied for Hindi. Various documents, newspaper, novels, books, magazines, government notice etc. are typed in Hindi so there is need for development of spelling detection and correction tools for Hindi. In this paper author focused on detection and correction of spelling errors in Hindi language for non-word errors.

[1]  Roger Mitton Fifty years of spellchecking , 2010 .

[2]  Yves Schabes,et al.  Combining Trigram-based and Feature-based Methods for Context-Sensitive Spelling Correction , 1996, ACL.

[3]  Ritika Agarwal,et al.  Auto Spell Suggestion for High Quality Speech Synthesis in Hindi , 2014, ArXiv.

[4]  James L. Peterson,et al.  Computer programs for detecting and correcting spelling errors , 1980, CACM.

[5]  Prasenjit Majumder,et al.  N-gram: a language independent approach to IR and NLP , 2002 .

[6]  Error tracking in search engine development , 2012, WSSANLP@COLING.

[7]  Andrew R. Golding,et al.  A Bayesian Hybrid Method for Context-sensitive Spelling Correction , 1996, VLC@ACL.

[8]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[9]  Ayu Purwarianti,et al.  A non word error spell checker for Indonesian using morphologically analyzer and HMM , 2011, Proceedings of the 2011 International Conference on Electrical Engineering and Informatics.

[10]  Sivaji Bandyopadhyay,et al.  Detection and Correction of Phonetic Errors with a New Orthographic Dictionary , 2000, PACLIC.

[11]  V. Dixit,et al.  Design and implementation of a morphology-based spellchecker for Marathi, an Indian language , 2005 .

[12]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[14]  Bidyut Baran Chaudhuri,et al.  A simple real-word error detection and correction using local word bigram and trigram , 2013, ROCLING/IJCLCLP.

[15]  Antonio Zamora,et al.  Automatic spelling correction in scientific and scholarly text , 1984, CACM.

[16]  Karen Kukich,et al.  Techniques for automatically correcting words in text , 1992, CSUR.

[17]  S. Verberne Context-sensitive Spell Checking Based on Word Trigram Probabilities Context-sensitive Spell Checking Based on Word Trigram Probabilities , 2002 .

[18]  Xiang Tong,et al.  A Statistical Approach to Automatic OCR Error Correction in Context , 1996, VLC@COLING.

[19]  David M. W. Powers,et al.  Large scale experiments on correction of confused words , 2001, Proceedings 24th Australian Computer Science Conference. ACSC 2001.

[20]  Dan Roth,et al.  Scaling Up Context-Sensitive Text Correction , 2001, IAAI.

[21]  Graeme Hirst,et al.  Real-Word Spelling Correction with Trigrams: A Reconsideration of the Mays, Damerau, and Mercer Model , 2008, CICLing.

[22]  Graeme Hirst An Evaluation of the Contextual Spelling Checker of Microsoft Of fi ce Word 2007 , 2008 .

[23]  Marcos Zampieri,et al.  Effective Spell Checking Methods Using Clustering Algorithms , 2013, RANLP.

[24]  Fred J. Damerau,et al.  A technique for computer detection and correction of spelling errors , 1964, CACM.

[25]  Dan Roth,et al.  A Winnow-Based Approach to Context-Sensitive Spelling Correction , 1998, Machine Learning.

[26]  Yan Zhang,et al.  A Correcting Model Based on Tribayes for Real-Word Errors in English Essays , 2012, 2012 Fifth International Symposium on Computational Intelligence and Design.

[27]  Bidyut Baran Chaudhuri,et al.  Reversed word dictionary and phonetically similar word grouping based spell-checker to Bangla text , 2014 .

[28]  Robert L. Mercer,et al.  Context based spelling correction , 1991, Inf. Process. Manag..

[29]  Graeme Hirst,et al.  Correcting real-word spelling errors by restoring lexical cohesion , 2005, Natural Language Engineering.

[30]  Heshaam Faili Detection and correction of real-word spelling errors in Persian language , 2010, Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010).

[31]  David Yarowsky,et al.  DECISION LISTS FOR LEXICAL AMBIGUITY RESOLUTION: Application to Accent Restoration in Spanish and French , 1994, ACL.