The effectiveness of morphological rules for an isiZulu spelling checker

This paper shows how morphological analysis contributes to solving the challenges posed by the development of a spelling checker for an agglutinative language like isiZulu. It demonstrates how the incremental implementation of affix removal rules can be used to derive word forms and enhance the lexical and error recall of the system. In the case of the spelling checker the strategies used are mainly based on the use of regular expressions, and more specifically on a process of stemming.

[1]  Gilles-Maurice de Schryver,et al.  Spellcheckers for the South African languages, Part 1: The status quo and options for improvement , 2004 .

[2]  N. J. van Warmelo,et al.  Introduction to the Phonology of the Bantu Languages (Grundriss einer Lautlehre der Bantusprachen) , 1934 .

[3]  Virginia Teller Review of Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition by Daniel Jurafsky and James H. Martin. Prentice Hall 2000. , 2000 .

[4]  Sonja E. Bosch,et al.  Software Tools for Morphological Tagging of Zulu Corpora and Lexicon Development , 2004, LREC.

[5]  Briony Williams,et al.  Analysis of Unknown Words through Morphological Decomposition , 1991, EACL.

[6]  M. González Rodríguez,et al.  Proceedings of the third International Conference on Language Resources and Evaluation , 2002 .

[7]  Wessel Kraaij,et al.  Viewing stemming as recall enhancement , 1996, SIGIR '96.

[8]  Xabier Arregi,et al.  A word-grammar based morphological analyzer for agglutinative languages , 2000, COLING.

[9]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[10]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[11]  C. M. Doke,et al.  Zulu-English dictionary, , 1972 .

[12]  Andrei Popescu-Belis,et al.  Corpus-based Evaluation of a French Spelling and Grammar Checker , 2002, LREC.

[13]  M. van Zaanen,et al.  A spellchecker for Afrikaans, based on morphological analysis , 2003 .

[14]  David A. Hull,et al.  A Detailed Analysis of English Stemming Algorithms , 2006 .