Multi Lingual Character Recognition Using Hierarchical Rule Based Classification and Artificial Neural Network

Optical Character Recognition is one of the rapidly growing areas of Artificial Intelligence due to its vast applicability. The technique is used to recognize characters printed on paper or elsewhere. The optical character recognition gains more importance when there are multiple languages present. The complexity of the problem increases for the addition of every language. The identification of character is both difficult and important in the presence of multiple languages. In this paper we propose a technique for multi lingual character identification. The algorithm uses the characteristics of the language to find out the language first. This is done using a rule-based approach. Then we apply neural network of the particular language to find out the exact character. We have coded and tested this using English capital letters, English small letters and Hindi letters. We got an appreciable accuracy using test cases of all languages. This proves the efficiency of the algorithm.

[1]  Haitao Liu,et al.  Feature selection for handwritten Chinese character recognition based on genetic algorithms , 1998, SMC'98 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.98CH36218).

[2]  Marcus Liwicki,et al.  Handwriting Recognition of Whiteboard Notes , 2005 .

[3]  Shashank Araokar,et al.  Visual Character Recognition using Artificial Neural Networks , 2005, ArXiv.

[4]  Sorin Draghici,et al.  A Neural Network Based Artificial Vision System for Licence Plate Recognition , 1997, Int. J. Neural Syst..

[5]  Sanjika Hewavitharana,et al.  Off-Line Sinhala Handwriting Recognition Using Hidden Markov Models , 2002, ICVGIP.

[6]  Jinping Li,et al.  Character Recognition Based on Hierarchical RBF Neural Networks , 2006, Sixth International Conference on Intelligent Systems Design and Applications.

[7]  Gernot A. Fink,et al.  Unsupervised Estimation of Writing Style Models for Improved Unconstrained Off-line Handwriting Recognition , 2006 .

[8]  Horst Bunke,et al.  Off-line Cursive Handwriting Recognition-On the Influence of Training Set and Vocabulary Size in Multiple Classifier Systems , 2022 .

[9]  Horst Bunke,et al.  Off-line cursive handwriting recognition using multiple classifier systems—on the influence of vocabulary, ensemble, and training set size , 2005 .

[10]  Mohsen Soryani,et al.  Application of Genetic Algorithms to Feature Subset Selection in a Farsi OCR , 2008 .

[11]  Jürgen Schmidhuber,et al.  Unconstrained On-line Handwriting Recognition with Recurrent Neural Networks , 2007, NIPS.

[12]  Mary Czerwinski,et al.  Computers beat Humans at Single Character Recognition in Reading based Human Interaction Proofs (HIPs) , 2005, CEAS.

[13]  Horst Bunke,et al.  Rejection strategies for offline handwritten text line recognition , 2006, Pattern Recognit. Lett..

[14]  A. Graves,et al.  Unconstrained Online Handwriting Recognition with Recurrent Neural Networks , 2007 .