OCR For Printed Urdu Script Using Feed Forward Neural Network

This paper deals with an Optical Character Recognition system for printed Urdu, a popular Pakistani/Indian script and is the third largest understandable language in the world, especially in the subcontinent but fewer efforts are made to make it understandable to computers. Lot of work has been done in the field of literature and Islamic studies in Urdu, which has to be computerized. In the proposed system individual characters are recognized using our own proposed method/ algorithms. The feature detection methods are simple and robust. Supervised learning is used to train the feed forward neural network. A prototype of the system has been tested on printed Urdu characters and currently achieves 98.3% character level accuracy on average .Although the system is script/ language independent but we have designed it for Urdu characters only.

[1]  S. Ohhashi,et al.  Alphanumeric character recognition using a connectionist model with the pocket algorithm , 1989, International 1989 Joint Conference on Neural Networks.

[2]  R.M.K. Sinha,et al.  Integrating word level knowledge in text recognition , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[3]  Yann LeCun,et al.  Handwritten zip code recognition with multilayer networks , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[4]  Theodosios Pavlidis,et al.  On the Recognition of Printed Characters of Any Font and Size , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Kunihiko Fukushima,et al.  Character recognition with selective attention , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[6]  Jin Wang,et al.  Weight smoothing to improve network generalization , 1994, IEEE Trans. Neural Networks.

[7]  Jack L. Meador,et al.  Encoding a priori information in feedforward networks , 1991, Neural Networks.

[8]  Chung-Hsien Wu,et al.  A shunting multilayer perceptron network for confusing/composite pattern recognition , 1991, Pattern Recognit..

[9]  K Fukushima,et al.  Handwritten alphanumeric character recognition by the neocognitron , 1991, IEEE Trans. Neural Networks.

[10]  Alan F. Murray,et al.  IEEE International Conference on Neural Networks , 1997 .