One of the biggest challenges in image processing is recognizing documents in both printed and handwritten form. Optical Character Recognition (OCR) is a type of document image analysis in which a scanned digital image containing machine-printed or handwritten script is fed into an OCR software engine and translated into editable, machine-readable digital text. The development of OCR systems for Indian scripts is an active area of research. We attempt to develop an OCR system for the Oriya language, which is the official language of Orissa. Oriya presents great challenges to an OCR designer because of the large number of letters in its alphabet, the sophisticated ways in which they combine, and the complicated graphemes that result. In this paper, we argue that a number of automatic and semi-automatic tools can ease the development of recognizers for new font styles and new scripts. We briefly discuss these tools and show how they have helped build new OCRs for recognizing Oriya script. For recognition we use a back-propagation neural network: errors are corrected through back propagation, and the adjusted neuron values are passed forward through a multilayer network consisting of an input layer, one or more hidden (middle) layers, and an output layer.
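The feed-forward and back-propagation steps mentioned above can be illustrated with a minimal sketch. It is not the implementation used in this work: the single hidden layer, the sigmoid activation, the squared-error loss, the learning rate, the names TinyMLP, train and predict, and the two toy 3x3 glyphs are all assumptions chosen only to keep the example small.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyMLP:
    """Hypothetical input -> hidden -> output network (illustration only)."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        # One row of weights per neuron; the last entry of each row is the bias.
        self.w_hidden = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
                         for _ in range(n_hidden)]
        self.w_out = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
                      for _ in range(n_out)]

    def forward(self, x):
        # Feed-forward pass: input layer -> hidden layer -> output layer.
        h = [sigmoid(sum(w * v for w, v in zip(row, x + [1.0])))
             for row in self.w_hidden]
        y = [sigmoid(sum(w * v for w, v in zip(row, h + [1.0])))
             for row in self.w_out]
        return h, y

    def train_step(self, x, target, lr=0.5):
        h, y = self.forward(x)
        # Output error terms for squared error with sigmoid units.
        d_out = [(y[k] - target[k]) * y[k] * (1.0 - y[k])
                 for k in range(len(y))]
        # Propagate the error back through the output weights to the hidden layer.
        d_hid = [h[j] * (1.0 - h[j]) *
                 sum(d_out[k] * self.w_out[k][j] for k in range(len(y)))
                 for j in range(len(h))]
        # Gradient-descent weight updates (both deltas were computed first).
        for k, row in enumerate(self.w_out):
            for j, v in enumerate(h + [1.0]):
                row[j] -= lr * d_out[k] * v
        for j, row in enumerate(self.w_hidden):
            for i, v in enumerate(x + [1.0]):
                row[i] -= lr * d_hid[j] * v
        # Loss measured before this update.
        return 0.5 * sum((y[k] - target[k]) ** 2 for k in range(len(y)))


# Toy 3x3 binary "glyphs" (assumed data, not Oriya characters).
VERTICAL = [0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0]
HORIZONTAL = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0]
DATA = [(VERTICAL, [1.0, 0.0]), (HORIZONTAL, [0.0, 1.0])]


def train(net, data, epochs=2000, lr=0.5):
    """Run per-sample training and return the summed loss of each epoch."""
    return [sum(net.train_step(x, t, lr) for x, t in data)
            for _ in range(epochs)]


def predict(net, x):
    """Return the index of the most active output neuron."""
    _, y = net.forward(x)
    return max(range(len(y)), key=lambda k: y[k])
```

A real recognizer would replace the toy glyphs with feature vectors extracted from segmented character images and would use one output neuron per character class; those stages are outside the scope of this sketch.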