Prediction of OCR accuracy using a Neural Network

A method for predicting the accuracy achieved by an OCR system on an input image is presented. It is assumed that there is an ideal prediction function. A neural network is trained to estimate the unknown ideal function. In this project, multilayer perceptrons were trained to predict the character accuracy performance of two OCR systems using the backpropagation training method. The results show that this approach is sound. The feasibility of using an accuracy prediction system as a lter to discriminate good quality images (for OCR) from poor quality images (for manual keying) was also examined using a cost model of a large-scale document conversion process. Results show that a prediction system can reduce the total cost of converting a set of documents.

[1]  Stephen V. Rice,et al.  The Fourth Annual Test of OCR Accuracy , 1995 .

[2]  Thomas A. Nartker,et al.  Prediction of OCR accuracy using simple image features , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[3]  T. A. Nartker,et al.  A Preliminary Report on OCR Problems in LSS Document Conversion , 1992 .

[4]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.