Optimal Parameter Selection Technique for a Neural Network Based Local Thresholding Method

Abstract Thresholding of a given image into binary image is a necessary step for most image analysis and recognition techniques. In document recognition application, success of OCR mostly depends on the quality of the thresholded image. Non-uniform illumination, low contrast and complex background make it challenging in this application. In this paper, selection of optimal parameters for Neural Network (NN) based local thresholding approach for grey scale composite document image with non-uniform background is proposed. NN-based local image thresholding technique uses 8 statistical and textural image features to obtain a feature vector for each pixel from a window of size (2n + 1)x(2n + 1), where n ≥ 1. An exhaustive search was conducted on these features and found pixel value, mean and entropy are the optimal features at window size 3x3. To validate these 3 features some non-uniform watermarked document images with known binary document images called base documents are used. Characters were extracted from these watermarked documents using the proposed 3 features. The difference between the thresholded document and base document is the noise. A quantitative measure Peak-Signal-to-Noise ratio (PSNR) is used to measure the noise. In case of unknown base document characters were extracted through the proposed 3 features and used in a commercial OCR to obtain the character recognition rate. The average recognition rate 99.25% and PSNR shows that the proposed 3 features are the optimal compare to the NN-based thresholding technique with different parameters presented in the literature.

[1]  Adnan Khashman,et al.  Document Image binarisation Using a Supervised Neural Network , 2008, Int. J. Neural Syst..

[2]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[3]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[4]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[5]  M. Ahmadi,et al.  Local thresholding of composite documents using multi-layer perceptron neural network , 2004, The 2004 47th Midwest Symposium on Circuits and Systems, 2004. MWSCAS '04..

[6]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Maher A. Sid-Ahmed,et al.  Computer analysis of composite documents with non-uniform background , 2004 .

[8]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Maher A. Sid-Ahmed,et al.  Image processing, theory, algorithms and architectures , 1995 .

[10]  Y. J. Zhang,et al.  A survey on evaluation methods for image segmentation , 1996, Pattern Recognit..

[11]  Majid Ahmadi,et al.  A hidden Markov model-based character extraction method , 2008, Pattern Recognit..

[12]  Josef Kittler,et al.  Minimum error thresholding , 1986, Pattern Recognit..

[13]  Nikos Papamarkos,et al.  A technique for fuzzy document binarization , 2001, DocEng '01.

[14]  Rasit Köker,et al.  NEURAL NETWORK BASED AUTOMATIC THRESHOLD SELECTION FOR AN INDUSTRIAL VISION SYSTEM , 2003 .

[15]  Yasser M. Alginahi Thesholding and Character Recognition in Security Documents with Watermarked Background , 2008, 2008 Digital Image Computing: Techniques and Applications.

[16]  P.K Sahoo,et al.  A survey of thresholding techniques , 1988, Comput. Vis. Graph. Image Process..