Global Binarization of Document Images Using a Neural Network

In degraded scanned documents, where considerable background noise or variation in contrast and illumination exists, pixels may not be easily classified as foreground or background pixels. Thus, the need to perform document binarization in order to enhance the document image by separating foregrounds (text) from backgrounds. A new approach that combines a global thresholding method and a supervised neural network classifier is proposed to enhance scanned documents and to separate foreground and background layers. Thresholding is first applied using mass-difference thresholding to obtain various local optimum threshold values in an image. The neural network is then trained using these values at its input and a single global optimum threshold value for the entire image at its output. Compared with other methods, experimental results show that this combined approach is computationally cost effective and is capable of enhancing degraded documents with superior foreground and background separation results.

[1]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[3]  Nikos Papamarkos,et al.  Optimal combination of document binarization techniques using a self-organizing map neural network , 2007, Eng. Appl. Artif. Intell..

[4]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Ioannis Pratikakis,et al.  An Adaptive Binarization Technique for Low Quality Historical Documents , 2004, Document Analysis Systems.

[6]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[7]  Zheru Chi,et al.  Combined thresholding and neural network approach for vein pattern extraction from leaf images , 2006 .

[8]  Ergina Kavallieratou,et al.  Cleaning and Enhancing Historical Document Images , 2005, ACIVS.

[9]  Josef Kittler,et al.  Minimum error thresholding , 1986, Pattern Recognit..

[10]  David S. Doermann,et al.  Machine printed text and handwriting identification in noisy document images , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  A. Khashman,et al.  Novel Thresholding Method for Document Analysis , 2006, 2006 IEEE International Conference on Industrial Technology.

[12]  Salvador España Boquera,et al.  Enhancement and Cleaning of Handwritten Data by Using Neural Networks , 2005, IbPRIA.

[13]  Yaguang Yang,et al.  A text image enhancement system based on segmentation and classification methods , 2004, HDP '04.

[14]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .