A Novel Thresholding Method for Text Separation and Document Enhancement

Many thresholding-based image enhancement techniques have been developed and used for document analysis, where the simplicity and efficiency of thresholding makes it ideal to use for classifying layers within documents. However, the efficiency of these enhancement techniques can be impaired by the variation of grey levels in different documents, thus causing over-thresholding or under-thresholding. This paper presents a novel global singlestage thresholding method for separating background and foreground layers in text documents. The method finds an optimum thresholding value or exact separation point for each document using the relationship between luminance value and mean intensity of the document without considering peak values in the grey level histogram. The proposed method is implemented using 50 historical documents and five specifically designed words, and then compared to three other efficient and known thresholding methods. Experimental results suggest that the proposed method performs well for text separation and enhancement of document images.

[1]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[2]  Ching Y. Suen,et al.  A recursive thresholding technique for image segmentation , 1998, IEEE Trans. Image Process..

[3]  B. Kapralos,et al.  An introduction to digital image processing , 1990 .

[4]  Yan Solihin,et al.  Integral Ratio: A New Class of Global Thresholding Techniques for Handwriting Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[6]  Azriel Rosenfeld,et al.  Recovery of temporal information from static images of handwriting , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  J. R. Parker,et al.  Gray Level Thresholding in Badly Illuminated Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Josef Kittler,et al.  Minimum error thresholding , 1986, Pattern Recognit..

[9]  Ahmed S. Abutaleb,et al.  Automatic thresholding of gray-level pictures using two-dimensional entropy , 1989, Comput. Vis. Graph. Image Process..

[10]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Ergina Kavallieratou,et al.  Cleaning and Enhancing Historical Document Images , 2005, ACIVS.

[12]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[13]  A. D. Brink,et al.  Minimum cross-entropy threshold selection , 1996, Pattern Recognit..

[14]  David S. Doermann,et al.  Machine printed text and handwriting identification in noisy document images , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Andrew K. C. Wong,et al.  A new method for gray-level picture thresholding using the entropy of the histogram , 1985, Comput. Vis. Graph. Image Process..