Comparative study of thresholding techniques for gray-level document image binarization

Binarization is difficult for document images with poor contrast or illumination, intensive noise and sources type-related degradation. A new technique based on local thresholding is described in this paper. The idea of our technique is to update locally the threshold value whenever the Laplacian sign of the input image changes along the raster-scan line. The Differential of Gaussian (DoG) is used to define the sign image. This technique is tested with many images including different types of document components and degradations. The results are compared with a number of well-known techniques. It was shown that the proposed technique outperformed all other techniques studied.

[1]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[2]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[3]  Sargur N. Srihari,et al.  An object attribute thresholding algorithm for document image binarization , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[4]  Sargur N. Srihari,et al.  Document Image Binarization Based on Texture Features , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Alfred M. Bruckstein,et al.  A new method for image segmentation , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[6]  Ching Y. Suen,et al.  A recursive thresholding technique for image segmentation , 1998, IEEE Trans. Image Process..

[7]  P.K Sahoo,et al.  A survey of thresholding techniques , 1988, Comput. Vis. Graph. Image Process..