Adaptive binarization method for degraded document images based on surface contrast variation

AbstractDocument binarization is an important technique in document image analysis and recognition. Generally, binarization methods are ineffective for degraded images. Several binarization methods have been proposed; however, none of them are effective for historical and degraded document images. In this paper, a new binarization method is proposed for degraded document images. The proposed method based on the variance between pixel contrast, it consists of four stages: pre-processing, geometrical feature extraction, feature selection, and post-processing. The proposed method was evaluated based on several visual and statistical experiments. The experiments were conducted using five International Document Image Binarization Contest benchmark datasets specialized for binarization testing. The results compared with five adaptive binarization methods: Niblack, Sauvola thresholding, Sauvola compound algorithm, NICK, and Bataineh. The results show that the proposed method performs better than other methods in all binarization cases.

[1]  Nicholas R. Howe,et al.  A Laplacian Energy for Document Binarization , 2011, 2011 International Conference on Document Analysis and Recognition.

[2]  Nikos Papamarkos,et al.  Robust document binarization with OFF center-surround cells , 2011, Pattern Analysis and Applications.

[3]  Nikos Papamarkos,et al.  An Evaluation Technique for Binarization Algorithms , 2008, J. Univers. Comput. Sci..

[4]  Constantine Kotropoulos,et al.  Color image histogram equalization by absolute discounting back-off , 2007, Comput. Vis. Image Underst..

[5]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[6]  Its'hak Dinstein,et al.  Binarization, character extraction, and writer identification of historical Hebrew calligraphy documents , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[7]  Ioannis Andreadis,et al.  A Center-Surround Histogram for content-based image retrieval , 2011, Pattern Analysis and Applications.

[8]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[9]  Patrenahalli M. Narendra,et al.  A Separable Median Filter for Image Noise Smoothing , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Chien-Hsing Chou,et al.  A binarization method with learning-built rules for document images produced by cameras , 2010, Pattern Recognit..

[11]  Ioannis Pratikakis,et al.  ICDAR 2011 Document Image Binarization Contest (DIBCO 2011) , 2011, 2011 International Conference on Document Analysis and Recognition.

[12]  Jung-Hua Wang,et al.  Improved median filter using minmax algorithm for image processing , 1997 .

[13]  Yung-Hsiang Chiu,et al.  Parameter-free based two-stage method for binarizing degraded document images , 2012, Pattern Recognit..

[14]  Ioannis Pratikakis,et al.  H-DIBCO 2010 - Handwritten Document Image Binarization Competition , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[15]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[16]  Nicole Vincent,et al.  Comparison of Niblack inspired binarization methods for ancient documents , 2009, Electronic Imaging.

[17]  Ioannis Pratikakis,et al.  ICFHR 2012 Competition on Handwritten Document Image Binarization (H-DIBCO 2012) , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[18]  E. Chichilnisky,et al.  Functional Asymmetries in ON and OFF Ganglion Cells of Primate Retina , 2002, The Journal of Neuroscience.

[19]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[20]  Linda G. Shapiro,et al.  Computer Vision , 2001 .

[21]  Khairuddin Omar,et al.  An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows , 2011, Pattern Recognit. Lett..

[22]  Ioannis Pratikakis,et al.  DIBCO 2009: document image binarization contest , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[23]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[24]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[25]  B. Kapralos,et al.  I An Introduction to Digital Image Processing , 2022 .