Binarization of degraded document image based on contrast enhancement

Because of the different types of document degradation such as uneven illumination, image contrast variation and bleeding-through, binarization for degraded document images is still an enormous challenge for all scholars. This paper presents a new binarization method for degraded document images. The proposed algorithm focuses on the differences of image grayscale contrast in different areas. Firstly, theory of quadtree is used to divide areas adaptively. Secondly, various contrast enhancements are selected to adjust local grayscale contrast for different contrast areas. Lastly, the frequency of gray value is utilized to calculate threshold. The proposed algorithm was tested on the datasets of Document Image Binarization Contest (DIBCO) (DIBCO 2009, H-DIBCO 2010, DIBCO 2011, H-DIBCO 2012). Compared with other five classical algorithms, the binaried images using proposed algorithm gain the highest F-measure and PSNR.

[1]  Heng-Da Cheng,et al.  Fuzzy partition of two-dimensional histogram and its application to thresholding , 1999, Pattern Recognit..

[2]  Chien-Hsing Chou,et al.  A binarization method with learning-built rules for document images produced by cameras , 2010, Pattern Recognit..

[3]  Nikos Papamarkos,et al.  A neuro-fuzzy technique for document binarisation , 2003, Neural Computing & Applications.

[4]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[5]  Azriel Rosenfeld,et al.  Digital Picture Processing , 1976 .

[6]  Luigi Cinque,et al.  Image thresholding using fuzzy entropies , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[7]  Rahul Sharma,et al.  Adaptive binarization of severely degraded and non-uniformly illuminated documents , 2014, International Journal on Document Analysis and Recognition (IJDAR).

[8]  Yan Zhang,et al.  Document Image Binarization Based on NFCM , 2009, 2009 2nd International Congress on Image and Signal Processing.

[9]  M. Valizadeh,et al.  A novel hybrid algorithm for binarization of badly illuminated document images , 2009, 2009 14th International CSI Computer Conference.

[10]  Nikos Papamarkos,et al.  A technique for fuzzy document binarization , 2001, DocEng '01.

[11]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[12]  Jiangtao Wen,et al.  A new binarization method for non-uniform illuminated document images , 2013, Pattern Recognit..

[13]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[14]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .