Improved Degraded Document Image Binarization Using Median Filter for Background Estimation

In this paper, we propose a new binarization method suitable for images of varying sizes and degradation levels. It estimates the document background surface with a new smoothing approach that uses a median filter and a compute-Sharp-Peak step in place of iterative polynomial smoothing. The resulting document image is then segmented by global thresholding. The simulation results confirm that the performance of the proposed method is generally competitive with that of existing methods. For highly degraded document images, specifically the documents in the BICKLEY DIARY database, the proposed method performs substantially better than the existing methods. We show that this approach outperforms existing, widely used binarization methods in terms of accuracy, F-measure, PSNR, NRM, MPM, DRD, recall and precision. DOI: http://dx.doi.org/10.5755/j01.eie.24.3.20982
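The two stages named in the abstract, background estimation by median filtering followed by a global threshold, can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the compute-Sharp-Peak step is omitted because the abstract does not define it, Otsu's method is assumed as the global threshold, and the function names and the window radius are hypothetical choices made for illustration.

```python
from statistics import median


def median_background(img, radius):
    """Estimate the background as the median of a (2r+1)x(2r+1) window.

    Windows are clipped at the image borders (an assumption of this sketch).
    """
    h, w = len(img), len(img[0])
    bg = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [
                img[j][i]
                for j in range(max(0, y - radius), min(h, y + radius + 1))
                for i in range(max(0, x - radius), min(w, x + radius + 1))
            ]
            bg[y][x] = median(window)
    return bg


def compensate(img, bg, levels=256):
    """Subtract the image from its background so dark text gets large values."""
    return [
        [min(levels - 1, max(0, int(round(b - p)))) for p, b in zip(row, bg_row)]
        for row, bg_row in zip(img, bg)
    ]


def otsu_threshold(values, levels=256):
    """Return the threshold t maximizing between-class variance (Otsu, assumed)."""
    hist = [0] * levels
    for v in values:
        hist[v] += 1
    total = len(values)
    sum_all = sum(i * c for i, c in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0, sum0 = 0, 0
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_t, best_var = t, var
    return best_t


def binarize(img, radius=2):
    """Return a mask with 1 for text pixels and 0 for background pixels."""
    diff = compensate(img, median_background(img, radius))
    t = otsu_threshold([v for row in diff for v in row])
    return [[1 if v > t else 0 for v in row] for row in diff]
```

As a usage example, calling `binarize` on a grayscale image stored as a list of rows of 8-bit integers returns a mask of the same shape. The window must be wider than the strokes for the median to reflect the background rather than the text, so `radius` would need tuning to the stroke width of the documents at hand.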
