Broken and degraded document images binarization

Document image binarization refers to the conversion of a document image into a binary image. For broken and severely degraded document images, binarization is a very challenging process. Unlike the traditional methods that separate the foreground from the background, this paper presents a new framework for the binarization of broken and degraded document images and restoring the quality of the document images. In our approach, the non-local means method is extended and used to remove noises from the input document image in the step of pre-process. Then the proposed method binarizes the document image which takes advantage of the quick adaptive thresholding proposed by Pierre D. Wellner. To get more pleasing binarization results, the binarized document image is post-processed finally. There are three measures in the post-process step: de-speckle, preserve stroke connectivity and improve quality of text regions. Experimental results show significant improvement in the binarization of the broken and degraded document images collected from various sources including degraded and broken books, magazines and document files.

[1]  Chien-Hsing Chou,et al.  A binarization method with learning-built rules for document images produced by cameras , 2010, Pattern Recognit..

[2]  Mohamed Cheriet,et al.  AdOtsu: An adaptive and parameterless generalization of Otsu's method for document image binarization , 2012, Pattern Recognit..

[3]  Hugo Vieira Neto,et al.  Processamento digital de imagens , 1999 .

[4]  Pierre Wellner,et al.  Adaptive Thresholding for the DigitalDesk , 1993 .

[5]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[6]  P. Stubberud,et al.  Adaptive image restoration of text images that contain touching or broken characters , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[7]  Rae-Hong Park,et al.  Document image binarization based on topographic analysis using a water flow model , 2002, Pattern Recognit..

[8]  Feng Lin,et al.  Off-line handwritten Chinese character stroke extraction , 2002, Object recognition supported by user interaction for service robots.

[9]  Ioannis Pratikakis,et al.  Performance Evaluation Methodology for Historical Document Image Binarization , 2013, IEEE Transactions on Image Processing.

[10]  Zhengzhou Li,et al.  Speckle reduction by adaptive window anisotropic diffusion , 2009, Signal Process..

[11]  A. Katartzis,et al.  A Bayesian approach to nonlinear diffusion based on a Laplacian prior for ideal image gradient , 2005, IEEE/SP 13th Workshop on Statistical Signal Processing, 2005.

[12]  Elisa H. Barney Smith,et al.  Pre-Processing of Degraded Printed Documents by Non-local Means and Total Variation , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[13]  Bidyut Baran Chaudhuri,et al.  A Global-to-Local Approach to Binarization of Degraded Document Images , 2014, 2014 22nd International Conference on Pattern Recognition.

[14]  C. V. Jawahar,et al.  Contextual restoration of severely degraded document images , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Frédéric Bouchara,et al.  Document Image Binarisation Using Markov Field Model , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[16]  Tatiana Novikova,et al.  Fast and accurate scene text understanding with image binarization and off-the-shelf OCR , 2015, International Journal on Document Analysis and Recognition (IJDAR).

[17]  Mohamed Cheriet,et al.  A spatially adaptive statistical method for the binarization of historical manuscripts and degraded document images , 2011, Pattern Recognit..

[18]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[19]  Nicholas R. Howe,et al.  A Laplacian Energy for Document Binarization , 2011, 2011 International Conference on Document Analysis and Recognition.

[20]  Jean-Michel Morel,et al.  A non-local algorithm for image denoising , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[21]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[22]  Thierry Géraud,et al.  Efficient multiscale Sauvola’s binarization , 2013, International Journal on Document Analysis and Recognition (IJDAR).

[23]  Thomas S. Huang,et al.  A generalization of median filtering using linear combinations of order statistics , 1983 .

[24]  Azriel Rosenfeld,et al.  Histogram concavity analysis as an aid in threshold selection , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[25]  Bohyung Han,et al.  Kernel-based Bayesian filtering for object tracking , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26]  Dayang Rohaya,et al.  Document Image Binarization Using Retinex and Global Thresholding , 2015 .

[27]  Rahul Sharma,et al.  Adaptive binarization of severely degraded and non-uniformly illuminated documents , 2014, International Journal on Document Analysis and Recognition (IJDAR).

[28]  Hong Yan,et al.  Linking broken character borders with variable sized masks to improve recognition , 1996, Pattern Recognit..

[29]  Ching Y. Suen,et al.  Thinning Methodologies - A Comprehensive Survey , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Efstathios Stamatatos,et al.  Improving the quality of degraded document images , 2006, Second International Conference on Document Image Analysis for Libraries (DIAL'06).

[32]  Ophir Frieder,et al.  Interactive degraded document enhancement and ground truth generation , 2008, Electronic Imaging.