Foreground-Background Regions Guided Binarization of Camera-Captured Document Images

Binarization is an important preprocessing step in several document image processing tasks. Nowadays handheld camera devices are in widespread use, that allow fast and flexible document image capturing. But, they may produce degraded grayscale image, especially due to bad shading or non-uniform illumination. State-of-the-art binarization techniques, which are designed for scanned images, do not perform well on camera-captured documents. Furthermore, local adaptive binarization methods, like Niblack [1], Sauvola [2], etc, are sensitive to free parameter values, which are fixed for whole image. In this paper, we describe a novel binarization technique using ridges-guided local binarization method, in which appropriate free parameter value(s) is(are) selected for each pixel depending on the presence or absence of ridge(s) in the local neighborhood of a pixel. Our method gives a novel way of automatically selecting parameter values for local binarization method, this improves binarization results for both scanned and camera-captured document images relative to previous methods. Experimental results on a subset of CBDAR 2007 document image dewarping contest dataset show a decrease in OCR error rate using reported method with respect to other stat-of-the-art bianrization methods.

[1]  Syed Saqib Bukhari,et al.  Ridges Based Curled Textline Region Detection from Grayscale Camera-Captured Document Images , 2009, CAIP.

[2]  A.W.M. Smeulders,et al.  An introduction to image processing , 1991 .

[3]  Nikos Papamarkos,et al.  Automatic Evaluation of Document Binarization Results , 2005, CIARP.

[4]  M. Goldbaum,et al.  Detection of blood vessels in retinal images using two-dimensional matched filters. , 1989, IEEE transactions on medical imaging.

[5]  M. Riley,et al.  Time-Frequency Representations for Speech Signals , 1987 .

[6]  Thomas M. Breuel,et al.  Performance Evaluation and Benchmarking of Six-Page Segmentation Algorithms , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  L. O'Gorman,et al.  Matched filter design for fingerprint image enhancement , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[8]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[9]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[10]  Hsi-Jian Lee,et al.  Binarization of color document images via luminance and saturation color features , 2002, IEEE Trans. Image Process..

[11]  André Marion,et al.  Introduction to Image Processing , 1990, Springer US.

[12]  Thomas M. Breuel,et al.  OCR Based Thresholding , 2009, MVA.

[13]  Thomas M. Breuel,et al.  Structural Mixtures for Statistical Layout Analysis , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[14]  Faisal Shafait Document Image Dewarping Contest , 2007 .

[15]  In-Jung Kim,et al.  Multi-window binarization of camera image for document recognition , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[16]  Berthold K. P. Horn SHAPE FROM SHADING: A METHOD FOR OBTAINING THE SHAPE OF A SMOOTH OPAQUE OBJECT FROM ONE VIEW , 1970 .

[17]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[18]  Shijian Lu,et al.  Thresholding of badly illuminated document images through photometric correction , 2007, DocEng '07.

[19]  Thomas M. Breuel,et al.  Efficient implementation of local adaptive thresholding techniques using integral images , 2008, Electronic Imaging.

[20]  Syed Saqib Bukhari,et al.  Script-Independent Handwritten Textlines Segmentation Using Active Contours , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[21]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[22]  Thomas M. Breuel,et al.  Background variability modeling for statistical layout analysis , 2008, 2008 19th International Conference on Pattern Recognition.

[23]  Stefano Messelodi,et al.  Geometric Layout Analysis Techniques for Document Image Understanding: a Review , 2008 .

[24]  Horst Bunke,et al.  Text extraction from colored book and journal covers , 2000, International Journal on Document Analysis and Recognition.

[25]  Nikos A. Nikolaou,et al.  Text binarization in color documents , 2006, Int. J. Imaging Syst. Technol..

[26]  Lawrence O'Gorman Binarization and Multithresholding of Document Images Using Connectivity , 1994, CVGIP Graph. Model. Image Process..

[27]  J. M. White,et al.  Image Thresholding for Optical Character Recognition and Other Applications Requiring Character Image Extraction , 1983, IBM J. Res. Dev..