A Phase Congruency Based Document Binarization

In this paper, three new methods proposed for binarization of degraded documents and manuscripts. Phase congruency used to select regions of interest (ROI) of document's foreground. The main idea is to achieve an optimal recall measure (recall˜1), while the precision value is at an acceptable level. Further processing should be performed to focus on the ROI. We also used a modified adaptive thresholding method in the next step. This method uses a global variance, a global mean and local means for thresholding. Finally, a new method called early exclusion criterion (EEC) proposed for document enhancement. The experimental results on the datasets introduced in DIBCO 2009, H-DIBCO 2010 and DIBCO 2011 shows that near optimal recall value (recall˜0.99) obtained, while precision measure's value is acceptable.

[1]  Thomas M. Breuel,et al.  Efficient implementation of local adaptive thresholding techniques using integral images , 2008, Electronic Imaging.

[2]  Laurence Likforman-Sulem,et al.  Document Recognition and Retrieval XVII , 2007 .

[3]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[4]  André Marion,et al.  Introduction to Image Processing , 1990, Springer US.

[5]  Pierre Soille,et al.  Morphological Image Analysis: Principles and Applications , 2003 .

[6]  Mohamed Cheriet,et al.  Beyond pixels and regions: A non-local patch means (NLPM) method for content-level restoration, enhancement, and reconstruction of degraded document images , 2011, Pattern Recognit..

[7]  Derek Bradley,et al.  Adaptive Thresholding using the Integral Image , 2007, J. Graph. Tools.

[8]  A.W.M. Smeulders,et al.  An introduction to image processing , 1991 .

[9]  Peter Kovesi,et al.  Image Features from Phase Congruency , 1995 .

[10]  Ioannis Pratikakis,et al.  Improved document image binarization by using a combination of multiple binarization techniques and adapted edge information , 2008, 2008 19th International Conference on Pattern Recognition.

[11]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[12]  Chengjun Liu,et al.  A Bayesian Discriminating Features Method for Face Detection , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Pierre Wellner,et al.  Adaptive Thresholding for the DigitalDesk , 1993 .

[14]  Mohamed Cheriet,et al.  A multi-scale framework for adaptive binarization of degraded document images , 2010, Pattern Recognit..

[15]  Ioannis Pratikakis,et al.  H-DIBCO 2010 - Handwritten Document Image Binarization Competition , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[16]  Øivind Due Trier,et al.  Evaluation of Binarization Methods for Document Images , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Mohamed Cheriet,et al.  A spatially adaptive statistical method for the binarization of historical manuscripts and degraded document images , 2011, Pattern Recognit..

[18]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[19]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[20]  Mohamed Cheriet,et al.  AdOtsu: An adaptive and parameterless generalization of Otsu's method for document image binarization , 2012, Pattern Recognit..

[21]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[22]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[23]  Ioannis Pratikakis,et al.  ICDAR 2011 Document Image Binarization Contest (DIBCO 2011) , 2011, 2011 International Conference on Document Analysis and Recognition.