Historical Document Binarization Based on Phase Information of Images

In this paper, phase congruency features are used to develop a binarization method for degraded documents and manuscripts. Also, Gaussian and median filtering are used in order to improve the final binarized output. Gaussian filter is used for further enhance the output and median filter is applied to remove noises. To detect bleed-through degradation, a feature map based on regional minima is proposed and used. The proposed binarization method provides output binary images with high recall values and competitive precision values. Promising experimental results obtained on the DIBCO'09, H-DIBCO'10 and DIBCO'11 datasets, and this shows the robustness of the proposed binarization method against a large number of different types of degradation.

[1]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[2]  Hossein Ziaei Nafchi,et al.  A Phase Congruency Based Document Binarization , 2012, ICISP.

[3]  Ioannis Pratikakis,et al.  H-DIBCO 2010 - Handwritten Document Image Binarization Competition , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[4]  Peter Kovesi,et al.  Image Features from Phase Congruency , 1995 .

[5]  Pierre Wellner,et al.  Adaptive Thresholding for the DigitalDesk , 1993 .

[6]  Ioannis Pratikakis,et al.  Improved document image binarization by using a combination of multiple binarization techniques and adapted edge information , 2008, 2008 19th International Conference on Pattern Recognition.

[7]  Shijian Lu,et al.  Document image binarization using background estimation and stroke edges , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[8]  Mohamed Cheriet,et al.  A multi-scale framework for adaptive binarization of degraded document images , 2010, Pattern Recognit..

[9]  Ioannis Pratikakis,et al.  Call for Participation DIBCO 2009 - Document Image Binarization Contest , 2009 .

[10]  D. Burr,et al.  Feature detection in human vision: a phase-dependent energy model , 1988, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[11]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[12]  Shijian Lu,et al.  Binarization of historical document images using the local maximum and minimum , 2010, DAS '10.

[13]  Mohamed Cheriet,et al.  AdOtsu: An adaptive and parameterless generalization of Otsu's method for document image binarization , 2012, Pattern Recognit..

[14]  Derek Bradley,et al.  Adaptive Thresholding using the Integral Image , 2007, J. Graph. Tools.

[15]  Ioannis Pratikakis,et al.  ICDAR 2011 Document Image Binarization Contest (DIBCO 2011) , 2011, 2011 International Conference on Document Analysis and Recognition.

[16]  Shijian Lu,et al.  A Self-Training Learning Document Binarization Framework , 2010, 2010 20th International Conference on Pattern Recognition.

[17]  Friedrich M. Wahl,et al.  Document Analysis System , 1982, IBM J. Res. Dev..

[18]  Pierre Soille,et al.  Morphological Image Analysis: Principles and Applications , 2003 .

[19]  Shijian Lu,et al.  Combination of Document Image Binarization Techniques , 2011, 2011 International Conference on Document Analysis and Recognition.

[20]  Volker Märgner,et al.  New Binarization Approach Based on Text Block Extraction , 2011, 2011 International Conference on Document Analysis and Recognition.