Region Based Local Binarization Approach for Handwritten Ancient Documents

Due to the fact that historical handwritten documents present many degradations, pre-processing of such documents is considered as a big challenge. Most pre-processing methods and specifically binarization return better results when they are applied on printed documents. We present in this paper a binarization approach adaptive for handwritten historical documents based on extraction of regions-of-interest. During our tests several images datasets are used, the benchmarking datasets for binarization DIBCO 2009 and H-DIBCO 2010 (15 images) as well as complete handwritten documents from the IAM historical database (about 60 images). The evaluation of the proposed binarization method is based on several evaluation metrics for binarization. The results show that the proposed method fit with handwritten historical documents (FM about 88%) for images of the binarization competitions.

[1]  Hsi-Jian Lee,et al.  Efficiently extracting and classifying objects for analyzing color documents , 2009, Machine Vision and Applications.

[2]  Ioannis Pratikakis,et al.  ICDAR 2011 Document Image Binarization Contest (DIBCO 2011) , 2011, 2011 International Conference on Document Analysis and Recognition.

[3]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[4]  Ioannis Pratikakis,et al.  H-DIBCO 2010 - Handwritten Document Image Binarization Competition , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[5]  Ioannis Pratikakis,et al.  An Objective Evaluation Methodology for Document Image Binarization Techniques , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[6]  Volker Märgner,et al.  A design of a preprocessing framework for large database of historical documents , 2011, HIP '11.

[7]  Volker Märgner,et al.  New Binarization Approach Based on Text Block Extraction , 2011, 2011 International Conference on Document Analysis and Recognition.

[8]  Shijian Lu,et al.  Document image binarization using background estimation and stroke edges , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[9]  Elisa H. Barney Smith,et al.  An analysis of binarization ground truthing , 2010, DAS '10.

[10]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[11]  Hamid Amiri,et al.  New method for the selection of binarization parameters based on noise features of historical documents , 2011, MOCR_AND '11.

[12]  Majid Ahmadi,et al.  A binarization method for scanned documents based on hidden Markov model , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[13]  Shijian Lu,et al.  Binarization of historical document images using the local maximum and minimum , 2010, DAS '10.

[14]  Ehsanollah Kabir,et al.  Binarization of degraded document image based on feature space partitioning and classification , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[15]  Rafael Dueire Lins,et al.  ICFHR 2010 Contest: Quantitative Evaluation of Binarization Algorithms , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[16]  Ioannis Pratikakis,et al.  DIBCO 2009: document image binarization contest , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[17]  Alicia Fornés,et al.  Transcription alignment of Latin manuscripts using hidden Markov models , 2011, HIP '11.

[18]  Øivind Due Trier,et al.  Evaluation of Binarization Methods for Document Images , 1995, IEEE Trans. Pattern Anal. Mach. Intell..