A novel scheme for handwritten binarization method on sundanese palm leaf document images

Palm leaf, one type of ancient Sundanese writing medium, is presenting novel challenges in document image analysis. There are certain characteristics of Sundanese palm leaf document images offering some suitable challenges for handwritten binarization process such as non-uniform illumination, shadow, smear and random noise. We evaluated the performance of five binarization methods, in both common methods (Otsu, Niblack, Sauvola, Bernsen) and current methods (Howe) using the comparative experimental studies. After investigation, we proposed a new scheme for handwritten binarization process that adds special filtering to remove non-text regions. The results show that our proposed filter can improve the performance of all common binarization processes up to 85%. Niblack method which implements our proposed filter gave the highest performance on binarization process. In this research, the images dataset was obtained from sample images of Sundanese palm leaf manuscripts written in XV-XVII century. The evaluation criteria provided by ICDAR 2013 Document Image Binarization Contest (DIBCO) were used in these experiments.

[1]  Jing Lin,et al.  PixLabeler: User Interface for Pixel-Level Labeling of Elements in Document Images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[2]  Rafael Dueire Lins,et al.  A New Method for Shading Removal and Binarization of Documents Acquired with Portable Digital Cameras , 2010 .

[3]  Rupinder Kaur,et al.  Review of Robust Document Image BINARIZATION Technique for Degraded Document Images , 2015 .

[4]  Nicholas R. Howe,et al.  Document binarization with automatic parameter tuning , 2013, International Journal on Document Analysis and Recognition (IJDAR).

[5]  Abdelkrim Meziane,et al.  A new efficient binarization method: application to degraded historical document images , 2017, Signal Image Video Process..

[6]  Nikolaos Ntogas,et al.  A binarization algorithm for historical manuscripts , 2008, ICC 2008.

[7]  Made Windu Antara Kesiman,et al.  An initial study on the construction of ground truth binarized images of ancient palm leaf manuscripts , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[8]  Ioannis Pratikakis,et al.  ICDAR 2013 Document Image Binarization Contest (DIBCO 2013) , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[9]  Gerald Schaefer,et al.  Historic handwritten manuscript binarisation using whale optimisation , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[10]  Liansheng Wang,et al.  Broken and degraded document images binarization , 2017, Neurocomputing.

[11]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..