Arabic calligraphy recognition based on binarization methods and degraded images

Optical Font Recognition is one of the main challenges in this time. The available methods of optical font recognition are deal with the recent documents and fonts types. However, there are neglected in dealing with the historical and regarded documents. Moreover, they have neglected languages that are not belong into Asian or Latin. Regarding to those types of documents, we proposed a new framework of optical font recognition for Arabic calligraphy. We enhance binarization method based on previous works. By introducing that, we achieve better quality images at the preprocessing stage. Then we generate text block before passing mailing to post-processing stages. Then, we extract the features based on edge direction matrixes. In the classification stage, we apply backpropagation neural network to identify the font type of the calligraphy. We observe that our proposal method achieve better performance in both preprocessing and post processing.

[1]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[2]  Alessandro Sperduti,et al.  Optical font recognition for multi-font OCR and document processing , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[3]  Mohamed A. Ismail,et al.  Techniques for language identification for hybrid Arabic-English document images , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[4]  David S. Doermann,et al.  Font identification using the grating cell texture operator , 2005, IS&T/SPIE Electronic Imaging.

[5]  Tomassini Marco,et al.  A Survey of Artificial Neural Network-Based Modeling in Agroecology , 2008, Soft Computing Applications in Industry.

[6]  Amandeep Kaur,et al.  Hough transform based fast skew detection and accurate skew correction methods , 2008, Pattern Recognit..

[7]  Nicole Vincent,et al.  Comparison of Niblack inspired binarization methods for ancient documents , 2009, Electronic Imaging.

[8]  Khairuddin Omar,et al.  A Statistical Global Feature Extraction Method for Optical Font Recognition , 2011, ACIIDS.

[9]  Khairuddin Omar,et al.  Generating an Arabic Calligraphy Text Blocks for Global Texture Analysis , 2011 .

[10]  Khairuddin Omar,et al.  Adaptive Thresholding Methods for Documents Image Binarization , 2011, MCPR.