HACDB: Handwritten Arabic characters database for automatic character recognition

Segmentation-based automatic off-line recognition of Arabic handwriting still faces major challenges. A database covering all shapes of handwritten Arabic characters is required to facilitate the recognition process. This paper introduces a new database of handwritten Arabic characters (HACDB), designed to cover all shapes of Arabic characters, including overlapping ones. It contains 6,600 character shapes written by 50 writers. The database can be used to train and test systems that recognize words after segmentation. It also makes it possible to compare different approaches and evaluate their accuracy on a common basis.
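
To illustrate how a database of this size might be divided for training and testing, the following is a minimal sketch of a writer-disjoint split, so that no writer appears in both sets. Only the totals (6,600 shapes, 50 writers) come from the abstract. The equal share of 132 shapes per writer, the `(writer_id, shape_id)` index, the 20% test fraction, and the function names are assumptions made for illustration; they do not describe the actual HACDB file layout or the authors' protocol.

```python
import random

NUM_WRITERS = 50    # stated in the abstract
NUM_SHAPES = 6600   # stated in the abstract
# Assumption: every writer contributed the same number of shapes.
SHAPES_PER_WRITER = NUM_SHAPES // NUM_WRITERS  # 132


def make_index(num_writers=NUM_WRITERS, shapes_per_writer=SHAPES_PER_WRITER):
    """Build a hypothetical (writer_id, shape_id) index of all samples."""
    return [(w, s) for w in range(num_writers) for s in range(shapes_per_writer)]


def writer_disjoint_split(index, test_fraction=0.2, seed=0):
    """Split samples so that train and test sets share no writer."""
    writers = sorted({w for w, _ in index})
    rng = random.Random(seed)
    rng.shuffle(writers)
    n_test = max(1, round(len(writers) * test_fraction))
    test_writers = set(writers[:n_test])
    train = [item for item in index if item[0] not in test_writers]
    test = [item for item in index if item[0] in test_writers]
    return train, test


index = make_index()
train, test = writer_disjoint_split(index)
print(len(index), len(train), len(test))
```

Splitting by writer rather than by sample keeps a recognizer from being scored on handwriting styles it has already seen, which is one way to make comparisons between approaches more meaningful.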
