Gabor features for offline Arabic handwriting recognition

Many feature extraction approaches for off-line handwriting recognition (OHR) rely on accurate binarization of gray-level images. However, high-quality binarization of most real-world documents is extremely difficult due to varying characteristics of noises artifacts common in such documents. Unlike most of these features, Gabor features do not require binarization of the document images, and thus are likely to be more robust to noises in document images. To demonstrate the efficacy of our proposed Gabor features, we perform subword recognition for off-line Arabic handwritten images using Support Vector Machines (SVM). We also compare the recognition performance with other binarization based features which have been proven to be effective in capturing shape characteristics of handwritten Arabic subwords, such as GSC (a set of gradient, structure, and concavity features) and skeleton based Graph features. Our preliminary experimental results show that Gabor features outperform Graph features and are slightly better than GSC features for Arabic subword recognition. In addition, by combining Gabor and GSC features, we obtain a significant reduction in classification error rate over using GSC or Gabor features alone.

[1]  Yoshihiko Hamamoto,et al.  A gabor filter-based method for recognizing handwritten numerals , 1998, Pattern Recognit..

[2]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[3]  Saeed Mozaffari,et al.  Structural decomposition and statistical description of Farsi/Arabic handwritten numeric characters , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[4]  Chafic Mokbel,et al.  Arabic handwriting recognition using baseline dependant features and hidden Markov modeling , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[5]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[6]  Venu Govindaraju,et al.  Offline Arabic handwriting recognition: a survey , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Qiang Huo,et al.  Offline recognition of handwritten Chinese characters using Gabor features, CDHMM modeling and MCE training , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Moumita Ghosh,et al.  A novel approach for structural feature extraction: Contour vs. direction , 2004, Pattern Recognit. Lett..

[9]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[10]  Rohit Prasad,et al.  Stochastic Segment Modeling for Offline Handwriting Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[11]  Jinhai Cai,et al.  Handwriting Recognition , 2003 .

[12]  Jinhai Cai,et al.  Handwriting Recognition - Soft Computing and Probabilistic Approaches , 2003, Studies in Fuzziness and Soft Computing.

[13]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[14]  Geetha Srikantan,et al.  A multiple feature/resolution approach to handprinted digit and character recognition , 1996, Int. J. Imaging Syst. Technol..

[15]  Cheng-Lin Liu,et al.  Gabor feature extraction for character recognition: comparison with gradient feature , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[16]  Changsong Liu,et al.  Gabor filters-based feature extraction for character recognition , 2005, Pattern Recognit..

[17]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[18]  R. Manmatha,et al.  Classification models for historical manuscript recognition , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[19]  Øivind Due Trier,et al.  Evaluation of Binarization Methods for Document Images , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Haikal El Abed,et al.  Invariant Primitives for Handwritten Arabic Script: A Contrastive Study of Four Feature Sets , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[21]  Seungjin Choi,et al.  A Bayesian network classifier and hierarchical Gabor features for handwritten numeral recognition , 2006, Pattern Recognit. Lett..