Feature Design for Offline Arabic Handwriting Recognition: Handcrafted vs Automated?

In handwriting recognition, design of relevant feature is a very important but daunting task. On one hand, handcraft design of features is difficult, depending on expert knowledge and on heuristics. On the other hand, biologically inspired neural networks are able to learn automatically features from the input image, but requires a good underlying model. The goal of this paper is to evaluate the performance of automatically learned features compared to handcrafted features, as they provide a promising alternative to the difficult task of features handcrafting. In this work, the recognition model is based on the long short-term memory (LSTM) and connectionist temporal classification (CTC) neural networks. This model has been shown to outperform the well-known HMM model for various handwriting tasks, thanks to its reliable probabilistic modeling. In its multidimensional form, called MDLSTM, this network is able to automatically learn features from the input image. For evaluation, we compare the MDLSTM learned features and four state-of-the-art handcrafted features. The IFN/ENIT database has been used as benchmark for Arabic word recognition, where the results are promising.

[1]  Horst Bunke,et al.  Using a Statistical Language Model to Improve the Performance of an HMM-Based Cursive Handwriting Recognition System , 2001, Int. J. Pattern Recognit. Artif. Intell..

[2]  Bernhard Schölkopf,et al.  Kernel Principal Component Analysis , 1997, ICANN.

[3]  Gernot A. Fink,et al.  Bag-of-Features Representations for Offline Handwriting Recognition Applied to Arabic Script , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[4]  T. Munich,et al.  Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[5]  José A. Rodríguez-Serrano,et al.  Handwritten word-spotting using hidden Markov models and universal vocabularies , 2009, Pattern Recognit..

[6]  Chafic Mokbel,et al.  Combining Slanted-Frame Classifiers for Improved HMM-Based Arabic Handwriting Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  R. Manmatha,et al.  Features for word spotting in historical manuscripts , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[8]  M. Pechwitz,et al.  IFN/ENIT: database of handwritten arabic words , 2002 .

[9]  Adel M. Alimi,et al.  New features for complex Arabic fonts in cascading recognition system , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[10]  Yuzuru Tanaka,et al.  Slit Style HOG Feature for Document Image Word Spotting , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[11]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[12]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[13]  R. Manmatha,et al.  Word image matching using dynamic time warping , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[15]  Yoshua Bengio,et al.  Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[16]  Hany Ahmed,et al.  Off-Line Arabic Handwriting Recognition system based on concavity features and HMM classifier , 2012 .