Arabic Handwriting Recognition

This thesis explores several techniques for Arabic handwriting recognition. Previous work in the field is reviewed, and various techniques are then explored in the context of classifying town names from the IFN/ENIT database. A baseline-finding algorithm using Principal Components Analysis is implemented, and the effect on performance of reducing the influence of certain word features is demonstrated. Several simple methods of town name classification are investigated, including a scheme using Tangent Features. These features model the variation in the training examples to improve generalisation, and the scheme achieves 94% accuracy on a small 10-class lexicon. Moment invariants are considered as features for classification, but fail to surpass the performance of simpler methods. An approach in which town names are split into parts and traced to recover temporal information is proposed, and is found to have encouraging performance and several useful properties.
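The abstract does not give the details of the baseline-finding algorithm, so the following is only a minimal sketch of the general idea of applying Principal Components Analysis to the ink pixels of a word image. It assumes a binary image stored as a NumPy array with non-zero values for ink; the function name `estimate_baseline` and the choice to return a centroid and an angle are illustrative assumptions, not the thesis's implementation.

```python
import numpy as np

def estimate_baseline(image):
    """Fit a line through the ink pixels of a binary word image using PCA.

    Hypothetical helper: returns ((x, y) centroid, angle in degrees).
    The line through the centroid along the first principal axis is
    taken as a rough baseline estimate.
    """
    ys, xs = np.nonzero(image)                 # row (y) and column (x) of ink pixels
    points = np.column_stack((xs, ys)).astype(float)
    centroid = points.mean(axis=0)
    centered = points - centroid

    # 2x2 covariance of the pixel coordinates; eigh returns eigenvalues
    # in ascending order, so the last eigenvector is the principal axis.
    cov = np.cov(centered, rowvar=False)
    _, eigvecs = np.linalg.eigh(cov)
    direction = eigvecs[:, -1]

    # Eigenvectors have arbitrary sign; make the axis point to the right.
    if direction[0] < 0:
        direction = -direction

    # Image rows grow downwards, so a positive angle means the word
    # slopes downwards from left to right on screen.
    angle = np.degrees(np.arctan2(direction[1], direction[0]))
    return centroid, angle

# Usage: a synthetic diagonal stroke stands in for a skewed word.
stroke = np.zeros((30, 30), dtype=np.uint8)
for i in range(30):
    stroke[i, i] = 1
centre, skew = estimate_baseline(stroke)
```

Because every ink pixel contributes equally in this sketch, long ascenders, descenders and diacritics pull the fitted line away from the true baseline. That is presumably why reducing the influence of certain word features matters, although the specific weighting used in the thesis is not stated here.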
