An Assessment of Arabic Handwriting Recognition Technology

Automated methods for recognizing Arabic script are at an early stage compared with their counterparts for Latin and Chinese scripts. This paper assesses the technology for Arabic handwriting recognition on the basis of the published literature. It introduces the Arabic script and then describes algorithms for the processes involved: segmentation, feature extraction, classification, and search. Existing corpora for Arabic are described, together with a design for corpus collection. The paper concludes by identifying technology gaps and providing a bibliography of the recent literature on Arabic recognition.
