Survey on Segmentation and Recognition of Handwritten Arabic Script

Handwriting recognition for Arabic script has attracted many researchers from both academia and industry, but their efforts have not yet produced satisfactory results. This paper surveys the segmentation and recognition of handwritten documents in Arabic script. It analyzes most of the previously published work and suggests some remedies. It summarizes the strategies used to build a powerful recognition system, and presents algorithms for text, word, and character segmentation and for recognition of Arabic documents. It also analyzes the recognition stage for Arabic script with respect to the segmentation strategy used.
