Urdu ligature recognition techniques-A review

Optical Character Recognition plays indispensable role to recognize images from natural text images. As we know numerous languages are used all over the world but Urdu OCR is an emerging research area. Researchers on Urdu ligature recognition have received a great attention and continuously lot of research work is ongoing. Ligatures based Urdu language OCR is more preferable than characters. Hence, researchers are focusing more on ligature recognition in current time. Some reviews of different ligature recognition techniques for cursive Urdu OCR script are also provided in this paper. The cursive script structure has a variety of shapes and multiple writing styles produce many problems. Ligature recognition on script languages such as Latin, Chinese and Japanese is tough but it is more complex for Urdu script. Consequently, required advanced techniques for Urdu ligature recognition. The aim of this review is to explore several new opportunities and challenges for future research and compared the existing different techniques till date.

[1]  Ching Y. Suen,et al.  Holistic Urdu Handwritten Word Recognition Using Support Vector Machine , 2010, 2010 20th International Conference on Pattern Recognition.

[2]  Aejaz Farooq Ganai,et al.  Projection profile based ligature segmentation of Nastaleeq Urdu OCR , 2016, 2016 4th International Symposium on Computational and Business Intelligence (ISCBI).

[3]  Faisal Shafait,et al.  A segmentation-free approach to Arabic and Urdu OCR , 2013, Electronic Imaging.

[4]  Mahmood K. Pathan,et al.  Nastaliq optical character recognition , 2008, ACM-SE 46.

[5]  Gurpreet Singh Lehal,et al.  Recognition of Nastalique Urdu ligatures , 2013, MOCR '13.

[6]  Neeta Nain,et al.  Ligature Segmentation Approach for Urdu Handwritten Text Documents , 2014, ICTCS '14.

[7]  F. Shafait,et al.  Layout Analysis of Urdu Document Images , 2006, 2006 IEEE International Multitopic Conference.

[8]  M. Hussain,et al.  Online Urdu Ligature Recognition using Spatial Temporal Neural Processing , 2005, 2005 Pakistan Section Multitopic Conference.

[9]  Fareeha Anwar,et al.  Relative Magnitude of Gaussian Curvature Using Neural Network and Object Rotation of Two Degrees of Freedom , 2007, MVA.

[10]  Gurpreet Singh Lehal Choice of recognizable units for URDU OCR , 2012, DAR '12.

[11]  Xiaojie Wang,et al.  Line and Ligature Segmentation of Urdu Nastaleeq Text , 2017, IEEE Access.

[12]  Quintin Gee,et al.  Implementation Challenges for Nastaliq Character Recognition , 2008, IMTIC.

[13]  Samee Ullah Khan,et al.  The optical character recognition of Urdu-like cursive scripts , 2014, Pattern Recognit..

[14]  Gurpreet Singh Lehal Ligature Segmentation for Urdu OCR , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[15]  Gurpreet Singh Lehal,et al.  Offline Urdu OCR using Ligature based Segmentation for Nastaliq Script , 2015 .

[16]  Abdul Wahab,et al.  Optical character recognition system for Urdu , 2010, 2010 International Conference on Information and Emerging Technologies.

[17]  Shehzad Khalid,et al.  Recognition of Urdu ligatures - a holistic approach , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[18]  Sarmad Hussain,et al.  Adapting Tesseract for Complex Scripts: An Example for Urdu Nastalique , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[19]  Sarmad Hussain,et al.  Improving Nastalique specific pre-recognition process for Urdu OCR , 2009, 2009 IEEE 13th International Multitopic Conference.

[20]  Dharam Veer Sharma,et al.  State of the art in Nastaleeq Script Recognition , 2016 .

[21]  Sohail Abdul Sattar,et al.  Segmentation of Nastaliq script for OCR , 2009 .

[22]  Faisal Shafait,et al.  Search Space Reduction for Holistic Ligature Recognition in Urdu Nastalique Script , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[23]  Muhammad Sher,et al.  HMM and fuzzy logic: A hybrid approach for online Urdu script-based languages' character recognition , 2010, Knowl. Based Syst..

[24]  Imran Siddiqi,et al.  Optical Character Recognition System for Urdu Words in Nastaliq Font , 2016 .