Stroke Sequence Identification in Handwritten Urdu Alphabets Using Convolutional Neural Networks

There is a lack of attention to early childhood handwriting in developing countries like Pakistan, where a single teacher teaches 50 to 100 students at a time. Children need a lot of attention from the teacher during their early stages (4-8 years), especially when they start writing the alphabet to assure that they adopted a correct stroke sequence. It is almost impossible for a teacher to pay attention to all his/her students to observe their stroke sequences. Due to a lack of teacher attention, the students may write the alphabet with correct or incorrect stroke sequences. If a student follows an incorrect stroke sequence, it badly affects the writing speed and the beauty of his/her cursive handwriting. As the research on this topic is concerned, so far, it has been conducted mainly in Chinese and Japanese languages to find out the correct stroke sequence. Arabic has received far fewer studies and Urdu is the most neglected till now. In this paper, we have proposed Urdu Handwriting E-Tutor (UHET). UHET is a Computer Vision based method that continuously monitors the handwriting activity of a child and successfully points out whether the stroke sequence (while writing an Urdu alphabet) is correct or not. To conduct this study, we created a new dataset that consists of images and videos of five Urdu alphabets. UHET exploits Convolutional Neural Network to train the model for predicting the alphabet (written by the child) and its stroke sequence. Results show that our UHET performs well achieving 80% accuracy on the average on the given data set.

[1]  Morteza Daneshmand,et al.  Persian handwritten digit, character and word recognition using deep learning , 2020, International Journal on Document Analysis and Recognition (IJDAR).

[2]  Mehdi Bonyani,et al.  Persian Handwritten Digit, Character, and Words Recognition by Using Deep Learning Methods , 2020, ArXiv.

[3]  R. Ezhilarasi,et al.  Optical Character Recognition for English Handwritten Text Using Recurrent Neural Network , 2020, 2020 International Conference on System, Computation, Automation and Networking (ICSCAN).

[4]  Najwa Altwaijry,et al.  Arabic handwriting recognition system using convolutional neural network , 2020, Neural Computing and Applications.

[5]  Naveen Kumar Gondhi,et al.  Machine Learning approaches for Nastaliq style Urdu handwritten recognition: A survey , 2020, 2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS).

[6]  Ahsan Ullah,et al.  Pioneer dataset and automatic recognition of Urdu handwritten characters using a deep autoencoder and convolutional neural network , 2019, SN Applied Sciences.

[7]  Jun Guo,et al.  A New Method for Stroke Order Recognition of Handwritten Chinese Characters , 2018, ICVIP.

[8]  Muhammad Imran Razzak,et al.  Urdu Nasta’liq text recognition system based on multi-dimensional recurrent neural network and statistical features , 2017, Neural Computing and Applications.

[9]  Muhammad Imran Razzak,et al.  Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks , 2016, SpringerPlus.

[10]  Fareeha Anwar,et al.  Urdu Writing Rules for Online Input in PDA’s , 2015 .

[11]  Alex Graves,et al.  Grid Long Short-Term Memory , 2015, ICLR.

[12]  Sebastiano Impedovo,et al.  A novel prototype generation technique for handwriting digit recognition , 2014, Pattern Recognit..

[13]  Saad Bin Ahmed,et al.  Offline Printed Urdu Nastaleeq Script Recognition with Bidirectional LSTM Networks , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[14]  Faisal Shafait,et al.  A segmentation-free approach to Arabic and Urdu OCR , 2013, Electronic Imaging.

[15]  Dzulkifli Mohamad,et al.  Finding Critical Points of Handwritten Persian/Arabic Character , 2012 .

[16]  Li Yujian,et al.  A Normalized Levenshtein Distance Metric , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[18]  C. Chui Wavelets: A Tutorial in Theory and Applications , 1992 .

[19]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[20]  Talaat S. El-Sheikh,et al.  Real-time arabic handwritten character recognition , 1989, Pattern Recognit..

[21]  George D. C. Cavalcanti,et al.  Handwritten Digit Recognition Using Multiple Feature Extraction Techniques and Classifier Ensemble , 2010 .

[22]  Tracy Hammond,et al.  Urdu Qaeda: Recognition System for Isolated Urdu Characters , 2009 .