Scale and rotation invariant OCR for Pashto cursive script using MDLSTM network

Optical Character Recognition (OCR) of cursive scripts like Pashto and Urdu is difficult due the presence of complex ligatures and connected writing styles. In this paper, we evaluate and compare different approaches for the recognition of such complex ligatures. The approaches include Hidden Markov Model (HMM), Long Short Term Memory (LSTM) network and Scale Invariant Feature Transform (SIFT). Current state of the art in cursive script assumes constant scale without any rotation, while real world data contain rotation and scale variations. This research aims to evaluate the performance of sequence classifiers like HMM and LSTM and compare their performance with descriptor based classifier like SIFT. In addition, we also assess the performance of these methods against the scale and rotation variations in cursive script ligatures. Moreover, we introduce a database of 480,000 images containing 1000 unique ligatures or sub-words of Pashto. In this database, each ligature has 40 scale and 12 rotation variations. The evaluation results show a significantly improved performance of LSTM over HMM and traditional feature extraction technique such as SIFT.

[1]  Saad Bin Ahmed,et al.  Offline Printed Urdu Nastaleeq Script Recognition with Bidirectional LSTM Networks , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[2]  Premkumar Natarajan,et al.  The BBN Byblos Pashto OCR system , 2004, HDP '04.

[3]  U. Pal,et al.  Recognition of printed Urdu script , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[4]  Marc-Peter Schambach,et al.  Low resolution Arabic recognition with multidimensional recurrent neural networks , 2013, MOCR '13.

[5]  Inam Shamsher,et al.  Urdu compound Character Recognition using feed forward neural networks , 2009, 2009 2nd IEEE International Conference on Computer Science and Information Technology.

[6]  Nadir Durrani,et al.  Urdu Word Segmentation , 2010, NAACL.

[7]  Riaz Ahmad,et al.  Scale and rotation invariant recognition of cursive Pashto script using SIFT features , 2010, 2010 6th International Conference on Emerging Technologies (ICET).

[8]  Samee Ullah Khan,et al.  The optical character recognition of Urdu-like cursive scripts , 2014, Pattern Recognit..

[9]  Farooq Ahmed,et al.  Shape analysis of Pashto script and creation of image database for OCR , 2009, 2009 International Conference on Emerging Technologies.

[10]  S. A. Husain A multi-tier holistic approach for Urdu Nastaliq recognition , 2002 .

[11]  Faisal Shafait,et al.  A segmentation-free approach to Arabic and Urdu OCR , 2013, Electronic Imaging.

[12]  Sarmad Hussain,et al.  Segmentation Free Nastalique Urdu OCR , 2010 .

[13]  Thomas M. Breuel,et al.  An Evaluation of HMM-Based Techniques for the Recognition of Screen Rendered Text , 2011, 2011 International Conference on Document Analysis and Recognition.

[14]  Fareeha Anwar,et al.  Relative Magnitude of Gaussian Curvature Using Neural Network and Object Rotation of Two Degrees of Freedom , 2007, MVA.