Deep learning based isolated Arabic scene character recognition

The technological advancement and sophistication in cameras and gadgets prompt researchers to have focus on image analysis and text understanding. The deep learning techniques demonstrated well to assess the potential for classifying text from natural scene images as reported in recent years. There are variety of deep learning approaches that prospects the detection and recognition of text, effectively from images. In this work, we presented Arabic scene text recognition using Convolutional Neural Networks (ConvNets) as a deep learning classifier. As the scene text data is slanted and skewed, thus to deal with maximum variations, we employ five orientations with respect to single occurrence of a character. The training is formulated by keeping filter size 3 × 3 and 5 × 5 with stride value as 1 and 2. During text classification phase, we trained network with distinct learning rates. Our approach reported encouraging results on recognition of Arabic characters from segmented Arabic scene images.

[1]  Mohammad Rahmati,et al.  A Hybrid Approach to Localize Farsi Text in Natural Scene Images , 2012, INNS-WC.

[2]  Xiaohang Ren,et al.  A novel scene text detection algorithm based on convolutional neural network , 2016, 2016 Visual Communications and Image Processing (VCIP).

[3]  Andreas Dengel,et al.  ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[4]  Chunheng Wang,et al.  Scene Text Recognition Using Part-Based Tree-Structured Character Detection , 2013, CVPR 2013.

[5]  Adel M. Alimi,et al.  Arabic characters recognition in natural scenes using sparse coding for feature representations , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[6]  Xiaodong Yang,et al.  Scene text recognition in multiple frames based on text tracking , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[7]  Muhammad Imran Razzak,et al.  UCOM offline dataset-an urdu handwritten dataset generation , 2017, Int. Arab J. Inf. Technol..

[8]  Lionel Prevost,et al.  2009 10th International Conference on Document Analysis and Recognition Text Detection and Localization in Complex Scene Images using Constrained AdaBoost Algorithm , 2022 .

[9]  Jiri Matas,et al.  Efficient Scene text localization and recognition with local character refinement , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[10]  Dimosthenis Karatzas,et al.  A Fine-Grained Approach to Scene Text Script Identification , 2016, 2016 12th IAPR Workshop on Document Analysis Systems (DAS).

[11]  Huizhong Chen,et al.  Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions , 2011, 2011 18th IEEE International Conference on Image Processing.

[12]  Qiang Guo,et al.  Memory Matters: Convolutional Recurrent Neural Network for Scene Text Recognition , 2016, ArXiv.

[13]  Jacqueline L. Feild,et al.  Improving Text Recognition in Images of Natural Scenes , 2014 .

[14]  Chunheng Wang,et al.  Scene Text Recognition Using Structure-Guided Character Detection and Linguistic Knowledge , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Kai Chen,et al.  Text Localization and Recognition in Complex Scenes Using Local Features , 2010, ACCV.

[16]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  Andrej Ikica Text detection methods in images of natural scenes , 2013 .

[18]  Adel M. Alimi,et al.  Arabic Text Recognition in Video Sequences , 2013, ArXiv.

[19]  Kai Chen,et al.  A CNN Based Scene Chinese Text Recognition Algorithm With Synthetic Data Engine , 2016, ArXiv.

[20]  Morteza Zahedi,et al.  Farsi/Arabic optical font recognition using SIFT features , 2011, WCIT.

[21]  Sabine Süsstrunk,et al.  Text Recognition in Natural Images using Multiclass Hough Forests , 2013, VISAPP.

[22]  Ramzi A. Haraty,et al.  Arabic Text Recognition , 2004, Int. Arab J. Inf. Technol..

[23]  Muhammad Imran Razzak,et al.  Evaluation of cursive and non-cursive scripts using recurrent neural networks , 2015, Neural Computing and Applications.

[24]  Yizhou Yu,et al.  Harvesting Discriminative Meta Objects with Deep CNN Features for Scene Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).