A deep learning based prediction of arabic manuscripts handwriting style

With the increasing amounts of existing unorganized images on the internet today and the necessity to use them efficiently in various types of applications. There is a critical need to discover rigid models that can classify and predict images successfully and instantaneously. Therefore, this study aims to collect Arabic manuscripts images in a dataset and predict their handwriting styles using the most powerful and trending technologies. There are many types of Arabic handwriting styles, including Al-Reqaa, Al-Nask, Al-Thulth, Al-Kufi, Al-Hur, Al-Diwani, Al-Farsi, Al-Ejaza, Al-Maghrabi, AlTaqraa, etc. However, the study classified the collected dataset images according to the handwriting styles and focused on only six types of handwriting styles that existed in the collected Arabic manuscripts. To reach our goal, we applied the MobileNet pre-trained deep learning model on our classified dataset images to automatically capture and extract the features from them. Afterward, we evaluated the performance of the developed model by computing its recorded evaluation metrics. We reached that MobileNet convolutional neural network is a promising technology since it reached 0.9583 as the highest recorded accuracy and 0.9633 as the average F-score.

[1]  Hong Liang,et al.  Text feature extraction based on deep learning: a review , 2017, EURASIP Journal on Wireless Communications and Networking.

[2]  Mahmoud Al-Ayyoub,et al.  Deep learning for Arabic NLP: A survey , 2017, J. Comput. Sci..

[3]  Jiwen Lu,et al.  PCANet: A Simple Deep Learning Baseline for Image Classification? , 2014, IEEE Transactions on Image Processing.

[4]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[5]  A. Hussain,et al.  Deep Learning: Fundamentals, Theory and Applications , 2019, Cognitive Computation Trends.

[6]  Vipin Tyagi Research Issues for Next Generation Content-Based Image Retrieval , 2017 .

[7]  Stéphane Dupont,et al.  Towards Good Practices for Image Retrieval Based on CNN Features , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[8]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[9]  Siti Norul Huda Sheikh Abdullah,et al.  Arabic calligraphy recognition based on binarization methods and degraded images , 2011, 2011 International Conference on Pattern Analysis and Intelligence Robotics.

[10]  Yue Huang,et al.  Image Retrieval Algorithm Based on Convolutional Neural Network , 2016 .

[11]  Alireza Alaei,et al.  A comparative study of different texture features for document image retrieval , 2019, Expert Syst. Appl..

[12]  Wei Xu,et al.  CNN-RNN: A Unified Framework for Multi-label Image Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Rashad Al-Jawfi,et al.  Handwriting Arabic character recognition LeNet using neural network , 2009, Int. Arab J. Inf. Technol..

[14]  El Asnaoui Khalid,et al.  Content-Based Image Retrieval Using Convolutional Neural Networks , 2017 .

[15]  Jinyuan Jia,et al.  A learning framework for shape retrieval based on multilayer perceptrons , 2019, Pattern Recognit. Lett..

[16]  A. V. Reddy,et al.  A Survey on Applications and Performance of Deep Convolution Neural Network Architecture for Image Segmentation , 2018 .

[17]  Maha Al-Yahya Stylometric analysis of classical Arabic texts for genre detection , 2018, Electron. Libr..

[18]  Mohamed Ezz,et al.  Classification of Arabic Writing Styles in Ancient Arabic Manuscripts , 2019, International Journal of Advanced Computer Science and Applications.