Towards Author Recognition of Ancient Arabic Manuscripts Using Deep Learning: A Transfer Learning Approach

Due to the significance of ancient Arabic manuscripts and their role in enriching valuable historical information, this study aims to collect Arabic manuscripts in a dataset and classify its images to predict their authors. We accomplished this study through two main phases. First is the data collection phase. Arabic manuscripts gathered, including 52 Arabic Authors. Second is the models’ development phase to extract the visual features from the images and train the networks on them. We built four deep learning models named: MobileNetV1, DenseNet201, ResNet50, and VGG19. We configured the models by tuning their learning hyperparameters toward optimizing their recognition process. Afterward, we performed a comparative analysis between all the models to measure their performance. Eventually, we reached that minimizing the learning rate, combining “Sigmoid” with “Softmax”, and increasing the number of neurons on the final classification dense layer improved the networks’ recognition performance significantly since all utilized deep learning models reached above 95% validation accuracy.

[1]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Nils Schaetti UniNE at CLEF 2017: TF-IDF and Deep-Learning for Author Profiling , 2017, CLEF.

[3]  Zenghui Wang,et al.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review , 2017, Neural Computation.

[4]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[5]  Douglas Bagnall,et al.  Author Identification Using Multi-headed Recurrent Neural Networks , 2015, CLEF.

[6]  Mahmoud Al-Ayyoub,et al.  Deep learning for Arabic NLP: A survey , 2017, J. Comput. Sci..

[7]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Ting He,et al.  Deep Learning based Authorship Identification , 2017 .

[10]  Mohak Shah,et al.  Comparative Study of Deep Learning Software Frameworks , 2015, 1511.06435.

[11]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[12]  Nikos Komodakis,et al.  Wide Residual Networks , 2016, BMVC.

[13]  Zaher Al Aghbari,et al.  Word stretching for effective segmentation and classification of historical Arabic handwritten documents , 2009, 2009 Third International Conference on Research Challenges in Information Science.

[14]  Volker Märgner,et al.  On writer identification for Arabic historical manuscripts , 2017, International Journal on Document Analysis and Recognition (IJDAR).

[15]  Lamiaa A. Elrefaei,et al.  Arabic Manuscript Content Based Image Retrieval: A Comparison between SURF and BRISK Local Features , 2018 .

[16]  Lambert Schomaker,et al.  Deep Adaptive Learning for Writer Identification based on Single Handwritten Word Images , 2018, Pattern Recognit..

[17]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[18]  Ahmed Bouridane,et al.  KERTAS: dataset for automatic dating of ancient Arabic manuscripts , 2018, International Journal on Document Analysis and Recognition (IJDAR).

[19]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[20]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Alireza Alaei,et al.  A comparative study of different texture features for document image retrieval , 2019, Expert Syst. Appl..

[23]  Hong Liang,et al.  Text feature extraction based on deep learning: a review , 2017, EURASIP Journal on Wireless Communications and Networking.

[24]  W. Marsden I and J , 2012 .

[25]  Peter Goldsborough,et al.  A Tour of TensorFlow , 2016, ArXiv.

[26]  K. Sreedhar,et al.  Image Retrieval Techniques : A Survey , 2016 .

[27]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.