Synchronous Multi-Stream Hidden Markov Model for offline Arabic handwriting recognition without explicit segmentation

Arabic handwriting recognition is still a challenging task due especially to the unlimited variation in human handwriting, the large variety of Arabic character shapes, the presence of ligature between characters and overlapping of the components. In this paper, we propose an offline Arabic-handwritten recognition system for Tunisian city names. A review of the literature shows that the Hidden Markov Model (HMM) adopting the sliding window approach are the mainly used models, which gives good results when a relevant feature-extraction process is performed. However, these models are utilized especially to model one dimensional signal. Consequently, to model bi-dimensional signals or multiple features, a solution based on combining multi-classifiers and then a post-treatment selecting the best hypothesis is applied. The problem considered in this case consists in searching the best way to combine the contribution of these classifiers. In this study, we put forward an extension of the HMM, which can surmount this problem. Our proposed system is based on a synchronous multi-stream HMM which has the advantage of efficiently modelling the interaction between multiple features. These features are composed of a combination of statistical and structural ones, which are extracted over the columns and rows using a sliding window approach. In fact, two word models are implemented based on the holistic and analytical approaches without any explicit segmentation. In the first approach, all the classes share the same architecture nevertheless, the parameters are different. In the second approach, each class has its own model by concatenating their components models. The results carried out on the IFN/ENIT database show that the analytical approach performs better than the holistic one and that the data fusion model is more efficient than the state fusion model.

[1]  Rosaria Rossini,et al.  A Bayesian approach for initialization of weights in backpropagation neural net with application to character recognition , 2016, Neurocomputing.

[2]  Stephen J. Maybank,et al.  Activity recognition using a supervised non-parametric hierarchical HMM , 2016, Neurocomputing.

[3]  Ahmed Lawgali A Survey on Arabic Character Recognition , 2015 .

[4]  Sabri A. Mahmoud,et al.  Arabic handwriting recognition using structural and syntactic pattern attributes , 2013, Pattern Recognit..

[5]  Ingrid Bouwer Utne,et al.  Use of dynamic Bayesian networks for life extension assessment of ageing systems , 2015, Reliab. Eng. Syst. Saf..

[6]  Hamid Amiri,et al.  Arabic Handwritten Words Recognition Based on a Planar Hidden Markov Model , 2005, Int. Arab J. Inf. Technol..

[7]  Najoua Essoukri Ben Amara,et al.  Arabic Handwriting Recognition Based on Synchronous Multi-stream HMM Without Explicit Segmentation , 2015, HAIS.

[8]  Erik Cambria,et al.  Fusing audio, visual and textual clues for sentiment analysis from multimodal content , 2016, Neurocomputing.

[9]  Imran Siddiqi,et al.  Segmentation techniques for recognition of Arabic-like scripts: A comprehensive survey , 2015, Education and Information Technologies.

[10]  Denis Deratani Mauá,et al.  Hidden Markov models with set-valued parameters , 2016, Neurocomputing.

[11]  Jürgen Schmidhuber,et al.  Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[12]  Imran Siddiqi,et al.  Offline cursive Urdu-Nastaliq script recognition using multidimensional recurrent neural networks , 2016, Neurocomputing.

[13]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[14]  Edwin Villanueva,et al.  Efficient methods for learning Bayesian network super-structures , 2014, Neurocomputing.

[15]  Abdelmajid Ben Hamadou,et al.  Off-line handwritten word recognition using multi-stream hidden Markov models , 2010, Pattern Recognit. Lett..

[16]  Chafic Mokbel,et al.  Combining Slanted-Frame Classifiers for Improved HMM-Based Arabic Handwriting Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Jianmin Jiang,et al.  Multi-class class classification of unconstrained handwritten Arabic words using machine learning approaches , 2009 .

[18]  Xue Li,et al.  An Effective Approach to Offline Arabic Handwriting Recognition , 2013, Comput. Electr. Eng..

[19]  Alfons Juan-Císcar,et al.  Handwriting word recognition using windowed Bernoulli HMMs , 2014, Pattern Recognit. Lett..

[20]  Qiang Guo,et al.  Convolutional feature learning and Hybrid CNN-HMM for scene number recognition , 2016, Neurocomputing.

[21]  Gheith A. Abandah,et al.  Recognizing handwritten Arabic script through efficient skeleton-based grapheme segmentation algorithm , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[22]  Zaher Al Aghbari,et al.  Automatic Segmentation based Recognition of Handwritten Arabic Words , 2011 .

[23]  Mohamed Ali Mahjoub,et al.  Multiple models of Bayesian networks applied to offline recognition of Arabic handwritten city names , 2013, ArXiv.

[24]  Laurence Likforman-Sulem,et al.  Recognition of degraded characters using dynamic Bayesian networks , 2008, Pattern Recognit..

[25]  Patrice Aknin,et al.  A dynamic Bayesian network to represent discrete duration models , 2010, Neurocomputing.

[26]  Yasser M. Alginahi,et al.  A survey on Arabic character segmentation , 2012, International Journal on Document Analysis and Recognition (IJDAR).

[27]  Georg Heigold,et al.  Confidence-Based Discriminative Training for Model Adaptation in Offline Arabic Handwriting Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[28]  Najoua Essoukri Ben Amara,et al.  Dynamic hierarchical Bayesian network for Arabic handwritten word recognition , 2013, Fourth International Conference on Information and Communication Technology and Accessibility (ICTA).

[29]  H. H. Avilés-Arriaga,et al.  A Comparison of Dynamic Naive Bayesian Classifiers and Hidden , 2011 .

[30]  Chao Li,et al.  Pattern recognition approach to identify loose particle material based on modified MFCC and HMMs , 2015, Neurocomputing.

[31]  Mokhtar Sellami,et al.  Semi-continuous HMMs with explicit state duration for unconstrained Arabic word modeling and recognition , 2008, Pattern Recognit. Lett..

[32]  Adel M. Alimi,et al.  2009 10th International Conference on Document Analysis and Recognition Combining Multiple HMMs Using On-line and Off-line Features for Off-line Arabic Handwriting Recognition , 2022 .

[33]  Friedhelm Schwenker,et al.  Fusion paradigms in cognitive technical systems for human-computer interaction , 2015, Neurocomputing.

[34]  Paul D. Gader,et al.  Multi-stream continuous hidden Markov models with application to landmine detection , 2013, EURASIP J. Adv. Signal Process..

[35]  Marc-Peter Schambach Model length adaptation of an HMM based cursive word recognition system , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[36]  Haikal El Abed,et al.  Comparison of Two Different Feature Sets for Offline Recognition of Handwritten Arabic Words , 2006 .

[37]  Afif Masmoudi,et al.  Discrete exponential Bayesian networks: Definition, learning and application for density estimation , 2014, Neurocomputing.

[38]  Za'er Salim Abo-Hammour,et al.  Numerical solution of systems of second-order boundary value problems using continuous genetic algorithm , 2014, Inf. Sci..