The Use of Hidden Markov Model in Natural ARABIC Language Processing: a survey

Abstract Hidden Markov Model is an empirical tool that can be used in many applications related to natural language processing. In this paper a comparative study was conducted between different applications in natural Arabic language processing that uses Hidden Markov Model such as morphological analysis, part of speech tagging, text classification, and name entity recognition. Comparative results showed that HMM can be used in different layers of natural language processing, but mainly in pre-processing phase such as: part of speech tagging, morphological analysis and syntactic structure; however in high level applications text classification their use is limited to certain number of researches.

[1]  Hongfei Lin,et al.  Twitter part-of-speech tagging using pre-classification Hidden Markov model , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[2]  Mohd Zakree Ahmad Nazri,et al.  Automatic Part of Speech Tagging for Arabic: An Experiment Using Bigram Hidden Markov Model , 2010, RSKT.

[3]  Dia AbuZeina,et al.  A Survey of Markov Chain Models in Linguistics Applications , 2016 .

[4]  Kevin Duh,et al.  POS Tagging of Dialectal Arabic: A Minimally Supervised Approach , 2005, SEMITIC@ACL.

[5]  Mohammad Hajjar,et al.  A System for Evaluation of Arabic Root Extraction Methods , 2010, 2010 Fifth International Conference on Internet and Web Applications and Services.

[6]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[7]  Yahya O. Mohamed Elhadj,et al.  Statistical Part-of-Speech Tagger for Traditional Arabic Texts , 2009 .

[8]  Nazlia Omar,et al.  Improving Arabic Part-of-Speech Tagging through Morphological Analysis , 2011, ACIIDS.

[9]  Ahmed H. Aliwy,et al.  Combining POS taggers in master-slaves technique for highly inflected languages as Arabic , 2015, 2015 International Conference on Cognitive Computing and Information Processing(CCIP).

[10]  Medhat Awadalla,et al.  DACS Dewey index-based Arabic Document Categorization System , 2012 .

[11]  Azzeddine Mazroui,et al.  A Markovian approach for arabic root extraction , 2011, Int. Arab J. Inf. Technol..

[12]  Slim Kanoun,et al.  A new-arabic-text classification system using a Hidden Markov Model , 2014, Int. J. Knowl. Based Intell. Eng. Syst..

[13]  Khalil Sima'an,et al.  Smoothing a Lexicon-based POS Tagger for Arabic and Hebrew , 2007, SEMITIC@ACL.

[14]  Medhat Awadalla,et al.  Hidden markov model based Arabic morphological analyzer , 2011 .

[15]  Kristina Toutanova,et al.  Unsupervised Bilingual Morpheme Segmentation and Alignment with Context-rich Hidden Semi-Markov Models , 2011, ACL.

[16]  Abdelmonaime Lachkar,et al.  Hybrid Part-Of-Speech Tagger for Non-Vocalized Arabic Text , 2013 .

[17]  Al-Falahi Ahmed,et al.  Authorship Attribution in Arabic Poetry Context UsingMarkov Chain classifier , 2015 .

[18]  Reda A. El-Khoribi,et al.  An Intelligent System Based on Statistical Learning For Searching in Arabic Text , 2006 .

[19]  Nazlia Omar,et al.  Probabilistic Arabic part of speech tagger with unknown words handling , 2016 .