Word Sense Disambiguation using Skip Gram Model to Create a Historical Dictionary for Arabic

The evolution of the Arabic language from antiquity to the present days has given birth to several linguistic registers ascribed to the great periods of the history of the Arabic language. They can be classified as: Old Arabic, Classical Arabic and Modern Standard Arabic. In this work, we propose a method that aims to disambiguate words in Modern Standard Arabic. This method consists of measuring the semantic relation between the context of use of the ambiguous word and its sense definitions. Within the context of creating a historical dictionary for Arabic, and to disambiguate a word, we need to take into consideration the historical period in which the word appeared. This method disambiguates Arabic words takes into account that a word may have an old meaning but appears in a modern document.

[1]  Nadir Durrani,et al.  Farasa: A Fast and Furious Segmenter for Arabic , 2016, NAACL.

[2]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[3]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[4]  Mounir Zrigui,et al.  A Hybrid Approach for Arabic Word Sense Disambiguation , 2012, Int. J. Comput. Process. Orient. Lang..

[5]  El Habib Ben Lahmar,et al.  Word Sense Disambiguation Approach for Arabic Text , 2016 .

[6]  M. Zrigui,et al.  Word Sense disambiguation for Arabic language using the variants of the Lesk algorithm , 2011 .

[7]  Mounir Zrigui,et al.  A Semi-Supervised Method for Arabic Word Sense Disambiguation Using a Weighted Directed Graph , 2013, IJCNLP.

[8]  Mounir Zrigui,et al.  Combination of information retrieval methods with LESK algorithm for Arabic word sense disambiguation , 2011, Artificial Intelligence Review.

[9]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[10]  Didier Schwab,et al.  Semantic Similarity of Arabic Sentences with Word Embeddings , 2017, WANLP@EACL.

[11]  Abdelmonaime Lachkar,et al.  Word sense disambiguation for arabic text categorization , 2016, Int. Arab J. Inf. Technol..

[12]  Arafat Awajan,et al.  Arabic Word Sense Disambiguation - Survey , 2017, 2017 International Conference on New Trends in Computing Sciences (ICTCS).

[13]  Motaz Saad,et al.  OSAC: Open Source Arabic Corpora , 2010 .

[14]  Didier Schwab,et al.  Ant colony algorithm for Arabic word sense disambiguation through English lexical information , 2015, Int. J. Metadata Semant. Ontologies.

[15]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[16]  Mounir Zrigui,et al.  Lexical Disambiguation of Arabic Language: An Experimental Study , 2012, Polytech. Open Libr. Int. Bull. Inf. Technol. Sci..

[17]  Arafat Awajan,et al.  Arabic Word Sense Disambiguation Using Wikipedia , 2016 .

[18]  Geoffrey E. Hinton,et al.  A Scalable Hierarchical Distributed Language Model , 2008, NIPS.

[19]  Mohamed El Bachir Menai,et al.  Word sense disambiguation using evolutionary algorithms - Application to Arabic language , 2014, Comput. Hum. Behav..