Arabic Domain Terminology Extraction: A Literature Review - (Short Paper)

Domain terminology extraction is an important step in many applications such as ontology building and information retrieval. Analyzing a corpus to automatically extract key terms is a difficult task, especially in the case of Arabic language. The complexity of spelling, morphology and semantics of Arabic makes natural language processing tasks quite difficult. In addition to the complexity of Arabic, the challenges related to domain terminology extraction are caused by the inherent difficulty in determining whether a word or a phrase represents or not a given text. All these problems have not restricted the multitude of Arabic terminology extraction approaches in the ontology building process. Therefore, this article presents a literature review in the field of Arabic terminology extraction focusing on the specificities of this language.

[1]  Josef van Genabith,et al.  Automatic Extraction of Arabic Multiword Expressions , 2010, MWE@COLING.

[2]  Daniel Jurafsky,et al.  Automatic Tagging of Arabic Text: From Raw Text to Base Phrase Chunks , 2004, NAACL.

[3]  Nizar Habash,et al.  Arabic Morphological Tagging, Diacritization, and Lemmatization Using Lexeme Models and Feature Ranking , 2008, ACL.

[4]  Eric Atwell,et al.  aConCorde: Towards an open-source, extendable concordancer for Arabic , 2006 .

[5]  Driss Aboutajdine,et al.  A Multi-Word Term Extraction Program for Arabic Language , 2008, LREC.

[6]  Adrien Bougouin État de l'art des méthodes d'extraction automatique de termes-clés , 2013 .

[7]  Abdelkader El Mahdaouy,et al.  A Study of Association Measures and their Combination for Arabic MWT Extraction , 2014, ArXiv.

[8]  William J. Black,et al.  Arabic part of speech tagging using Tranformation-Based Learning , 2009 .

[9]  Jan Hajiÿc,et al.  Feature-Based Tagger of Approximations of Functional Arabic Morphology , 2005 .

[10]  Narjès Bellamine Ben Saoud,et al.  Arabic Morphological Analysis and Disambiguation Using a Possibilistic Classifier , 2012, ICIC.

[11]  Mohammed Albared,et al.  Arabic term extraction using combined approach on Islamic document , 2013 .

[12]  Ibrahim Bounhas,et al.  ArabOnto: experimenting a new distributional approach for building Arabic ontological resources , 2011, Int. J. Metadata Semant. Ontologies.

[13]  Ibrahim Bounhas,et al.  Organizing Contextual Knowledge for Arabic Text Disambiguation and Terminology Extraction , 2011 .

[14]  A. Rey La terminologie : noms et notions , 1979 .

[15]  Zaia Alimazighi,et al.  Automatic Construction of Ontology from Arabic Texts , 2012, ICWIT.

[16]  Tarek El-Shishtawy,et al.  Arabic Keyphrase Extraction using Linguistic knowledge and Machine Learning Techniques , 2012, ArXiv.

[17]  Ahmed A. Rafea,et al.  KP-Miner: A keyphrase extraction system for English and Arabic documents , 2009, Inf. Syst..