A Possibilistic Approach for Arabic Domain Terminology Extraction and Translation

This paper proposes a hybrid possibilistic approach for bilingual terminology extraction using possibility and necessity measures. On the one hand, we extract domain-relevant terms from the source language, and on the other hand, we build a co-occurrence-based translation graph, which is mined to translate terms in the target language. We compare our approach with different state-of-the art approaches. Experimental results show that the possibilistic approach reaches better results in terms of Recall, Precision and Mean Average Precision (MAP). The differences between the compared approaches show that our contribution is significant in terms of p-value.

[1]  Emmanuel Morin,et al.  Efficient Data Selection for Bilingual Terminology Extraction from Comparable Corpora , 2016, COLING.

[2]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[3]  Didier Dubois,et al.  Possibility Theory and Its Applications: Where Do We Stand? , 2015, Handbook of Computational Intelligence.

[4]  Véronique Hoste,et al.  Language-Independent Bilingual Terminology Extraction from a Multilingual Parallel Corpus , 2009, EACL.

[5]  Wiem Lahbib,et al.  Arabic-English Domain Terminology Extraction from Aligned Corpora , 2014, OTM Conferences.

[6]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[7]  Eric P. Xing,et al.  HM-BiTAM: Bilingual Topic Exploration, Word Alignment, and Translation , 2007, NIPS.

[8]  Cherif Chiraz Latiri,et al.  Bilingual Lexicon Extraction from Comparable Corpora Based on Closed Concepts Mining , 2017, PAKDD.

[9]  Najmus Saher Shah Review of Indexing Techniques Applied in Information Retrieval , 2016 .

[10]  Pierre Zweigenbaum,et al.  Building Specialized Bilingual Lexicons Using Large Scale Background Knowledge , 2013, EMNLP.

[11]  Andy Way,et al.  DCU Terminology Translation System for Medical Query Subtask at WMT14 , 2014, WMT@ACL.

[12]  Narjès Bellamine Ben Saoud,et al.  A hybrid possibilistic approach for Arabic full morphological disambiguation , 2015, Data Knowl. Eng..

[13]  Marie-Francine Moens,et al.  Bilingual Distributed Word Representations from Document-Aligned Comparable Data , 2015, J. Artif. Intell. Res..

[14]  Narjès Bellamine Ben Saoud,et al.  Experimenting a discriminative possibilistic classifier with reweighting model for Arabic morphological disambiguation , 2015, Comput. Speech Lang..

[15]  L. Zadeh Fuzzy sets as a basis for a theory of possibility , 1999 .

[16]  Kamel Smaïli,et al.  A New Language Model Based on Possibility Theory , 2016, CICLing.

[17]  Wiem Lahbib,et al.  Arabic Terminology Extraction and Enrichment Based on Domain-Specific Text Mining , 2015, 2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI).

[18]  M Alguliyev Rasim,et al.  A NEW SIMILARITY MEASURE AND MATHEMATICAL MODEL FOR TEXT SUMMARIZATION (eng.) , 2015 .