A Comparative Study on Arabic Stemmers

is considered as a pre-processing step in many applications: text mining, information retrieval, machine translation etc. The Arabic language has many special cases or properties that affect stemming or any automatic method, it depends on both inflectional and derivational morphology to produce the various forms of the language words. Many researchers have proposed algorithms to solve the problems of stemming. This paper aims to make a comparison study among the existing Arabic stemmers, the comparison study is based on the methodologies, the usage, main idea, algorithm, the affixes, limitations, output, and the stemmers' sensitivity for both diacritics and context.

[1]  Sahar Alwadei,et al.  Building an Arabic Words Generator , 2015 .

[2]  Z. Kchaou,et al.  Arabic stemming with two dictionaries , 2008, 2008 International Conference on Innovations in Information Technology.

[3]  A. BOUDLAL,et al.  A Morphosyntactic analysis system for Arabic texts , 2010 .

[4]  Kazem Taghva,et al.  Arabic stemming without a root dictionary , 2005, International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II.

[5]  Mohamed S. Abdel-Wahab,et al.  An Intelligent System For Arabic Text Categorization , 2006 .

[6]  Claire Fautsch,et al.  Algorithmic stemmers or morphological analysis? An evaluation , 2009, J. Assoc. Inf. Sci. Technol..

[7]  Lisa Ballesteros,et al.  Light Stemming for Arabic Information Retrieval , 2007 .

[8]  Ibrahim A. Al-Kharashi,et al.  Arabic morphological analysis techniques: A comprehensive survey , 2004, J. Assoc. Inf. Sci. Technol..

[9]  Fredric C. Gey,et al.  Building an Arabic Stemmer for Information Retrieval , 2002, TREC.

[10]  Younes Jaafar,et al.  Benchmark of Arabic morphological analyzers challenges and solutions , 2014, 2014 9th International Conference on Intelligent Systems: Theories and Applications (SITA-14).

[11]  Jian-Yun Nie,et al.  Effective Stemming for Arabic Information Retrieval , 2006, BCS.

[12]  Abdelmajid Ben Hamadou,et al.  The MORPH2 new version: A robust morphological analyzer for Arabic texts , 2010 .

[13]  Leah S. Larkey,et al.  Structured queries, language modeling, and relevance modeling in cross-language information retrieval , 2005, Inf. Process. Manag..

[14]  Amna A. Al Kaabi,et al.  Arabic Light Stemmer : Anew Enhanced Approach , 2005 .

[15]  Mohammed A. Otair COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS , 2013 .

[16]  Navjot Kaur,et al.  Improving Performance of MANETs using Multi-Criteria Multipath Routing Protocol , 2015 .

[17]  Jessica Lin,et al.  A novel Arabic lemmatization algorithm , 2008, AND '08.

[18]  Sameh H. Ghwanmeh,et al.  Enhanced Algorithm for Extracting the Root of Arabic Words , 2009, 2009 Sixth International Conference on Computer Graphics, Imaging and Visualization.

[19]  Bassam H. Hammo Towards enhancing retrieval effectiveness of search engines for diacritisized Arabic documents , 2008, Information Retrieval.

[20]  Naglaa Thabet Stemming the Qur’an , 2004 .

[21]  May Y. Al-Nashashibi,et al.  Stemming techniques for Arabic words: A comparative study , 2010, 2010 2nd International Conference on Computer Technology and Development.

[22]  Ophir Frieder,et al.  On arabic search: improving the retrieval effectiveness via a light stemming approach , 2002, CIKM '02.

[23]  Claire Fautsch,et al.  Algorithmic stemmers or morphological analysisq An evaluation , 2009 .

[24]  Ahmed A. Rafea,et al.  An accuracy-enhanced light stemmer for arabic text , 2011, TSLP.

[25]  Lisa Ballesteros,et al.  Improving stemming for Arabic information retrieval: light stemming and co-occurrence analysis , 2002, SIGIR '02.

[26]  Kareem Darwish,et al.  Stemming techniques of Arabic Language: Comparative Study from the Information Retrieval Perspective , 2009 .

[27]  Kareem Darwish,et al.  Building a Shallow Arabic Morphological Analyser in One Day , 2002, SEMITIC@ACL.

[28]  Riyad Al-Shalabi,et al.  Building an effective rule-based light stemmer for Arabic language to inprove search effectiveness , 2008, 2008 International Conference on Innovations in Information Technology.