Evaluating the retrieval performance of Addaall stemmer for Arabic news of al-Jazeera

This paper examines the performance of Addaall stemmers for Arabic news of al-Jazeera. The morphological variation of Arabic word is a factor that affects Arabic information retrieval performance. Many morphological stemmers were developed to improve Arabic information retrieval such as Buckwalter, Khoja stemmers. Addaall stemmer is among the recent developments pertaining to the enhancement of the effectiveness of Arabic IR. The objective of the paper is to provide an analysis of Addaall's stemmer performance for Arabic information. Furthermore, it sheds some lights on the effectiveness of Addaall in relation to other stemmers. The paper assumes that previous studies did not examine extensively the requirements and limitations of Addaall and its prospects for Arabic IR effectiveness. So far there is no standard stemmer for Arabic IR that can be used and generalized. Therefore, this research will examine and evaluate the retrieval performance of Addaall stemmer using root and light stemming searches. It will use Al-Jazeera news as a test document. The paper concludes that Addaall offers a horizon for further enhancing the effectiveness of Arabic IR.

[1]  Haidar Moukdad A comparison of root and stemming techniques for the retrieval of Arabic documents , 2001 .

[2]  Naglaa Thabet Stemming the Qur’an , 2004 .

[3]  Mamoun Hattab,et al.  Addaall Arabic Search Engine: Improving Search based on Combination of Morphological Analysis and Generation Considering Semantic Patterns , 2009 .

[4]  Martha W. Evens,et al.  Stemming methodologies over individual query words for an Arabic information retrieval system , 1999 .

[5]  Ophir Frieder,et al.  On arabic search: improving the retrieval effectiveness via a light stemming approach , 2002, CIKM '02.

[6]  Martha W. Evens,et al.  Stemming Methodologies Over Individual Query Words for an Arabic Information Retrieval System , 1999, J. Am. Soc. Inf. Sci..

[7]  Lisa Ballesteros,et al.  Improving stemming for Arabic information retrieval: light stemming and co-occurrence analysis , 2002, SIGIR '02.

[8]  Lisa Ballesteros,et al.  Light Stemming for Arabic Information Retrieval , 2007 .

[9]  Fredric C. Gey,et al.  Building an Arabic Stemmer for Information Retrieval , 2002, TREC.

[10]  Martha W. Evens,et al.  Comparing Words, Stems, and Roots as Index Terms in an Arabic Information Retrieval System , 1994, J. Am. Soc. Inf. Sci..

[11]  Andrew Large,et al.  Information Retrieval from Full-Text Arabic Databases: Can Search Engines Designed for English Do the Job? , 2001 .

[12]  Ibrahim A. Al-Kharashi Micro-AIRS: a microcomputer-based arabic information retrieval system comparing words, stems, and roots as index terms , 1992 .

[13]  Ibrahim Abu El-Khair Arabic information retrieval , 2007 .

[14]  Amna A. Al Kaabi,et al.  Arabic Light Stemmer : Anew Enhanced Approach , 2005 .