ARABIC ROOT BASED STEMMER

This paper presents a new (root-based) stemming algorithm for Arabic language. As other natural languages not all the words used in Arabic language has roots, some of these are borrowed from other languages, e.g. as the word " " television, so in this case the stemmer will fail to get the right root because these foreign words have no root. This algorithm is based on affix removal beside a knowledge from structural linguistics. The implementation and evaluation of this algorithm shows a noticeable improvement in the accuracy relative to previous algorithms.