Modified stemmer for a medical system — evaluated using predefined metrics

This study explores the importance of stemmers, its strength, and similarity metrics with newly defined stemmer called modified stemmer and also with predefined stemmers or stemming algorithms. This paper also evaluates the performance using metrics such as SWF (Stemmed Word's Factor), WCSF (Word's Correct Stemmed Factor), and WACF (Word's Average Conflation Factor). When compared with other stemmers, the precision of proper root form seems to be good with paice/husk and with modified stemmers.

[1]  Xin Li,et al.  Context sensitive stemming for web search , 2007, SIGIR.

[2]  William B. Frakes,et al.  Stemming Algorithms , 1992, Information Retrieval: Data Structures & Algorithms.

[3]  Nicola Orio,et al.  A novel method for stemmer generation based on hidden markov models , 2003, CIKM '03.

[4]  L. R. Rasmussen,et al.  In information retrieval: data structures and algorithms , 1992 .

[5]  David A. Hull,et al.  A Detailed Analysis of English Stemming Algorithms , 2006 .

[6]  S GiridharN,et al.  A Prospective Study of Stemming Algorithms for Web Text Mining , 2011 .

[7]  Julie Beth Lovins,et al.  Development of a stemming algorithm , 1968, Mech. Transl. Comput. Linguistics.

[8]  S. M. Eissen,et al.  Further Enhancement to the Porter’s Stemming Algorithm , 2004 .

[9]  James Mayfield,et al.  Single n-gram stemming , 2003, SIGIR.

[10]  W. Bruce Croft,et al.  Corpus-based stemming using cooccurrence of word variants , 1998, TOIS.

[11]  Chris D. Paice Method for Evaluation of Stemming Algorithms Based on Error Counting , 1996, J. Am. Soc. Inf. Sci..

[12]  Donna Harman,et al.  How effective is suffixing , 1991 .

[13]  Chris D. Paice,et al.  Another stemmer , 1990, SIGF.

[14]  Chris D. Paice An evaluation method for stemming algorithms , 1994, SIGIR '94.

[15]  Christopher J. Fox,et al.  Strength and similarity of affix removal stemming algorithms , 2003, SIGF.

[16]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[17]  Donna K. Harman,et al.  How effective is suffixing? , 1991, J. Am. Soc. Inf. Sci..

[18]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[19]  Prasenjit Majumder,et al.  YASS: Yet another suffix stripper , 2007, TOIS.

[20]  Sandeep R. Sirsat,et al.  Strength and Accuracy Analysis of Affix Removal Stemming Algorithms , 2013 .

[21]  Wessel Kraaij,et al.  Viewing stemming as recall enhancement , 1996, SIGIR '96.

[22]  Chris D. Paice,et al.  Method for Evaluation of Stemming Algorithms Based on Error Counting , 1996, J. Am. Soc. Inf. Sci..