A Comparative Study of Support Vector Machines Applied to the Supervised Word Sense Disambiguation Problem in the Medical Domain

We have applied flve supervised learning approaches to word sense disambiguation in the medical domain. Our objective is to evaluate Support Vector Machines (SVMs) in comparison with other well known supervised learning algorithms including the na˜‡ve Bayes classifler, C4.5 decision trees, decision lists and boosting approaches. Based on these results we introduce further reflnements of these approaches. We have made use of unigram and bigram features selected using difierent fre- quency cut-ofi values and window sizes along with the statistical signif- icance test of the log likelihood measure for bigrams. Our results show that overall, the best SVM model was most accurate in 27 of 60 cases, compared to 22, 14, 10 and 14 for the na˜‡ve Bayes, C4.5 decision trees, decision list and boosting methods respectively.

[1]  Thomas C. Rindflesch,et al.  Using Symbolic Knowledge in the UMLS to Disambiguate Words in Small Datasets with a Naïve Bayes Classifier , 2004, MedInfo.

[2]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[3]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.

[4]  Marine Carpuat,et al.  Semantic role labeling with Boosting, SVMs, Maximum Entropy, SNOW, and Decision Lists , 2004, SENSEVAL@ACL.

[6]  Marc Weeber,et al.  Developing a test collection for biomedical word sense disambiguation , 2001, AMIA.

[7]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[8]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[9]  Hongfang Liu,et al.  Research Paper: A Multi-aspect Comparison Study of Supervised Word Sense Disambiguation , 2004, J. Am. Medical Informatics Assoc..

[10]  Philip Resnik,et al.  Supervised Sense Tagging using Support Vector Machines , 2001, *SEMEVAL.

[11]  Carlo Strapparava,et al.  Domain Kernels for Word Sense Disambiguation , 2005, ACL.

[12]  Hwee Tou Ng,et al.  Supervised Word Sense Disambiguation with Support Vector Machines and multiple knowledge sources , 2004, SENSEVAL@ACL.

[13]  Robert Tibshirani,et al.  Classification by Pairwise Coupling , 1997, NIPS.

[14]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[15]  Satanjeev Banerjee,et al.  The Design, Implementation, and Use of the Ngram Statistics Package , 2003, CICLing.

[16]  Robert E. Schapire,et al.  The Boosting Approach to Machine Learning An Overview , 2003 .