Domain Kernels for Word Sense Disambiguation

In this paper we present a supervised Word Sense Disambiguation methodology that exploits kernel methods to model sense distinctions. In particular, a combination of kernel functions is adopted to estimate syntagmatic and domain similarity independently. We define a kernel function, the Domain Kernel, that allows us to plug "external knowledge" into the supervised learning process. This external knowledge is acquired from unlabeled data in a fully unsupervised way and is represented by means of Domain Models. We evaluated our methodology on several lexical sample tasks in different languages, significantly outperforming the state of the art on each of them while reducing the amount of labeled training data required for learning.
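As a rough illustration of the idea of combining a domain-level kernel with a syntagmatic one, the sketch below may help. It is not the paper's implementation: the toy `DOMAIN_MODEL` is hand-written rather than learned from unlabeled text, the two domains are invented, the bigram-overlap `syntagmatic_kernel` is only a stand-in for a real syntagmatic kernel, and summing the two kernels is one simple way to combine them.

```python
import math

# Hypothetical toy Domain Model: each term is mapped to weights over two
# assumed domains (MEDICINE, COMPUTING). Hand-written for illustration only;
# the abstract says such models are acquired from unlabeled data.
DOMAIN_MODEL = {
    "virus": (0.6, 0.4),
    "doctor": (1.0, 0.0),
    "patient": (1.0, 0.0),
    "software": (0.0, 1.0),
    "laptop": (0.0, 1.0),
}


def domain_vector(tokens):
    """Project a context into domain space by summing its terms' domain rows."""
    vec = [0.0, 0.0]
    for tok in tokens:
        for i, weight in enumerate(DOMAIN_MODEL.get(tok, (0.0, 0.0))):
            vec[i] += weight
    return vec


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def domain_kernel(ctx_a, ctx_b):
    """Similarity of two contexts measured in domain space."""
    return cosine(domain_vector(ctx_a), domain_vector(ctx_b))


def bigrams(tokens):
    """Set of adjacent token pairs in a context."""
    return set(zip(tokens, tokens[1:]))


def syntagmatic_kernel(ctx_a, ctx_b):
    """Stand-in syntagmatic similarity: normalized count of shared bigrams."""
    a, b = bigrams(ctx_a), bigrams(ctx_b)
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))


def combined_kernel(ctx_a, ctx_b):
    """Sum of the two kernels; a sum of valid kernels is itself a kernel."""
    return domain_kernel(ctx_a, ctx_b) + syntagmatic_kernel(ctx_a, ctx_b)
```

In this toy setting, the contexts `["doctor", "treats", "virus"]` and `["patient", "has", "virus"]` share only one word and no bigram, yet the domain kernel rates them as highly similar because both fall in the assumed MEDICINE domain. That is the kind of external knowledge a purely lexical kernel would miss.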
