Word Relatives in Context for Word Sense Disambiguation

The current situation for Word Sense Disambiguation (WSD) is somewhat stuck due to lack of training data. We present in this paper a novel disambiguation algorithm that improves previous systems based on acquisition of examples by incorporating local context information. With a basic configuration, our method is able to obtain state-of-the-art performance. We complemented this work by evaluating other well-known methods in the same dataset, and analysing the comparative results per word. We observed that each algorithm performed better for different types of words, and each of them failed for some particular words. We proposed then a simple unsupervised voting scheme that improved significantly over single systems, achieving the best unsupervised performance on both the Senseval 2 and Senseval 3 lexical sample datasets.

[1]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[2]  Kenneth C. Litkowski,et al.  Use of Machine Readable Dictionaries for Word-Sense Disambiguation in SENSEVAL-2 , 2001, *SEMEVAL.

[3]  David Yarowsky,et al.  DECISION LISTS FOR LEXICAL AMBIGUITY RESOLUTION: Application to Accent Restoration in Spanish and French , 1994, ACL.

[4]  Martha Palmer,et al.  The English all-words task , 2004, SENSEVAL@ACL.

[5]  Rada Mihalcea,et al.  Bootstrapping Large Sense Tagged Corpora , 2002, LREC.

[6]  Adam Kilgarriff,et al.  The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[7]  Eneko Agirre,et al.  Unsupervised WSD based on Automatically Retrieved Examples: The Importance of Bias , 2004, EMNLP.

[8]  Philip Resnik,et al.  The Linguist's Search Engine: An Overview , 2005, ACL.

[9]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[10]  Ted Pedersen The Duluth lexical sample systems in Senseval-3 , 2004, SENSEVAL@ACL.

[11]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[12]  Wee Sun Lee,et al.  Learning Semantic Classes for Word Sense Disambiguation , 2005, ACL.

[13]  Hwee Tou Ng,et al.  Scaling Up Word Sense Disambiguation via Parallel Texts , 2005, AAAI.

[14]  Xinglong Wang,et al.  Word Sense Disambiguation Using Sense Examples Automatically Acquired from a Second Language , 2005, HLT.

[15]  Adam Kilgarriff,et al.  English Lexical Sample Task Description , 2001, *SEMEVAL.

[16]  Xinglong Wang,et al.  Word Sense Disambiguation Using Automatically Translated Sense Examples , 2006 .

[17]  Bernardo Magnini,et al.  Integrating Subject Field Codes into WordNet , 2000, LREC.

[18]  George A. Miller,et al.  Using Corpus Statistics and WordNet Relations for Sense Identification , 1998, CL.

[19]  Yasemin Altun,et al.  Broad-Coverage Sense Disambiguation and Information Extraction with a Supersense Sequence Tagger , 2006, EMNLP.

[20]  Cheng Niu,et al.  Word Independent Context Pair Classification Model for Word Sense Disambiguation , 2005, CoNLL.

[21]  Julie Weeds,et al.  Finding Predominant Word Senses in Untagged Text , 2004, ACL.