论文信息 - Good Neighbors Make Good Senses: Exploiting Distributional Similarity for Unsupervised WSD

Good Neighbors Make Good Senses: Exploiting Distributional Similarity for Unsupervised WSD

We present an automatic method for senselabeling of text in an unsupervised manner. The method makes use of distributionally similar words to derive an automatically labeled training set, which is then used to train a standard supervised classifier for distinguishing word senses. Experimental results on the Senseval-2 and Senseval-3 datasets show that our approach yields significant improvements over state-of-the-art unsupervised methods, and is competitive with supervised ones, while eliminating the annotation cost.

Mirella Lapata | Samuel Brody

[1] Jean Véronis,et al. HyperLex: lexical cartography for information retrieval , 2004, Comput. Speech Lang..

[2] Graeme Hirst,et al. Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[3] Amanda Spink,et al. Linguistic Aspects of Web Queries. , 2000 .

[4] Ganesh Ramakrishnan,et al. Passage Scoring for Question Answering via Bayesian Inference on Lexical Relations , 2003, TREC.

[5] Julie Elizabeth Weeds,et al. Measures and applications of lexical distributional similarity , 2003 .

[6] George A. Miller,et al. Using Corpus Statistics and WordNet Relations for Sense Identification , 1998, CL.

[7] David Yarowsky,et al. Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[8] Philip G. Edmonds. Designing a task for SENSEVAL-2 , 2000 .

[9] Ted Briscoe,et al. Robust Accurate Statistical Annotation of General Text , 2002, LREC.

[10] Dong-Hong Ji,et al. Word Sense Disambiguation Using Label Propagation Based Semi-Supervised Learning , 2005, ACL.

[11] Julie Weeds,et al. Finding Predominant Word Senses in Untagged Text , 2004, ACL.