论文信息 - Evaluating and optimizing the parameters of an unsupervised graph-based WSD algorithm

Evaluating and optimizing the parameters of an unsupervised graph-based WSD algorithm

Veronis (2004) has recently proposed an innovative unsupervised algorithm for word sense disambiguation based on small-world graphs called HyperLex. This paper explores two sides of the algorithm. First, we extend Veronis' work by optimizing the free parameters (on a set of words which is different to the target set). Second, given that the empirical comparison among unsupervised systems (and with respect to supervised systems) is seldom made, we used hand-tagged corpora to map the induced senses to a standard lexicon (WordNet) and a publicly available gold standard (Senseval 3 English Lexical Sample). Our results for nouns show that thanks to the optimization of parameters and the mapping method, HyperLex obtains results close to supervised systems using the same kind of bag-of-words features. Given the information loss inherent in any mapping step and the fact that the parameters were tuned for another set of words, these are very interesting results.

[1] George A. Miller,et al. A Semantic Concordance , 1993, HLT.

[2] Hinrich Schütze,et al. Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[3] Duncan J. Watts,et al. Collective dynamics of ‘small-world’ networks , 1998, Nature.

[4] Albert,et al. Emergence of scaling in random networks , 1999, Science.

[5] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[6] Patrick Pantel,et al. Discovering word senses from text , 2002, KDD.

[7] Ted Pedersen,et al. Word Sense Discrimination by Clustering Contexts in Vector and Similarity Spaces , 2004, CoNLL.

[8] Martha Palmer,et al. The English all-words task , 2004, SENSEVAL@ACL.

[9] Adam Kilgarriff,et al. The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[10] Jean Véronis,et al. HyperLex: lexical cartography for information retrieval , 2004, Comput. Speech Lang..

[11] Cheng Niu,et al. Word Independent Context Pair Classification Model for Word Sense Disambiguation , 2005, CoNLL.