Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation

Resnik and Yarowsky (1997) made a set of observations about the state-of-the-art in automatic word sense disambiguation and, motivated by those observations, offered several specific proposals regarding improved evaluation criteria, common training and testing resources, and the definition of sense inventories. Subsequent discussion of those proposals resulted in SENSEVAL, the first evaluation exercise for word sense disambiguation (Kilgarriff and Palmer 2000). This article is a revised and extended version of our 1997 workshop paper, reviewing its observations and proposals and discussing them in light of the SENSEVAL exercise. It also includes a new in-depth empirical study of translingually-based sense inventories and distance measures, using statistics collected from native-speaker annotations of 222 polysemous contexts across 12 languages. These data show that monolingual sense distinctions at most levels of granularity can be effectively captured by translations into some set of second languages, especially as language family distance increases. In addition, the probability that a given sense pair will tend to lexicalize differently across languages is shown to correlate with semantic salience and sense granularity; sense hierarchies automatically generated from such distance matrices yield results remarkably similar to those created by professional monolingual lexicographers.

[1]  Susan McRoy,et al.  Using Multiple Knowledge Sources for Word Sense Discrimination , 1992, Comput. Linguistics.

[2]  Piek Vossen,et al.  User requirments and functional specifications of the EuroWordNet project , 1996 .

[3]  Bonnie J. Dorr,et al.  Role of Word Sense Disalnbiguation in Lexical Acquisition: Predicting Semantics from Syntactic Cues , 1996, COLING.

[4]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[5]  Raymond J. Mooney,et al.  Comparative Experiments on Disambiguating Word Senses: An Illustration of the Role of Bias in Machine Learning , 1996, EMNLP.

[6]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[7]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[8]  Janyce Wiebe,et al.  Word-Sense Disambiguation Using Decomposable Models , 1994, ACL.

[9]  Douglas A. Jones,et al.  Acquisition of Semantic Lexicons: Using Word Sense Disambiguation to Improve Precision , 1996 .

[10]  Atro Voutilainen,et al.  Tagging accurately - Don't guess if you know , 1994, ANLP.

[11]  Philip Resnik,et al.  Mining the Web for Bilingual Text , 1999, ACL.

[12]  Nancy Ide,et al.  © 1999 Kluwer Academic Publishers. Printed in the Netherlands Cross-lingual Sense Determination: Can It Work? , 2022 .

[13]  P. Resnik Selection and information: a class-based approach to lexical relationships , 1993 .

[14]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[15]  David Yarowsky,et al.  A method for disambiguating word senses in a large corpus , 1992, Comput. Humanit..

[16]  R. Burchfield Frequency Analysis of English Usage: Lexicon and Grammar. By W. Nelson Francis and Henry Kučera with the assistance of Andrew W. Mackie. Boston: Houghton Mifflin. 1982. x + 561 , 1985 .

[17]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[18]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[19]  Takenobu Tokunaga,et al.  The Use of WordNet in Information Retrieval , 1998, WordNet@ACL/COLING.

[20]  George A. Miller,et al.  Using a Semantic Concordance for Sense Identification , 1994, HLT.

[21]  Mark W. Davis,et al.  New Experiments In Cross-Language Text Retrieval At NMSU's Computing Research Lab , 1996, TREC.

[22]  Ellen M. Voorhees,et al.  Corpus-Based Statistical Sense Resolution , 1993, HLT.

[23]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[24]  W. Bruce Croft,et al.  Lexical ambiguity and information retrieval , 1992, TOIS.

[25]  David Yarowsky,et al.  One Sense Per Discourse , 1992, HLT.

[26]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[27]  Frederick Jelinek,et al.  Markov Source Modeling of Text Generation , 1985 .

[28]  Yorick Wilks,et al.  The Grammar of Sense: Is word-sense tagging much more than part-of-speech tagging? , 1996, ArXiv.

[29]  Philip Resnik,et al.  Tagger Evaluation Given Hierarchical Tag Sets , 2000, Comput. Humanit..

[30]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[31]  Jill Fain Lehman Toward the Essential Nature of Statistical Knowledge in Sense Resolution , 1994, AAAI.

[32]  David Yarowsky,et al.  One Sense per Collocation , 1993, HLT.

[33]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Philip Resnik,et al.  Parallel strands: a preliminary investigation into mining the Web for bilingual text , 1998, AMTA.

[35]  Christiane Fellbaum,et al.  Performance And Confidence In A Semantic Annotation Task , 1998 .

[36]  Nancy M. Ide,et al.  Parallel Translations as Sense Discriminators , 1999 .

[37]  Kenneth Ward Church,et al.  Introduction to the Special Issue on Computational Linguistics Using Large Corpora , 1993, Comput. Linguistics.

[38]  Yorick Wilks,et al.  Word Sense Disambiguation using Optimised Combinations of Knowledge Sources , 1998, COLING-ACL.

[39]  Adam Kilgarriff,et al.  SENSEVAL: an exercise in evaluating world sense disambiguation programs , 1998, LREC.

[40]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[41]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[42]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[43]  Eric Brill,et al.  A corpus-based approach to language learning , 1993 .

[44]  Hwee Tou Ng,et al.  Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[45]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[46]  Christiane Fellbaum,et al.  Building Semantic Concordances , 1998 .