A Perspective on Word Sense Disambiguation Methods and Their Evaluation

In this position paper, we make several observations about the state of the art in automatic word sense disambiguation. Motivated by these observations, we offer several specific proposals to the community regarding improved evaluation criteria, common training and testing resources, and the definition of sense inventories.
