Word sense disambiguation of WordNet glosses

This paper presents a suite of methods and results for the semantic disambiguation of WordNet glosses. WordNet is a resource widely used in natural language processing and artificial intelligence. Because it was designed as a lexical database, WordNet exhibits some deficiencies when used as a knowledge base. By semantically disambiguating the words in the glosses, we add a pointer from each gloss word to its concept (synset), which increases the connectivity between WordNet concepts by approximately an order of magnitude. We show how lexical chains and other applications can be built on this more richly connected WordNet. The glosses are disambiguated automatically, using methods based on a set of heuristics. The precision of the semantic annotation is improved by voting between the disambiguation system described here and a second WSD system. All of WordNet 2.0 has been disambiguated with an overall precision of 86%, and the result is available at http://xwn.hlt.utdallas.edu.
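The abstract does not spell out the voting scheme, so the following is only a minimal sketch of one plausible reading: a sense assigned to a gloss word is marked as higher confidence when the two systems agree, and the first system's answer is kept otherwise. The function name `vote`, the token keys, the sense labels, and the confidence labels `agreed` and `single` are all hypothetical and are not taken from the paper.

```python
from typing import Dict, Tuple

# Hypothetical key for a gloss word: (gloss identifier, token position).
TokenKey = Tuple[str, int]


def vote(
    primary: Dict[TokenKey, str],
    secondary: Dict[TokenKey, str],
) -> Dict[TokenKey, Tuple[str, str]]:
    """Combine the sense tags of two WSD systems (assumed scheme).

    For every token tagged by the primary system, keep its sense and
    attach a label: "agreed" if the secondary system chose the same
    sense, "single" if it disagreed or produced no tag.
    """
    combined: Dict[TokenKey, Tuple[str, str]] = {}
    for key, sense in primary.items():
        label = "agreed" if secondary.get(key) == sense else "single"
        combined[key] = (sense, label)
    return combined


# Toy input: two tokens of one gloss, with made-up sense labels.
primary = {("gloss-a", 0): "plant#1", ("gloss-a", 1): "grow#2"}
secondary = {("gloss-a", 0): "plant#1", ("gloss-a", 1): "grow#1"}
result = vote(primary, secondary)
```

Under this assumed scheme, precision can be traded against coverage by keeping only the tokens labeled `agreed`.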
