Making fine-grained and coarse-grained sense distinctions , both manually and automatically

In this paper we discuss a persistent problem arising from polysemy: namely the difficulty of finding consistent criteria for making fine-grained sense distinctions, either manually or automatically. We investigate sources of human annotator disagreements stemming from the tagging for the English Verb Lexical Sample Task in the Senseval-2 exercise in automatic Word Sense Disambiguation. We also examine errors made by a high-performing maximum entropy Word Sense Disambiguation system we developed. Both sets of errors are at least partially reconciled by a more coarse-grained view of the senses, and we present the groupings we use for quantitative coarse-grained evaluation as well as the process by which they were created. We compare the system’s performance with our human annotator performance in light of both fine-grained and coarse-grained sense distinctions and show that well-defined sense groups can be of value in improving word sense disambiguation by both humans and machines.

[1]  Martha Palmer,et al.  Class-Based Construction of a Verb Lexicon , 2000, AAAI/IAAI.

[2]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[3]  Walter Daelemans,et al.  Memory-Based Word Sense Disambiguation , 2000, Comput. Humanit..

[4]  Olga Babko-Malaya,et al.  Different Sense Granularities for Different Applications , 2004, HLT-NAACL 2004.

[5]  Martha Palmer,et al.  Investigating Regular Sense Extensions based on Intersective Levin Classes , 1998, ACL.

[6]  Martha Palmer,et al.  Integrating compositional semantics into a verb lexicon , 2000, COLING.

[7]  Martha Palmer,et al.  Constraining Lexical Selection Across Languages Using TAGs , 1994, ArXiv.

[8]  Adam Kilgarriff,et al.  English Lexical Sample Task Description , 2001, *SEMEVAL.

[9]  George A. Miller,et al.  A Topical/Local Classifier for Word Sense Identification , 2000, Comput. Humanit..

[10]  Hwee Tou Ng,et al.  Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study , 2003, ACL.

[11]  David Yarowsky,et al.  The Johns Hopkins SENSEVAL2 system descriptions , 2001 .

[12]  B. Levin,et al.  Admitting Impediments , 2022, Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon.

[13]  Martha Palmer,et al.  Investigations into the role of lexical semantics in word sense disambiguation , 2004 .

[14]  Martha Palmer,et al.  Combining Contextual Features for Word Sense Disambiguation , 2002, SENSEVAL.

[15]  Adam Kilgarriff,et al.  Framework and Results for English SENSEVAL , 2000, Comput. Humanit..

[16]  Martha Palmer,et al.  Verb semantics for English-Chinese translation , 1995, Machine Translation.

[17]  Uri Zernik,et al.  Lexical acquisition: Exploiting on-line resources to build a lexicon. , 1991 .

[18]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[19]  Adam Kilgarriff,et al.  Introduction to the Special Issue on SENSEVAL , 2000, Comput. Humanit..

[20]  Jurij D. Apresjan REGULAR POLYSEMY , 1974 .

[21]  Christiane Fellbaum,et al.  English Tasks: All-Words and Verb Lexical Sample , 2001, *SEMEVAL.

[22]  Rada Mihalcea,et al.  Automatic generation of a coarse grained WordNet , 2001, HTL 2001.

[23]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[24]  Ramesh Krishnamurthy,et al.  Peeling an Onion: The Lexicographer's Experience ofManual Sense-Tagging , 2000, Comput. Humanit..

[25]  Martha Palmer,et al.  Using prepositions to extend a verb lexicon , 2004, HLT-NAACL 2004.

[26]  Leonard Talmy,et al.  Path to Realization: A Typology of Event Conflation , 1991 .

[27]  Louise Guthrie,et al.  Lexical Disambiguation using Simulated Annealing , 1992, COLING.

[28]  Adam Kilgarriff,et al.  The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[29]  David Yarowsky,et al.  Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation , 1999, Natural Language Engineering.

[30]  William B. Dolan,et al.  Word Sense Ambiguation: Clustering Related Senses , 1994, COLING.

[31]  Mitchell P. Marcus,et al.  Maximum entropy models for natural language ambiguity resolution , 1998 .

[32]  Martha Palmer,et al.  Simple Features for Chinese Word Sense Disambiguation , 2002, COLING.

[33]  Karen Sparck Jones Synonymy and semantic classification , 1986 .

[34]  Christiane Fellbaum,et al.  The Organization of Verbs and Verb Concepts in a Semantic Net , 1999 .

[35]  Adam Kilgarriff,et al.  "I Don’t Believe in Word Senses" , 1997, Comput. Humanit..

[36]  Scott Cotton,et al.  SENSEVAL-2: Overview , 2001, *SEMEVAL.

[37]  David Yarowsky,et al.  The John Hopkins SENSEVAL-2 System Descriptions , 2001, SENSEVAL@ACL.

[38]  James Pustejovsky,et al.  The Generative Lexicon , 1995, CL.

[39]  Patrick Hanks,et al.  Contextual dependency and lexical sets , 1996 .

[40]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[41]  G. Miller,et al.  Semantic networks of english , 1991, Cognition.

[42]  Patrick Hanks,et al.  Do Word Meanings Exist? , 2000, Comput. Humanit..

[43]  Christiane Fellbaum,et al.  Performance And Confidence In A Semantic Annotation Task , 1998 .

[44]  D. Geeraerts Vagueness's puzzles, polysemy's vagaries , 1993 .

[45]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[46]  Richard M. Schwartz,et al.  Nymble: a High-Performance Learning Name-finder , 1997, ANLP.

[47]  Nancy Ide,et al.  © 1999 Kluwer Academic Publishers. Printed in the Netherlands Cross-lingual Sense Determination: Can It Work? , 2022 .

[48]  Jason Eisner,et al.  Lexical Semantics , 2020, The Handbook of English Linguistics.

[49]  Christiane Fellbaum,et al.  Analysis of a Hand-Tagging Task , 1997, Workshop On Tagging Text With Lexical Semantics: Why, What, And How?.

[50]  Martha Palmer,et al.  Customizing verb definitions for specific semantic domains , 1990, Machine Translation.

[51]  Martha Palmer,et al.  Sense Tagging the Penn Treebank , 2000 .

[52]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[53]  Martha Palmer,et al.  From TreeBank to PropBank , 2002, LREC.

[54]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.