论文信息 - Making fine-grained and coarse-grained sense distinctions , both manually and automatically - 字舞流文

Making fine-grained and coarse-grained sense distinctions , both manually and automatically

In this paper we discuss a persistent problem arising from polysemy: namely the difficulty of finding consistent criteria for making fine-grained sense distinctions, either manually or automatically. We investigate sources of human annotator disagreements stemming from the tagging for the English Verb Lexical Sample Task in the Senseval-2 exercise in automatic Word Sense Disambiguation. We also examine errors made by a high-performing maximum entropy Word Sense Disambiguation system we developed. Both sets of errors are at least partially reconciled by a more coarse-grained view of the senses, and we present the groupings we use for quantitative coarse-grained evaluation as well as the process by which they were created. We compare the system’s performance with our human annotator performance in light of both fine-grained and coarse-grained sense distinctions and show that well-defined sense groups can be of value in improving word sense disambiguation by both humans and machines.

M. A. R T H A P A L | H. O A T R A N G D A N G

[1] Martha Palmer,et al. Class-Based Construction of a Verb Lexicon , 2000, AAAI/IAAI.

[2] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[3] Walter Daelemans,et al. Memory-Based Word Sense Disambiguation , 2000, Comput. Humanit..

[4] Olga Babko-Malaya,et al. Different Sense Granularities for Different Applications , 2004, HLT-NAACL 2004.

[5] Martha Palmer,et al. Investigating Regular Sense Extensions based on Intersective Levin Classes , 1998, ACL.

[6] Martha Palmer,et al. Integrating compositional semantics into a verb lexicon , 2000, COLING.

[7] Martha Palmer,et al. Constraining Lexical Selection Across Languages Using TAGs , 1994, ArXiv.

[8] Adam Kilgarriff,et al. English Lexical Sample Task Description , 2001, *SEMEVAL.

[9] George A. Miller,et al. A Topical/Local Classifier for Word Sense Identification , 2000, Comput. Humanit..

[10] Hwee Tou Ng,et al. Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study , 2003, ACL.

[11] David Yarowsky,et al. The Johns Hopkins SENSEVAL2 system descriptions , 2001 .

[12] B. Levin,et al. Admitting Impediments , 2022, Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon.

[13] Martha Palmer,et al. Investigations into the role of lexical semantics in word sense disambiguation , 2004 .

[14] Martha Palmer,et al. Combining Contextual Features for Word Sense Disambiguation , 2002, SENSEVAL.

[15] Adam Kilgarriff,et al. Framework and Results for English SENSEVAL , 2000, Comput. Humanit..

[16] Martha Palmer,et al. Verb semantics for English-Chinese translation , 1995, Machine Translation.

[17] Uri Zernik,et al. Lexical acquisition: Exploiting on-line resources to build a lexicon. , 1991 .

[18] Beth Levin,et al. English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[19] Adam Kilgarriff,et al. Introduction to the Special Issue on SENSEVAL , 2000, Comput. Humanit..

[20] Jurij D. Apresjan. REGULAR POLYSEMY , 1974 .

[21] Christiane Fellbaum,et al. English Tasks: All-Words and Verb Lexical Sample , 2001, *SEMEVAL.

[22] Rada Mihalcea,et al. Automatic generation of a coarse grained WordNet , 2001, HTL 2001.

[23] Daniel Gildea,et al. The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[24] Ramesh Krishnamurthy,et al. Peeling an Onion: The Lexicographer's Experience ofManual Sense-Tagging , 2000, Comput. Humanit..

[25] Martha Palmer,et al. Using prepositions to extend a verb lexicon , 2004, HLT-NAACL 2004.

[26] Leonard Talmy,et al. Path to Realization: A Typology of Event Conflation , 1991 .

[27] Louise Guthrie,et al. Lexical Disambiguation using Simulated Annealing , 1992, COLING.

[28] Adam Kilgarriff,et al. The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[29] David Yarowsky,et al. Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation , 1999, Natural Language Engineering.

[30] William B. Dolan,et al. Word Sense Ambiguation: Clustering Related Senses , 1994, COLING.

[31] Mitchell P. Marcus,et al. Maximum entropy models for natural language ambiguity resolution , 1998 .

[32] Martha Palmer,et al. Simple Features for Chinese Word Sense Disambiguation , 2002, COLING.

[33] Karen Sparck Jones. Synonymy and semantic classification , 1986 .

[34] Christiane Fellbaum,et al. The Organization of Verbs and Verb Concepts in a Semantic Net , 1999 .

[35] Adam Kilgarriff,et al. "I Don’t Believe in Word Senses" , 1997, Comput. Humanit..

[36] Scott Cotton,et al. SENSEVAL-2: Overview , 2001, *SEMEVAL.

[37] David Yarowsky,et al. The John Hopkins SENSEVAL-2 System Descriptions , 2001, SENSEVAL@ACL.

[38] James Pustejovsky,et al. The Generative Lexicon , 1995, CL.

[39] Patrick Hanks,et al. Contextual dependency and lexical sets , 1996 .

[40] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[41] G. Miller,et al. Semantic networks of english , 1991, Cognition.

[42] Patrick Hanks,et al. Do Word Meanings Exist? , 2000, Comput. Humanit..

[43] Christiane Fellbaum,et al. Performance And Confidence In A Semantic Annotation Task , 1998 .

[44] D. Geeraerts. Vagueness's puzzles, polysemy's vagaries , 1993 .

[45] Nancy Ide,et al. Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[46] Richard M. Schwartz,et al. Nymble: a High-Performance Learning Name-finder , 1997, ANLP.

[47] Nancy Ide,et al. © 1999 Kluwer Academic Publishers. Printed in the Netherlands Cross-lingual Sense Determination: Can It Work? , 2022 .

[48] Jason Eisner,et al. Lexical Semantics , 2020, The Handbook of English Linguistics.

[49] Christiane Fellbaum,et al. Analysis of a Hand-Tagging Task , 1997, Workshop On Tagging Text With Lexical Semantics: Why, What, And How?.

[50] Martha Palmer,et al. Customizing verb definitions for specific semantic domains , 1990, Machine Translation.

[51] Martha Palmer,et al. Sense Tagging the Penn Treebank , 2000 .

[52] Adam L. Berger,et al. A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[53] Martha Palmer,et al. From TreeBank to PropBank , 2002, LREC.

[54] Dekang Lin,et al. Automatic Retrieval and Clustering of Similar Words , 1998, ACL.