论文信息 - Bootstrapping Large Sense Tagged Corpora

Bootstrapping Large Sense Tagged Corpora

The performance of Word Sense Disambiguation systems largely depends on the availability of sense tagged corpora. Since the semantic annotations are usually done by humans, the size of such corpora is limited to a handful of tagged texts. This paper proposes a generation algorithm that may be used to automatically create large sense tagged corpora. The approach is evaluated through comparative sense disambiguation experiments performed on data provided during the SENSEVAL-2 English all words and English lexical sample tasks.

Rada Mihalcea | Rada Mihalcea

[1] George A. Miller,et al. A Semantic Concordance , 1993, HLT.

[2] Janyce Wiebe,et al. Word-Sense Disambiguation Using Decomposable Models , 1994, ACL.

[3] Rada Mihalcea,et al. Word sense disambiguation with pattern learning and automatic feature selection , 2002, Natural Language Engineering.

[4] George A. Miller,et al. Using Corpus Statistics and WordNet Relations for Sense Identification , 1998, CL.

[5] Dan Klein,et al. Combining Heterogeneous Classifiers for Word Sense Disambiguation , 2001, SENSEVAL@ACL.

[6] Hwee Tou Ng,et al. Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[7] Eric Brill,et al. Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[8] David Yarowsky,et al. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[9] Ted Pedersen,et al. A Decision Tree of Bigrams is an Accurate Predictor of Word Sense , 2001, NAACL.

[10] Walter Daelemans,et al. Memory-Based Word Sense Disambiguation , 2000, Comput. Humanit..

[11] Rada Mihalcea,et al. An Iterative Approach to Word Sense Disambiguation , 2000, FLAIRS.

[12] Rada Mihalcea,et al. An Automatic Method for Generating Sense Tagged Corpora , 1999, AAAI/IAAI.