Semi-Supervised Learning for Word Sense Disambiguation: Quality vs. Quantity

In this paper, we discuss the importance of the quality against the quantity of automatically extracted examples for word sense disambiguation (WSD). We first show that we can build a competitive WSD system with a memory-based classifier and a feature set reduced to easily and eciently computable features. We then show that adding automatically annotated examples improves the performance of this system when the examples are carefully selected based on their quality.

[1]  Judita Preiss Probabilistic WSD in Senseval-3 , 2004, SENSEVAL@ACL.

[2]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[3]  Ted Pedersen,et al.  SenseRelate: : TargetWord-A Generalized Framework for Word Sense Disambiguation , 2005, ACL.

[4]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[5]  Cheng Niu,et al.  Context clustering for Word Sense Disambiguation based on modeling pairwise context similarities , 2004, SENSEVAL@ACL.

[6]  Silvia Bernardini,et al.  Introducing and evaluating ukWaC , a very large web-derived corpus of English , 2008 .

[7]  Georgiana Dinu,et al.  Sometimes Less Is More : Romanian Word Sense Disambiguation Revisited , 2007 .

[8]  Cristian Grozea,et al.  Finding optimal parameter settings for high performance word sense disambiguation , 2004 .

[9]  Adam Kilgarriff,et al.  The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[10]  Hwee Tou Ng,et al.  Supervised Word Sense Disambiguation with Support Vector Machines and multiple knowledge sources , 2004, SENSEVAL@ACL.

[11]  Ganesh Ramakrishnan,et al.  A gloss-centered algorithm for disambiguation , 2004, SENSEVAL@ACL.

[12]  Hwee Tou Ng,et al.  Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[13]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[14]  Jun'ichi Tsujii,et al.  Bidirectional Inference with the Easiest-First Strategy for Tagging Sequence Data , 2005, HLT.

[15]  Rada Mihalcea,et al.  Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation , 2002, COLING.

[16]  Carlo Strapparava,et al.  Pattern abstraction and term similarity for Word Sense Disambiguation: IRST at Senseval-3 , 2004 .

[17]  Julio Gonzalo,et al.  Automatic Acquisition of Lexical Information and Examples , 2007 .

[18]  Rada Mihalcea,et al.  Co-training and Self-training for Word Sense Disambiguation , 2004, CoNLL.

[19]  Ken Litkowski,et al.  Senseval-3 task: Automatic labeling of semantic roles , 2004, SENSEVAL@ACL.

[20]  Walter Daelemans,et al.  Forgetting Exceptions is Harmful in Language Learning , 1998, Machine Learning.

[21]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.