Selective Sampling for Example-based Word Sense Disambiguation

This paper proposes an efficient example sampling method for example-based word sense disambiguation systems. To construct a database of practical size, a considerable overhead for manual sense disambiguation (overhead for supervision) is required. In addition, the time complexity of searching a large-sized database poses a considerable problem (overhead for search). To counter these problems, our method selectively samples a smaller-sized effective subset from a given example set for use in word sense disambiguation. Our method is characterized by the reliance on the notion of training utility: the degree to which each example is informative for future example sampling when used for the training of the system. The system progressively collects examples by selecting those with greatest utility. The paper reports the effectiveness of our method through experiments on about one thousand sentences. Compared to experiments with other example sampling methods, our method reduced both the overhead for supervision and the overhead for search, without the degeneration of the performance of the system.

[1]  Shlomo Argamon,et al.  Minimizing Manual Annotation Cost in Supervised Training from Corpora , 1996, ACL.

[2]  Kentaro Inui,et al.  To what extent does case contribute to verb sense disambiguation? , 1996, COLING.

[3]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[4]  Takehito Utsuro Sense Classification of Verbal Polysemy based-on Bilingual Class/Class Association , 1996, COLING.

[5]  H. Schütze,et al.  Dimensions of meaning , 1992, Supercomputing '92.

[6]  Shimon Edelman,et al.  Learning Similarity-based Word Sense Disambiguation from Sparse Data , 1996, VLC@COLING.

[7]  Yoshihiko Nitta,et al.  Co-Occurrence Vectors From Corpora vs. Distance Vectors From Dictionaries , 1994, COLING.

[8]  Takehito Utsuro,et al.  Thesaurus-based Efficient Example Retrieval by Generating Retrieval Queries from Similarities , 1994, COLING.

[9]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[10]  Alon Itai,et al.  Word Sense Disambiguation Using a Second Language Monolingual Corpus , 1994, CL.

[11]  Makoto Nagao,et al.  A Method of Case Structure Analysis for Japanese Sentences Based on Examples in Case Frame Dictionary , 1994 .

[12]  Robert L. Mercer,et al.  Word-Sense Disambiguation Using Statistical Methods , 1991, ACL.

[13]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[14]  Francesc Ribas,et al.  On Learning more Appropriate Selectional Restrictions , 1995, EACL.

[15]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[16]  Tanaka Hozumi,et al.  To what extent does case contribute to verb sense disambiguation , 1996, COLING 1996.

[17]  Raymond J. Mooney,et al.  Comparative Experiments on Disambiguating Word Senses: An Illustration of the Role of Bias in Machine Learning , 1996, EMNLP.

[18]  David Yarowsky,et al.  Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs , 1992, ACL.

[19]  Makoto Nagao,et al.  A framework of a mechanical translation between Japanese and English by analogy principle , 1984 .

[20]  Ellen M. Voorhees,et al.  Corpus-Based Statistical Sense Resolution , 1993, HLT.

[21]  Tanaka Hozumi,et al.  Automatic thesaurus construction based on grammatical relations , 1995, IJCAI 1995.

[22]  James Pustejovsky,et al.  Lexical Knowledge Representation and Natural Language Processing , 1993, Artif. Intell..

[23]  Janyce Wiebe,et al.  Word-Sense Disambiguation Using Decomposable Models , 1994, ACL.

[24]  Stan Matwin,et al.  A WordNet-based Algorithm for Word Sense Disambiguation , 1995, IJCAI.

[25]  Hiroyuki Kaji,et al.  Learning Translation Templates From Bilingual Text , 1992, COLING.

[26]  Katashi Nagao A Preferential Constraint Satisfaction Technique for Natural Language Analysis , 1992, ECAI.

[27]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[28]  Yorick Wilks,et al.  Subject-Dependent Co-Occurence and Word Sense Disambiguation , 1991, ACL.

[29]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[30]  Marti A. Hearst Noun Homograph Disambiguation Using Local Context in Large Text Corpora , 1991 .

[31]  Hiroshi Nomiyama,et al.  Machine Translation by Case Generalization , 1992, COLING.

[32]  William A. Gale,et al.  A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[33]  Masayuki Kameda A Portable & Quick Japanese Parser: QJP , 1996, COLING.

[34]  Hwee Tou Ng,et al.  Exemplar-Based Word Sense Disambiguation” Some Recent Improvements , 1997, EMNLP.

[35]  P. Resnik Selection and information: a class-based approach to lexical relationships , 1993 .

[36]  David Yarowsky,et al.  A method for disambiguating word senses in a large corpus , 1992, Comput. Humanit..

[37]  Peter Mark Roget,et al.  Roget's International Thesaurus , 1977 .

[38]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[39]  Fumiyo Fukumoto,et al.  Automatic Recognition of Verbal Polysemy , 1994, COLING.

[40]  Ted Pedersen,et al.  Sequential Model Selection for Word Sense Disambiguation , 1997, ANLP.

[41]  Takenobu Tokunaga,et al.  Automatic Thesaurus Construction based on Grammatical Relations , 1995, IJCAI.

[42]  W. Bruce Croft,et al.  Lexical ambiguity and information retrieval , 1992, TOIS.

[43]  Uri Zernik Lexicon Acquisition: Learning from Corpus by Capitalizing on Lexical Categories , 1989, IJCAI.

[44]  Hwee Tou Ng,et al.  Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[45]  Donald Hindle,et al.  Noun Classification From Predicate-Argument Structures , 1990, ACL.

[46]  Venkata Subramaniam,et al.  Information Retrieval: Data Structures & Algorithms , 1992 .

[47]  Barry Smyth,et al.  Remembering To Forget: A Competence-Preserving Case Deletion Policy for Case-Based Reasoning Systems , 1995, IJCAI.

[48]  Susan Bonzi,et al.  Semantic interpretation and the resolution of ambiguity , 1989, JASIS.

[49]  Steven L. Lytinen Dynamically Combining Syntax and Semantics in Natural Language Processing , 1986, AAAI.

[50]  Naohiko Uramoto A Best-Match Algorithm for Broad-Coverage Example-Based Disambiguation , 1994, COLING.

[51]  Naohiko Uramoto Example-Based Word-Sense Disambiguation , 1994 .