Unsupervised Sense Disambiguation Using Bilingual Probabilistic Models

We describe two probabilistic models for unsupervised word-sense disambiguation using parallel corpora. The first model, which we call the Sense model, builds on the work of Diab and Resnik (2002) that uses both parallel text and a sense inventory for the target language, and recasts their approach in a probabilistic framework. The second model, which we call the Concept model, is a hierarchical model that uses a concept latent variable to relate different language specific sense labels. We show that both models improve performance on the word sense disambiguation task over previous unsupervised approaches, with the Concept model showing the largest improvement. Furthermore, in learning the Concept model, as a by-product, we learn a sense inventory for the parallel language.

[1]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[2]  Ido Dagan Lexical Disambiguation: Sources of Information and their Statistical Realization , 1991, ACL.

[3]  Robert L. Mercer,et al.  Word-Sense Disambiguation Using Statistical Methods , 1991, ACL.

[4]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[5]  David Yarowsky,et al.  One Sense per Collocation , 1993, HLT.

[6]  Janyce Wiebe,et al.  A New Approach to Word Sense Disambiguation , 1994, HLT.

[7]  Alon Itai,et al.  Word Sense Disambiguation Using a Second Language Monolingual Corpus , 1994, CL.

[8]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[9]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[10]  Philip Resnik,et al.  Selectional Preference and Sense Disambiguation , 1997 .

[11]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[12]  David Yarowsky,et al.  Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation , 1999, Natural Language Engineering.

[13]  Kenneth C. Litkowski Senseval: The CL Research Experience , 2000, Comput. Humanit..

[14]  Eneko Agirre,et al.  Combining Supervised and Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation , 2000, Comput. Humanit..

[15]  Adam Kilgarriff,et al.  Framework and Results for English SENSEVAL , 2000, Comput. Humanit..

[16]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[17]  Dekang Lin Word Sense Disambiguation with a Similarity-Smoothed Case Library , 2000, Comput. Humanit..

[18]  Nancy Ide,et al.  © 1999 Kluwer Academic Publishers. Printed in the Netherlands Cross-lingual Sense Determination: Can It Work? , 2022 .

[19]  Philip Resnik,et al.  An Unsupervised Method for Word Sense Tagging using Parallel Corpora , 2002, ACL.

[20]  Philip Resnik,et al.  Word Sense Disambiguation within a Multilingual Framework , 2003 .

[21]  Yoshua Bengio,et al.  Extracting Hidden Sense Probabilities from Bitexts , 2003 .