It Takes Two to Tango: A Bilingual Unsupervised Approach for Estimating Sense Distributions using Expectation Maximization

Several bilingual WSD algorithms which exploit translation correspondences between parallel corpora have been proposed. However, the availability of such parallel corpora itself is a tall task for some of the resource constrained languages of the world. We propose an unsupervised bilingual EM based algorithm which relies on the counts of translations to estimate sense distributions. No parallel or sense annotated corpora are needed. The algorithm relies on a synset-aligned bilingual dictionary and in-domain corpora from the two languages. A symmetric generalized Expectation Maximization formulation is used wherein the sense distributions of words in one language are estimated based on the raw counts of the words in the aligned synset in the target language. The overall performance of our algorithm when tested on 4 language-domain pairs is better than current state-of-the-art knowledge based and bilingual unsupervised ap

[1]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[2]  Alon Itai,et al.  Word Sense Disambiguation Using a Second Language Monolingual Corpus , 1994, CL.

[3]  Marianna Apidianaki Translation-oriented Word Sense Induction Based on Parallel Corpora , 2008, LREC.

[4]  Suresh Manandhar,et al.  Word Sense Induction Using Graphs of Collocations , 2008, ECAI.

[5]  ResnikPhilip,et al.  Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation , 1999 .

[6]  Pushpak Bhattacharyya,et al.  Synset Based Multilingual Dictionary: Insights, Applications and Challenges , 2008 .

[7]  Hwee Tou Ng,et al.  Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study , 2003, ACL.

[8]  Pushpak Bhattacharyya,et al.  Projecting Parameters for Multilingual Word Sense Disambiguation , 2009, EMNLP.

[9]  Nancy Ide,et al.  Automatic Sense Tagging Using Parallel Corpora , 2001, NLPRS.

[10]  Hang Li,et al.  Word Translation Disambiguation Using Bilingual Bootstrapping , 2004, Computational Linguistics.

[11]  Pushpak Bhattacharyya,et al.  Value for Money: Balancing Annotation Effort, Lexicon Building and Accuracy for Multilingual WSD , 2010, COLING.

[12]  Hiroyuki Kaji,et al.  Unsupervised Word-Sense Disambiguation Using Bilingual Comparable Corpora , 2002, IEICE Trans. Inf. Syst..

[13]  Eneko Agirre,et al.  Personalizing PageRank for Word Sense Disambiguation , 2009, EACL.

[14]  Jean Véronis,et al.  HyperLex: lexical cartography for information retrieval , 2004, Comput. Speech Lang..

[15]  Philip Resnik,et al.  An Unsupervised Method for Word Sense Tagging using Parallel Corpora , 2002, ACL.

[16]  Hwee Tou Ng,et al.  Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[17]  Julie Weeds,et al.  Finding Predominant Word Senses in Untagged Text , 2004, ACL.

[18]  Nancy Ide,et al.  Fine-Grained Word Sense Disambiguation Based on Parallel Corpora, Word Alignment, Word Clustering and Aligned Wordnets , 2004, COLING.

[19]  Piek T. J. M. Vossen,et al.  SemEval-2010 Task 17: All-Words Word Sense Disambiguation on a Specific Domain , 2009, *SEMEVAL.

[20]  Rada Mihalcea,et al.  Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling , 2005, HLT.

[21]  Eneko Agirre,et al.  Word Sense Disambiguation using Conceptual Density , 1996, COLING.

[22]  Hiroyuki Kaji,et al.  Unsupervised word sense disambiguation using bilingual comparable corpora , 2002, COLING 2002.

[23]  Maria Leonor Pacheco,et al.  of the Association for Computational Linguistics: , 2001 .

[24]  Hwee Tou Ng,et al.  Supervised Word Sense Disambiguation with Support Vector Machines and multiple knowledge sources , 2004, SENSEVAL@ACL.

[25]  David Yarowsky,et al.  A method for disambiguating word senses in a large corpus , 1992, Comput. Humanit..