Probabilistic Latent Semantic Analysis for Unsupervised Word Sense Disambiguation

This paper presents an unsupervised approach for disambiguating between various senses of a word to select the most appropriate sense, based on the context in the text. We have defined a Probabilistic Latent Semantic Analysis (PLSA) based Word Sense Disambiguation (WSD) system in which sense tagged annotated data is not required for training and the system is language independent giving 83% and 74% accuracy for English and Hindi languages respectively. Also, through word sense disambiguation experiments, we have shown that byapplying Word net in this algorithm, performance of our system can be further enhanced.

[1]  Curt Burgess,et al.  Modelling Parsing Constraints with High-dimensional Context Space , 1997 .

[2]  Minoru Sasaki,et al.  Unsupervised learning of word sense disambiguation rules by estimating an optimum iteration number in the EM algorithm , 2003, CoNLL.

[3]  Ehsan Hessami,et al.  Unsupervised Graph-based Word Sense Disambiguation Using lexical relation of WordNet , 2011 .

[4]  Ratna Sanyal,et al.  Semantic document classification and keyword spotting in digital repositories , 2009, MEDES.

[5]  Ted Pedersen,et al.  Learning Probabilistic Models of Word Sense Disambiguation , 2007, ArXiv.

[6]  Adam Kilgarriff,et al.  English Senseval: Report and Results , 2000, LREC.

[7]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[8]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[9]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[10]  Paola Velardi,et al.  Structural semantic interconnections: a knowledge-based approach to word sense disambiguation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Robert Wilensky,et al.  Experiments in Improving Unsupervised Word Sense Disambiguation , 2003 .

[12]  Jean Véronis,et al.  HyperLex: lexical cartography for information retrieval , 2004, Comput. Speech Lang..

[13]  H. Schütze,et al.  Dimensions of meaning , 1992, Supercomputing '92.

[14]  Michalis Vazirgiannis,et al.  Word Sense Disambiguation with Spreading Activation Networks Generated from Thesauri , 2007, IJCAI.

[15]  Mirella Lapata,et al.  Graph Connectivity Measures for Unsupervised Word Sense Disambiguation , 2007, IJCAI.

[16]  Eneko Agirre,et al.  Exploiting domain information for Word Sense Disambiguation of medical documents , 2011, J. Am. Medical Informatics Assoc..

[17]  Richard Wicentowski,et al.  Unsupervised Italian Word Sense Disambiguation using WordNets and Unlabeled Corpora , 2002, SENSEVAL.

[18]  Guergana Savova,et al.  Resolving Ambiguities in Biomedical Text With Unsupervised Clustering Approaches , 2005 .

[19]  Rada Mihalcea,et al.  Bootstrapping Large Sense Tagged Corpora , 2002, LREC.

[20]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[21]  Janyce Wiebe,et al.  Word-Sense Disambiguation Using Decomposable Models , 1994, ACL.

[22]  Atul Kumar,et al.  Effect of Pronoun Resolution on Document Similarity , 2010 .

[23]  Mark Stevenson,et al.  Unsupervised Domain Tuning to Improve Word Sense Disambiguation , 2013, HLT-NAACL.

[24]  Patrick Pantel,et al.  Concept Discovery from Text , 2002, COLING.

[25]  Shou-De Lin,et al.  A Semantics-Enhanced Language Model for Unsupervised Word Sense Disambiguation , 2008, CICLing.

[26]  Rada Mihalcea,et al.  Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling , 2005, HLT.

[27]  Mirella Lapata,et al.  An Experimental Study of Graph Connectivity for Unsupervised Word Sense Disambiguation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[29]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[30]  Ted Pedersen,et al.  Distinguishing Word Senses in Untagged Text , 1997, EMNLP.

[31]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications , 2007 .