Density Maximization of Context-to-sense Mapping for Unsupervised Word Sense Disambiguation

This paper proposes a novel unsupervised method employing large amount of unlabeled text corpora for all-words word sense disambiguation (WSD), which requires to discriminate huge variety of senses, thus unsupervised methods are desired to avoid constructing costly sense-labeled corpora. Given unlabeled corpora and a dictionary, the proposed method bases on the coherent correspondences between word contexts and word senses, and finds the all-words’ senses that maximize mapping density in context-to-sense product metric space. Experimental results confirmed the efficacy of our unsupervised method by showing the reliability of disambiguation if sufficient variations of word-types are provided in similar context.

[1]  Philip Resnik,et al.  A Perspective on Word Sense Disambiguation Methods and Their Evaluation , 2002 .

[2]  Ted Pedersen,et al.  UMND1: Unsupervised Word Sense Disambiguation Using Contextual Semantic Relatedness , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[3]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[4]  Rada Mihalcea,et al.  Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling , 2005, HLT.

[5]  Fernando Gomez,et al.  UCF-WS: Domain Word Sense Disambiguation Using Web Selectors , 2010, SemEval@ACL.

[6]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[7]  Eneko Agirre,et al.  Personalizing PageRank for Word Sense Disambiguation , 2009, EACL.

[8]  Pushpak Bhattacharyya,et al.  CFILT: Resource Conscious Approaches for All-Words Domain Specific WSD , 2010, SemEval@ACL.

[9]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[10]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[11]  Piek T. J. M. Vossen,et al.  Kyoto: An Integrated System for Specific Domain WSD , 2010, SemEval@ACL.

[12]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[13]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications , 2007 .

[14]  Dong-Hong Ji,et al.  Word Sense Disambiguation Using Label Propagation Based Semi-Supervised Learning , 2005, ACL.

[15]  Mirella Lapata,et al.  Graph Connectivity Measures for Unsupervised Word Sense Disambiguation , 2007, IJCAI.

[16]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[17]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[18]  Ted Pedersen,et al.  Word Sense Discrimination by Clustering Contexts in Vector and Similarity Spaces , 2004, CoNLL.

[19]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[20]  Wei Ding,et al.  TreeMatch: A Fully Unsupervised WSD System Using Dependency Knowledge on a Specific Domain , 2010, SemEval@ACL.

[21]  Ted Briscoe,et al.  The Second Release of the RASP System , 2006, ACL.

[22]  Julie Weeds,et al.  Unsupervised Acquisition of Predominant Word Senses , 2007, CL.

[23]  Siva Reddy,et al.  WSD as a Distributed Constraint Optimization Problem , 2010, ACL.

[24]  Jean Véronis,et al.  HyperLex: lexical cartography for information retrieval , 2004, Comput. Speech Lang..

[25]  Stefan Thater,et al.  Word Meaning in Context: A Simple and Effective Vector Model , 2011, IJCNLP.

[26]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[27]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[28]  Mark Stevenson,et al.  IIITH: Domain Specific Word Sense Disambiguation , 2010, SemEval@ACL.

[29]  Piek T. J. M. Vossen,et al.  SemEval-2010 Task 17: All-Words Word Sense Disambiguation on a Specific Domain , 2009, *SEMEVAL.

[30]  Yoshinori Sagisaka,et al.  Density Maximization in Context-Sense Metric Space for All-words WSD , 2013, ACL.

[31]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.