Structured Generative Models of Continuous Features for Word Sense Induction

We propose a structured generative latent variable model that integrates information from multiple contextual representations for Word Sense Induction. Our approach jointly models global lexical, local lexical, and dependency syntactic context. Each context type is associated with a latent variable, and the three types of variables share a hierarchical structure. We use skip-gram based word and dependency context embeddings to construct all three types of representations, reducing the total number of parameters to be estimated and enabling better generalization. We describe an EM algorithm to efficiently estimate model parameters and use the Integrated Completed Likelihood (ICL) criterion to automatically estimate the number of senses. Our model achieves state-of-the-art results on the SemEval-2010 and SemEval-2013 Word Sense Induction datasets.
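The ICL criterion mentioned above augments BIC with an entropy penalty on the soft cluster assignments, so it prefers sense counts that yield well-separated clusters rather than merely a high likelihood. As an illustration (not the paper's hierarchical model), here is a minimal sketch of ICL-based selection of the number of components for a plain Gaussian mixture over embedding vectors, using scikit-learn; the function name and toy data are our own:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def select_k_by_icl(X, k_range, seed=0):
    """Pick the number of mixture components minimizing ICL.

    ICL = BIC + 2 * entropy of the posterior assignments
    (Biernacki, Celeux & Govaert, 2000). The entropy term penalizes
    overlapping components, favoring well-separated clusters.
    """
    best_k, best_icl = None, np.inf
    for k in k_range:
        gmm = GaussianMixture(n_components=k, covariance_type="diag",
                              random_state=seed).fit(X)
        tau = gmm.predict_proba(X)  # posterior responsibilities, shape (n, k)
        entropy = -np.sum(tau * np.log(np.clip(tau, 1e-12, None)))
        icl = gmm.bic(X) + 2.0 * entropy  # lower is better
        if icl < best_icl:
            best_k, best_icl = k, icl
    return best_k

# Toy data: two well-separated "sense" clusters in a 5-d embedding space.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-3.0, 0.5, (100, 5)),
               rng.normal(3.0, 0.5, (100, 5))])
print(select_k_by_icl(X, range(1, 5)))
```

In this sketch the entropy term is near zero when components are well separated, so ICL reduces to BIC; on ambiguous data it steers the selection toward fewer, cleaner senses.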
