Embedding a Semantic Network in a Word Space

We present a framework that uses continuous-space vector representations of word meaning to derive new vectors representing the meanings of the senses listed in a semantic network. It is a post-processing approach that can be applied to several types of word vector representations. It rests on two ideas: first, that the vector of a polysemous word can be decomposed into a convex combination of sense vectors; second, that the vector for a sense should be kept similar to those of its neighbors in the network. This leads to a constrained optimization problem, and we present an approximation for the case where the distance function is the squared Euclidean distance. We apply the algorithm to a Swedish semantic network, and we evaluate the quality of the resulting sense representations extrinsically by showing that they give large improvements when used in a classifier that creates lexical units for FrameNet frames.
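The idea of the optimization can be sketched in code. The following is a minimal illustrative sketch, not the paper's exact algorithm: it alternates between pulling each sense vector toward the centroid of its network neighbors (the neighbor-similarity term under a squared Euclidean distance) and re-imposing the constraint that the mixture-weighted sum of the sense vectors equals the word vector (the convex-combination decomposition). All names and the weight-update heuristic are assumptions for illustration.

```python
import numpy as np

def embed_senses(word_vec, neighbor_vecs, n_iter=50):
    """Illustrative sketch: decompose a word vector into sense vectors.

    word_vec: (d,) vector of the polysemous word.
    neighbor_vecs: one (m_i, d) array of network-neighbor vectors per sense.
    Returns (sense_vecs, weights) with weights @ sense_vecs == word_vec,
    weights non-negative and summing to one (the convex combination).
    """
    k = len(neighbor_vecs)
    weights = np.full(k, 1.0 / k)            # convex mixture weights p_i
    sense_vecs = np.tile(word_vec, (k, 1))   # initialize at the word vector

    for _ in range(n_iter):
        # Pull each sense vector halfway toward the centroid of its
        # neighbors: a step that reduces the squared Euclidean distance
        # between a sense and its neighbors in the network.
        centroids = np.array([nb.mean(axis=0) for nb in neighbor_vecs])
        sense_vecs = 0.5 * (sense_vecs + centroids)
        # Re-estimate the mixture weights from closeness to the word vector
        # (a simple heuristic stand-in for the paper's weight update).
        sims = np.exp(-np.sum((sense_vecs - word_vec) ** 2, axis=1))
        weights = sims / sims.sum()
        # Re-impose the convex-combination constraint by shifting all
        # senses by the residual; since the weights sum to one, this makes
        # weights @ sense_vecs equal word_vec exactly.
        residual = word_vec - weights @ sense_vecs
        sense_vecs = sense_vecs + residual
    return sense_vecs, weights
```

For a word with two senses whose neighbors point in different directions, the two returned sense vectors separate toward their respective neighborhoods while their weighted sum still reconstructs the original word vector.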