Paper information - Combining Relational and Distributional Knowledge for Word Sense Disambiguation

We present a new approach to word sense disambiguation derived from recent ideas in distributional semantics. The input to the algorithm is a large unlabeled corpus and a graph describing how senses are related; no sense-annotated corpus is needed. The fundamental idea is to embed meaning representations of senses in the same continuous-valued vector space as the representations of words. In this way, the knowledge encoded in the lexical resource is combined with the information derived by the distributional methods. Once this step has been carried out, the sense representations can be plugged back into, e.g., the skip-gram model, which allows us to compute scores for the different possible senses of a word in a given context. We evaluated the new word sense disambiguation system on two Swedish test sets annotated with senses defined by the SALDO lexical resource. In both evaluations, our system soundly outperformed random and first-sense baselines. Its accuracy was slightly above that of a well-known graph-based system, while being computationally much more efficient.
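The scoring step described in the abstract can be illustrated with a minimal sketch. It assumes that sense vectors already live in the same space as the skip-gram context vectors, and that a sense is scored by summing its dot products with the context vectors and normalising with a softmax over the candidate senses. The abstract does not spell out the exact scoring formula, so this is one plausible reading rather than the authors' implementation. The function names, sense identifiers, and toy vectors are all hypothetical.

```python
import math


def score_senses(sense_vectors, context_vectors):
    """Return a normalised score for each candidate sense.

    sense_vectors: dict mapping a (hypothetical) sense id to its embedding.
    context_vectors: list of context-word embeddings from the same space.
    """
    # Assumed skip-gram style score: sum of dot products between the
    # sense vector and each context word vector.
    raw = {
        sense: sum(
            sum(s * c for s, c in zip(vec, ctx)) for ctx in context_vectors
        )
        for sense, vec in sense_vectors.items()
    }
    # Softmax over the candidate senses, shifted by the max for stability.
    top = max(raw.values())
    exp = {sense: math.exp(value - top) for sense, value in raw.items()}
    total = sum(exp.values())
    return {sense: value / total for sense, value in exp.items()}


def disambiguate(sense_vectors, context_vectors):
    """Pick the sense with the highest score in the given context."""
    scores = score_senses(sense_vectors, context_vectors)
    return max(scores, key=scores.get)


# Toy, made-up 2-dimensional vectors for two senses of one word.
senses = {"sense_a": [1.0, 0.0], "sense_b": [0.0, 1.0]}
context = [[0.9, 0.1], [0.8, 0.3]]

scores = score_senses(senses, context)
best = disambiguate(senses, context)
```

Under these assumptions, no sense-annotated data is needed at prediction time: the only inputs are the pretrained sense vectors and the vectors of the surrounding words.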

Richard Johansson | Luis Nieto Piña
