论文信息 - An empirical study of semantic similarity in WordNet and Word2Vec

An empirical study of semantic similarity in WordNet and Word2Vec

This thesis performs an empirical analysis of Word2Vec by comparing its output to WordNet, a well-known, human-curated lexical database. It finds that Word2Vec tends to uncover more of certain types of semantic relations than others – with Word2Vec returning more hypernyms, synonomyns and hyponyms than hyponyms or holonyms. It also shows the probability that neighbors separated by a given cosine distance in Word2Vec are semantically related in WordNet. This result both adds to our understanding of the stillunknown Word2Vec and helps to benchmark new semantic tools built from word vectors. Word2Vec, Natural Language Processing, WordNet, Distributional Semantics

Abram Handler | Abram Handler

[1] Omer Levy,et al. word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method , 2014, ArXiv.

[2] Philip Resnik,et al. Political Ideology Detection Using Recursive Neural Networks , 2014, ACL.

[3] Ted Briscoe,et al. Looking for Hyponyms in Vector Space , 2014, CoNLL.

[4] Charles Taylor. Theories of meaning , 1980 .

[5] Jason Eisner,et al. Lexical Semantics , 2020, The Handbook of English Linguistics.

[6] Ewan Klein,et al. Natural Language Processing with Python , 2009 .

[7] Dominic Widdows,et al. Geometry and Meaning , 2004, Computational Linguistics.

[8] Petr Sojka,et al. Software Framework for Topic Modelling with Large Corpora , 2010 .

[9] J. Firth. Papers in linguistics , 1958 .

[10] Mark Stevenson,et al. The Reuters Corpus Volume 1 -from Yesterday’s News to Tomorrow’s Language Resources , 2002, LREC.

[11] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.