Retrieving Multi-Entity Associations: An Evaluation of Combination Modes for Word Embeddings

Word embeddings have gained significant attention as learnable representations of semantic relations between words, and have been shown to improve upon the results of traditional word representations. However, little effort has been devoted to using embeddings for the retrieval of entity associations beyond pairwise relations. In this paper, we use popular embedding methods to train vector representations of an entity-annotated news corpus, and evaluate their performance for the task of predicting entity participation in news events, comparing against a traditional word co-occurrence network as a baseline. To support queries for events with multiple participating entities, we test a number of combination modes for the embedding vectors. While we find that even the best combination modes for word embeddings do not quite reach the performance of the full co-occurrence network, especially for rare entities, we observe that different embedding methods model different types of relations, which indicates the potential of ensemble methods.
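As a rough illustration of what such combination modes could look like in practice, the sketch below combines hypothetical entity vectors by sum, mean, or element-wise maximum and scores a candidate entity by cosine similarity. The entity names, vector dimensionality, and the specific modes shown are illustrative assumptions, not the paper's actual setup.

```python
# Minimal sketch (not the authors' code): combining multiple entity embedding
# vectors into a single query vector for multi-entity event retrieval.
import numpy as np

def combine(vectors, mode="mean"):
    """Combine a list of entity vectors into one query vector."""
    stacked = np.vstack(vectors)
    if mode == "sum":
        return stacked.sum(axis=0)
    if mode == "mean":
        return stacked.mean(axis=0)
    if mode == "max":
        return stacked.max(axis=0)  # element-wise maximum
    raise ValueError(f"unknown combination mode: {mode}")

def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hypothetical pre-trained embeddings for two query entities and one candidate.
rng = np.random.default_rng(0)
emb = {name: rng.normal(size=100)
       for name in ["Angela Merkel", "Barack Obama", "G7 summit"]}

query = combine([emb["Angela Merkel"], emb["Barack Obama"]], mode="mean")
print(cosine(query, emb["G7 summit"]))
```

In this sketch, candidate entities would be ranked by their cosine similarity to the combined query vector; swapping the `mode` argument changes the combination strategy under test.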
