Neural Vector Conceptualization for Word Vector Space Interpretation

Distributed word vector spaces are considered hard to interpret, which hinders the understanding of natural language processing (NLP) models. In this work, we introduce a new method for interpreting arbitrary samples from a word vector space. To this end, we train a neural model to conceptualize word vectors: given a vector, it activates the higher-order concepts it recognizes in it. Unlike prior approaches, our model operates in the original vector space and is capable of learning non-linear relations between word vectors and concepts. Furthermore, we show that it produces concept activation profiles with considerably lower entropy than the popular cosine similarity.
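
Below is a minimal sketch of the idea described in the abstract: a small feed-forward network that maps a word vector to activations over a set of higher-order concepts. The layer sizes, the sigmoid multi-label output, the binary cross-entropy objective, and all names are illustrative assumptions rather than details taken from the paper.

```python
# Illustrative sketch (assumed architecture, not the paper's exact model):
# a feed-forward network mapping word vectors to concept activations.
import torch
import torch.nn as nn


class VectorConceptualizer(nn.Module):
    def __init__(self, embedding_dim: int = 300, num_concepts: int = 500, hidden_dim: int = 768):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(embedding_dim, hidden_dim),
            nn.ReLU(),  # non-linearity lets the model capture non-linear vector-concept relations
            nn.Linear(hidden_dim, num_concepts),
        )

    def forward(self, word_vectors: torch.Tensor) -> torch.Tensor:
        # One activation per concept, squashed into [0, 1].
        return torch.sigmoid(self.net(word_vectors))


if __name__ == "__main__":
    model = VectorConceptualizer()
    criterion = nn.BCELoss()  # multi-label objective (assumption)
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

    # Toy batch: random "word vectors" with sparse random concept labels, purely for illustration.
    x = torch.randn(8, 300)
    y = (torch.rand(8, 500) > 0.95).float()

    activations = model(x)        # concept activation profile for each input vector
    loss = criterion(activations, y)
    loss.backward()
    optimizer.step()
    print(activations.shape, float(loss))
```

In this reading, interpreting an arbitrary vector amounts to a forward pass: the per-concept activations form the profile that the abstract contrasts with cosine-similarity-based neighbor lists.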
