Do Embeddings Actually Capture Knowledge Graph Semantics?

Knowledge graph embeddings that generate vector space representations of knowledge graph triples, have gained considerable popularity in past years. Several embedding models have been proposed that achieve state-of-the-art performance for the task of triple completion in knowledge graphs. Relying on the presumed semantic capabilities of the learned embeddings, they have been leveraged for various other tasks such as entity typing, rule mining and conceptual clustering. However, a critical analysis of the utility as well as limitations of these embeddings for semantic representation of the underlying entities and relations has not been performed by previous work. In this paper, we performed a systematic evaluation of popular knowledge graph embedding models to obtain a better understanding of their semantic capabilities as compared to a non-embedding based approach. Our analysis brings attention to the fact that semantic representation in the knowledge graph embeddings is not universal, but restricted to a small subset of the entities based on dataset characteristics. We provide further insights into the reasons for this behaviour. The results of our experiments indicate that careful analysis of benefits of the embeddings needs to be performed when employing them for semantic tasks.

[1]  Chengkai Li,et al.  A benchmarking study of embedding-based entity alignment for knowledge graphs , 2020, Proc. VLDB Endow..

[2]  Zhendong Mao,et al.  Knowledge Graph Embedding: A Survey of Approaches and Applications , 2017, IEEE Transactions on Knowledge and Data Engineering.

[3]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[4]  Hans-Peter Kriegel,et al.  Factorizing YAGO: scalable machine learning for linked data , 2012, WWW.

[5]  Pasquale Minervini,et al.  Convolutional 2D Knowledge Graph Embeddings , 2017, AAAI.

[6]  Fabian M. Suchanek,et al.  YAGO3: A Knowledge Base from Multilingual Wikipedias , 2015, CIDR.

[7]  Yizhou Sun,et al.  Universal Representation Learning of Knowledge Bases by Jointly Embedding Instances and Ontological Concepts , 2019, KDD.

[8]  Guillaume Bouchard,et al.  Complex Embeddings for Simple Link Prediction , 2016, ICML.

[9]  Gerard de Melo,et al.  Explainable Link Prediction for Emerging Entities in Knowledge Graphs , 2020, SEMWEB.

[10]  Heiko Paulheim,et al.  Type Inference on Noisy RDF Data , 2013, SEMWEB.

[11]  Hao Zhu,et al.  Quantifying Similarity between Relations with Fact Distribution , 2019, ACL.

[12]  Danqi Chen,et al.  Observed versus latent features for knowledge base and text inference , 2015, CVSC.

[13]  Xiaojie Wang,et al.  Connecting Embeddings for Knowledge Graph Entity Typing , 2020, ACL.

[14]  Wolf-Tilo Balke,et al.  Knowledge Graph Consolidation by Unifying Synonymous Relationships , 2019, SEMWEB.

[15]  Rainer Gemulla,et al.  LibKGE - A knowledge graph embedding library for reproducible research , 2020, EMNLP.

[16]  Nagiza F. Samatova,et al.  Learning Entity Type Embeddings for Knowledge Graph Completion , 2017, CIKM.

[17]  Andrea Rossi,et al.  Knowledge Graph Embeddings: Are Relation-Learning Models Learning Relations? , 2020, EDBT/ICDT Workshops.

[18]  Pankaj Kumar,et al.  Type-Sensitive Knowledge Base Inference Without Explicit Type Supervision , 2018, ACL.

[19]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[20]  Evgeny Kharlamov,et al.  Rule Learning from Knowledge Graphs Guided by Embedding Models , 2018, SEMWEB.

[21]  Hans-Peter Kriegel,et al.  A Three-Way Model for Collective Learning on Multi-Relational Data , 2011, ICML.

[22]  Chengkai Li,et al.  Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental Study , 2020, SIGMOD Conference.

[23]  Rainer Gemulla,et al.  You CAN Teach an Old Dog New Tricks! On Training Knowledge Graph Embeddings , 2020, ICLR.

[24]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[25]  Daria Stepanova,et al.  Towards Utilizing Knowledge Graph Embedding Models for Conceptual Clustering , 2020, SEMWEB.

[26]  Heike Adel,et al.  ExCut: Explainable Embedding-Based Clustering over Knowledge Graphs , 2020, SEMWEB.

[27]  Heiko Paulheim,et al.  RDF2Vec: RDF Graph Embeddings for Data Mining , 2016, SEMWEB.

[28]  Philippe Cudré-Mauroux,et al.  Are Meta-Paths Necessary?: Revisiting Heterogeneous Graph Embeddings , 2018, CIKM.

[29]  Jianfeng Gao,et al.  Embedding Entities and Relations for Learning and Inference in Knowledge Bases , 2014, ICLR.

[30]  Ralf Krestel,et al.  Learning Fine-Grained Semantics for Multi-Relational Data , 2020, SEMWEB.

[31]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[32]  Jing Jiang,et al.  Attributed Graph Clustering: A Deep Attentional Embedding Approach , 2019, IJCAI.

[33]  Wei Zhang,et al.  Iteratively Learning Embeddings and Rules for Knowledge Graph Reasoning , 2019, WWW.