A Survey on State-of-the-art Techniques for Knowledge Graphs Construction and Challenges ahead

Global datasphere is increasing fast, and it is expected to reach 175 Zettabytes by 20251. However, most of the content is unstructured and is not understandable by machines. Structuring this data into a knowledge graph enables multitudes of intelligent applications such as deep question answering, recommendation systems, semantic search, etc. The knowledge graph is an emerging technology that allows logical reasoning and uncovers new insights using content along with the context. Thereby, it provides necessary syntax and reasoning semantics that enable machines to solve complex healthcare, security, financial institutions, economics, and business problems. As an outcome, enterprises are putting their effort into constructing and maintaining knowledge graphs to support various downstream applications. Manual approaches are too expensive. Automated schemes can reduce the cost of building knowledge graphs up to 15-250 times. This paper critiques stateof-the-art automated techniques to produce knowledge graphs of near-human quality autonomously. Additionally, it highlights different research issues that need to be addressed to deliver high-quality knowledge graphs. Keywords—knowledge graph, knowledge graph refinements

[1]  Qi Liu,et al.  An Automatic and Rapid Knowledge Graph Construction Method of SG-CIM Model , 2020, 2020 IEEE International Conference on Smart Cloud (SmartCloud).

[2]  Heiner Stuckenschmidt,et al.  Marrying Uncertainty and Time in Knowledge Graphs , 2017, AAAI.

[3]  Christopher Ré,et al.  Tuffy: Scaling up Statistical Inference in Markov Logic Networks using an RDBMS , 2011, Proc. VLDB Endow..

[4]  Jinta Weng,et al.  Construction and Application of Teaching System Based on Crowdsourcing Knowledge Graph , 2019, CCKS.

[5]  Luc De Raedt,et al.  Inductive Logic Programming: Theory and Methods , 1994, J. Log. Program..

[6]  James Cheng,et al.  A Representation Learning Framework for Property Graphs , 2019, KDD.

[7]  Fabian M. Suchanek,et al.  AMIE: association rule mining under incomplete evidence in ontological knowledge bases , 2013, WWW.

[8]  Richong Zhang,et al.  Knowledge graphs completion via probabilistic reasoning , 2020, Inf. Sci..

[9]  Wenhan Xiong,et al.  DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning , 2017, EMNLP.

[10]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[11]  Wei Zhang,et al.  Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.

[12]  Daisy Zhe Wang,et al.  ScaLeKB: scalable learning and inference over large knowledge bases , 2016, The VLDB Journal.

[13]  Lise Getoor,et al.  Knowledge Graph Identification , 2013, SEMWEB.

[14]  Xiao Lin Rel 4 KC : A Reinforcement Learning Agent for Knowledge Graph Completion and Validation , 2019 .

[15]  Andriy Nikolov,et al.  metaphactory: A platform for knowledge graph management , 2019, Semantic Web.

[16]  Catherine Havasi,et al.  ConceptNet 5.5: An Open Multilingual Graph of General Knowledge , 2016, AAAI.

[17]  Kai Zheng,et al.  Open-world knowledge graph completion with multiple interaction attention , 2020, World Wide Web.

[18]  Heiko Paulheim,et al.  Knowledge graph refinement: A survey of approaches and evaluation methods , 2016, Semantic Web.

[19]  Dieter Fensel,et al.  Knowledge Graph Validation , 2020, ArXiv.

[20]  Hari Mohan Pandey,et al.  DAPath: Distance-aware knowledge graph reasoning based on deep reinforcement learning , 2020, Neural Networks.

[21]  Yan Jia,et al.  Multi-source Knowledge Fusion: A Survey , 2019, 2019 IEEE Fourth International Conference on Data Science in Cyberspace (DSC).

[22]  Evgeniy Gabrilovich,et al.  A Review of Relational Machine Learning for Knowledge Graphs , 2015, Proceedings of the IEEE.

[23]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[24]  Lise Getoor,et al.  A short introduction to probabilistic soft logic , 2012, NIPS 2012.

[25]  Steffen Staab,et al.  Knowledge graphs , 2021, Commun. ACM.

[26]  Max Welling,et al.  Modeling Relational Data with Graph Convolutional Networks , 2017, ESWC.

[27]  Majigsuren Enkhsaikhan,et al.  ICDM 2019 Knowledge Graph Contest: Team UWA , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[28]  Xiaojun Chen,et al.  A review: Knowledge reasoning over knowledge graph , 2020, Expert Syst. Appl..

[29]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[30]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[31]  Young-Tack Park,et al.  Path-based reasoning approach for knowledge graph completion using CNN-BiLSTM with attention mechanism , 2020, Expert Syst. Appl..

[32]  Paolo Papotti,et al.  Mining Expressive Rules in Knowledge Graphs , 2020, ACM J. Data Inf. Qual..

[33]  Tim Weninger,et al.  Open-World Knowledge Graph Completion , 2017, AAAI.

[34]  Ryutaro Ichise,et al.  T2KG: An End-to-End System for Creating Knowledge Graph from Unstructured Text , 2017, AAAI Workshops.

[35]  Divesh Srivastava,et al.  Knowledge Curation and Knowledge Fusion: Challenges, Models and Applications , 2015, SIGMOD Conference.

[36]  Zhen Wang,et al.  Knowledge Graph Embedding by Translating on Hyperplanes , 2014, AAAI.

[37]  Ryutaro Ichise,et al.  An Automatic Knowledge Graph Creation Framework from Natural Language Text , 2018, IEICE Trans. Inf. Syst..

[38]  Luigi Bellomarini,et al.  Vadalog: A modern architecture for automated reasoning with large knowledge graphs , 2020, Inf. Syst..

[39]  Miao Fan,et al.  Transition-based Knowledge Graph Embedding with Relational Mapping Properties , 2014, PACLIC.

[40]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[41]  Simon Razniewski,et al.  Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases , 2020, Found. Trends Databases.

[42]  P. Hitzler Semantic Web: A Review Of The Field , 2020 .

[43]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[44]  Jeff Z. Pan,et al.  Schema aware iterative Knowledge Graph completion , 2020, J. Web Semant..

[45]  Simon Razniewski,et al.  Structured Knowledge: Have we made progress? An extrinsic study of KB coverage over 19 years , 2020, International Conference on Information and Knowledge Management.

[46]  Jie Cao,et al.  GRL: Knowledge graph completion with GAN-based reinforcement learning , 2020, Knowl. Based Syst..

[47]  Christopher Ré,et al.  DeepDive: Web-scale Knowledge-base Construction using Statistical Learning and Inference , 2012, VLDS.

[48]  Heiner Stuckenschmidt,et al.  Time-Aware Probabilistic Knowledge Graphs , 2019, TIME.

[49]  Vítor Santos Costa,et al.  Inductive Logic Programming , 2013, Lecture Notes in Computer Science.

[50]  Ting Deng,et al.  LENA: Locality-Expanded Neural Embedding for Knowledge Base Completion , 2019, AAAI.

[51]  Wei Zhang,et al.  From Data Fusion to Knowledge Fusion , 2014, Proc. VLDB Endow..

[52]  Xindong Wu,et al.  Automatic Knowledge Graph Construction: A Report on the 2019 ICDM/ICBK Contest , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[53]  Lise Getoor,et al.  Using Semantics and Statistics to Turn Data into Knowledge , 2015, AI Mag..

[54]  Natasha Noy,et al.  Industry-scale Knowledge Graphs: Lessons and Challenges , 2019, ACM Queue.

[55]  Elena Console,et al.  Data Fusion , 2009, Encyclopedia of Database Systems.

[56]  Shuai Wang,et al.  Building Domain-Specific Knowledge Graph for Unmanned Combat Vehicle Decision Making under Uncertainty , 2019, 2019 Chinese Automation Congress (CAC).

[57]  Gholamreza Haffari,et al.  Reasoning Like Human: Hierarchical Reinforcement Learning for Knowledge Graph Reasoning , 2020, IJCAI.

[58]  Rahul Gupta,et al.  Knowledge base completion via search-based question answering , 2014, WWW.

[59]  Ruben Verborgh,et al.  Rule-driven inconsistency resolution for knowledge graph generation rules , 2019, Semantic Web.

[60]  Zhiyuan Liu,et al.  Learning Entity and Relation Embeddings for Knowledge Graph Completion , 2015, AAAI.

[61]  Tatiana Levashova,et al.  Knowledge fusion patterns: A survey , 2019, Inf. Fusion.