A Decade of Scholarly Research on Open Knowledge Graphs

The proliferation of open knowledge graphs has led to a surge in scholarly research on the topic over the past decade. This paper presents a bibliometric analysis of the scholarly literature on open knowledge graphs published between 2013 and 2023. The study aims to identify the trends, patterns, and impact of research in this field, as well as the key topics and research questions that have emerged. The work uses bibliometric techniques to analyze a sample of 4445 scholarly articles retrieved from Scopus. The findings reveal an ever-increasing number of publications on open knowledge graphs published every year, particularly in developed countries (+50 per year). These outputs are published in highly-referred scholarly journals and conferences. The study identifies three main research themes: (1) knowledge graph construction and enrichment, (2) evaluation and reuse, and (3) fusion of knowledge graphs into NLP systems. Within these themes, the study identifies specific tasks that have received considerable attention, including entity linking, knowledge graph embedding, and graph neural networks.

[1]  Lei Zou,et al.  Knowledge Graph Quality Management: A Comprehensive Survey , 2023, IEEE Transactions on Knowledge and Data Engineering.

[2]  Taehyun Ha An explainable artificial-intelligence-based approach to investigating factors that influence the citation of papers , 2022, Technological Forecasting and Social Change.

[3]  E. Simperl,et al.  A Decade of Knowledge Graphs in Natural Language Processing: A Survey , 2022, AACL.

[4]  D. Mietchen,et al.  Using logical constraints to validate statistical information about disease outbreaks in collaborative knowledge graphs: the case of COVID-19 epidemiology in Wikidata , 2022, PeerJ Comput. Sci..

[5]  Shilpa Verma,et al.  Scholarly knowledge graphs through structuring scholarly communication: a review , 2022, Complex & Intelligent Systems.

[6]  Stefan Schlobach,et al.  Knowledge graphs as tools for explainable machine learning: A survey , 2021, Artif. Intell..

[7]  Michele Bevilacqua,et al.  Ten Years of BabelNet: A Survey , 2021, IJCAI.

[8]  Haoran Xie,et al.  Topic analysis and development in knowledge graph research: A bibliometric review on three decades , 2021, Neurocomputing.

[9]  Erik Cambria,et al.  Knowledge graph representation and reasoning , 2021, Neurocomputing.

[10]  Shirui Pan,et al.  Graph Learning: A Survey , 2021, IEEE Transactions on Artificial Intelligence.

[11]  P. Lambin,et al.  Knowledge Graphs for COVID-19: An Exploratory Review of the Current Landscape , 2021, Journal of personalized medicine.

[12]  Christoph Lange,et al.  A comprehensive quality assessment framework for scientific events , 2020, Scientometrics.

[13]  Rolf Schwitter,et al.  A survey on automatically constructed universal knowledge bases , 2020, J. Inf. Sci..

[14]  J. Homolak,et al.  Preliminary analysis of COVID-19 academic information patterns: a call for open science in the times of closed borders , 2020, Scientometrics.

[15]  Jeff Z. Pan,et al.  Knowledge-Driven Intelligent Survey Systems Towards Open Science , 2020, New Generation Computing.

[16]  Steffen Staab,et al.  Knowledge graphs , 2021, Commun. ACM.

[17]  Grégoire Côté,et al.  Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies , 2020, Quantitative Science Studies.

[18]  Magnus Nyström,et al.  Adversarial Machine Learning-Industry Perspectives , 2020, 2020 IEEE Security and Privacy Workshops (SPW).

[19]  Houcemeddine Turki,et al.  Wikidata: A large-scale collaborative ontological medical database , 2019, J. Biomed. Informatics.

[20]  Yannis Tzitzikas,et al.  Large-scale Semantic Integration of Linked Data , 2019, ACM Comput. Surv..

[21]  Marçal Mora Cantallops,et al.  A systematic literature review on Wikidata , 2019, Data Technol. Appl..

[22]  Finn Årup Nielsen,et al.  Ordia: A Web Application for Wikidata Lexemes , 2019, ESWC.

[23]  Fabien L. Gandon,et al.  A survey of the first 20 years of research on semantic Web and linked data , 2018, Ingénierie des Systèmes d Inf..

[24]  M. Kotsemir,et al.  Research landscape of the BRICS countries: current trends in research output, thematic structures of publications, and the relative influence of partners , 2018, Scientometrics.

[25]  Achim Rettinger,et al.  Linked data quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO , 2017, Semantic Web.

[26]  Ronald M. Summers,et al.  ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases , 2019, Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics.

[27]  Heiko Paulheim,et al.  Knowledge graph refinement: A survey of approaches and evaluation methods , 2016, Semantic Web.

[28]  Gerhard Weikum,et al.  YAGO: A Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames , 2016, SEMWEB.

[29]  Kristian Fog Nielsen,et al.  Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking , 2016, Nature Biotechnology.

[30]  Mark Sanderson,et al.  Conferences versus journals in computer science , 2015, J. Assoc. Inf. Sci. Technol..

[31]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[32]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[33]  Joanna L. Sharman,et al.  The IUPHAR/BPS Guide to PHARMACOLOGY: an expert-driven knowledgebase of drug targets and their ligands , 2013, Nucleic Acids Res..

[34]  R. Rousseau,et al.  Theory and practice of the shifted Lotka function , 2012, Scientometrics.

[35]  M. Deakin,et al.  The Triple-Helix Model of Smart Cities: A Neo-Evolutionary Perspective , 2011 .

[36]  Ed C. M. Noyons,et al.  A unified approach to mapping and clustering of bibliometric networks , 2010, J. Informetrics.

[37]  C.-H. Liu,et al.  Ontology-Based Context Representation and Reasoning Using OWL and SWRL , 2010, 2010 8th Annual Communication Networks and Services Research Conference.

[38]  Ludo Waltman,et al.  Software survey: VOSviewer, a computer program for bibliometric mapping , 2009, Scientometrics.

[39]  Claudio Gutiérrez,et al.  The Expressive Power of SPARQL , 2008, SEMWEB.

[40]  Yuh-Shan Ho,et al.  Use of citation per publication as an indicator to evaluate pentachlorophenol research , 2008, Scientometrics.

[41]  Yuh-Shan Ho,et al.  Use of citation per publication as an indicator to evaluate contingent valuation research , 2008, Scientometrics.

[42]  J. Burnham Scopus database: a review , 2006, Biomedical digital libraries.

[43]  Jürgen Umbrich,et al.  Introduction: What Is a Knowledge Graph? , 2020 .

[44]  Heiko Paulheim,et al.  Semantic Web in data mining and knowledge discovery: A comprehensive survey , 2016, J. Web Semant..

[45]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.