KGdiff: Tracking the Evolution of Knowledge Graphs

A Knowledge Graph (KG) is a machine-readable, labeled, graph-like representation of human knowledge. Because the main goal of a KG is to represent data by enriching it with computer-processable semantics, knowledge graph creation usually involves acquiring data from external resources and datasets. In many domains, especially in biomedicine, the data sources continuously evolve, and KG engineers and domain experts must not only track the changes in KG entities and their interconnections but also introduce changes to the KG schema and the graph population software. We present a framework to track KG evolution in terms of both the schema and the individuals. KGdiff is a software tool that incrementally collects the relevant metadata from a KG and compares it to that of a prior version of the KG. The KG is represented in OWL/RDF/RDFS, and the metadata is collected using domain-independent queries. We evaluate our method on several RDF/OWL datasets (ontologies).
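The collect-then-compare idea described above can be illustrated with a minimal sketch. It is not KGdiff's actual implementation: the abstract does not specify the queries, so the sketch assumes two simple domain-independent summaries (instance counts per class and usage counts per predicate) and represents each KG version as a plain list of (subject, predicate, object) string tuples rather than a real OWL/RDF store. The names `collect_metadata`, `diff_counts`, and `kg_diff` are hypothetical.

```python
from collections import Counter

RDF_TYPE = "rdf:type"  # assumed shorthand for the rdf:type predicate


def collect_metadata(triples):
    """Summarize a KG version: instances per class and uses per predicate."""
    class_counts = Counter()
    predicate_counts = Counter()
    for _subject, predicate, obj in triples:
        if predicate == RDF_TYPE:
            class_counts[obj] += 1
        else:
            predicate_counts[predicate] += 1
    return {"classes": class_counts, "predicates": predicate_counts}


def diff_counts(old, new):
    """Return {key: (old_count, new_count)} for every key whose count changed."""
    changed = {}
    for key in sorted(set(old) | set(new)):
        before, after = old.get(key, 0), new.get(key, 0)
        if before != after:
            changed[key] = (before, after)
    return changed


def kg_diff(old_triples, new_triples):
    """Compare the metadata of a new KG version against a prior version."""
    old_meta = collect_metadata(old_triples)
    new_meta = collect_metadata(new_triples)
    return {name: diff_counts(old_meta[name], new_meta[name]) for name in old_meta}


# Hypothetical toy data: version 2 adds one individual and one new property.
v1 = [
    ("ex:p1", RDF_TYPE, "ex:Protein"),
    ("ex:p1", "ex:hasName", "A"),
]
v2 = [
    ("ex:p1", RDF_TYPE, "ex:Protein"),
    ("ex:p2", RDF_TYPE, "ex:Protein"),
    ("ex:p1", "ex:hasName", "A"),
    ("ex:p2", "ex:locatedIn", "ex:c1"),
]

report = kg_diff(v1, v2)
print(report)
```

Because the summaries depend only on generic RDF structure (types and predicates), the same comparison applies to any domain. A fuller tool would presumably run such queries against a triple store and keep the summaries from each version so that later versions can be compared incrementally.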
