Validation of SHACL Constraints over KGs with OWL 2 QL Ontologies via Rewriting

Constraints have traditionally been used to ensure data quality. Recently, several constraint languages, such as SHACL, and mechanisms for constraint validation have been proposed for Knowledge Graphs (KGs). KGs are often enhanced with ontologies that define relevant background knowledge in a formal language such as OWL 2 QL. However, existing systems for constraint validation either ignore these ontologies or compile ontologies and constraints into rules that must be executed by a rule engine. In the latter case, one has to rely on different systems when validating constraints over KGs and over ontology-enhanced KGs. In this work, we address this problem by defining rewriting techniques that compile an OWL 2 QL ontology and a set of SHACL constraints into another set of SHACL constraints. We show that in the general case the rewriting may not exist, but it always exists for the positive fragment of SHACL. Our rewriting techniques make it possible to validate constraints over KGs with and without ontologies using the same SHACL validation engines.
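To make the idea of compiling an ontology into the constraints concrete, the following is a minimal toy sketch in Python. It is not the rewriting algorithm of this work. It assumes a drastically simplified setting: the ontology contains only atomic subclass and subproperty axioms, and a constraint is a single positive shape of the form "every instance of a target class has some value for a required property". All names (`subsumees`, `rewrite_shape`, `validate`, and the example vocabulary such as `Turbine` and `hasSerialNumber`) are hypothetical and chosen for illustration.

```python
from collections import defaultdict


def subsumees(name, axioms):
    """Return `name` plus every term that the (sub, sup) axioms place below it."""
    children = defaultdict(set)
    for sub, sup in axioms:
        children[sup].add(sub)
    seen, stack = {name}, [name]
    while stack:
        for child in children[stack.pop()]:
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


def rewrite_shape(shape, class_axioms, property_axioms):
    """Fold the axioms into the shape so the ontology is no longer needed."""
    return {
        # Instances of any subclass are also targets of the shape.
        "target_classes": sorted(subsumees(shape["target_class"], class_axioms)),
        # A value for any subproperty also satisfies the requirement.
        "required_any_of": sorted(
            subsumees(shape["required_property"], property_axioms)
        ),
    }


def validate(graph, rewritten):
    """Check the plain graph (a set of triples); return the violating nodes."""
    targets = {
        s for s, p, o in graph
        if p == "rdf:type" and o in rewritten["target_classes"]
    }
    satisfied = {s for s, p, _ in graph if p in rewritten["required_any_of"]}
    return sorted(targets - satisfied)


# Hypothetical example data (assumptions, not taken from the paper).
class_axioms = [("Turbine", "Equipment")]
property_axioms = [("hasSerialNumber", "hasIdentifier")]
shape = {"target_class": "Equipment", "required_property": "hasIdentifier"}
graph = {
    ("t1", "rdf:type", "Turbine"),
    ("t1", "hasSerialNumber", "S1"),
    ("t2", "rdf:type", "Turbine"),
}

rewritten = rewrite_shape(shape, class_axioms, property_axioms)
violations = validate(graph, rewritten)

# The same validator applied to the original shape, ignoring the ontology.
naive = rewrite_shape(shape, [], [])
naive_violations = validate(graph, naive)
```

In this sketch the rewritten shape reports `t2` as a violation, while validating the original shape without the ontology finds no target nodes at all and therefore reports nothing. The same `validate` function is used in both cases, which mirrors the goal stated above of reusing one validation engine for KGs with and without ontologies.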
