Astrea: Automatic Generation of SHACL Shapes from Ontologies

Knowledge Graphs (KGs) that publish RDF data modelled using ontologies in a wide range of domains have populated the Web. The SHACL language is a W3C recommendation that has been endowed to encode a set of either value or model data restrictions that aim at validating KG data, ensuring data quality. Developing shapes is a complex and time consuming task that is not feasible to achieve manually. This article presents two resources that aim at generating automatically SHACL shapes for a set of ontologies: (1) Astrea-KG, a KG that publishes a set of mappings that encode the equivalent conceptual restrictions among ontology constraint patterns and SHACL constraint patterns, and (2) Astrea, a tool that automatically generates SHACL shapes from a set of ontologies by executing the mappings from the Astrea-KG. These two resources are openly available at Zenodo, GitHub, and a web application. In contrast to other proposals, these resources cover a large number of SHACL restrictions producing both value and model data restrictions, whereas other proposals consider only a limited number of restrictions or focus only on value or model restrictions.

[1]  Sebastian Hellmann,et al.  Inference of Latent Shape Expressions Associated to DBpedia Ontology , 2018, International Semantic Web Conference.

[2]  Peter F. Patel-Schneider,et al.  OWL 2 Web Ontology Language Primer (Second Edition) , 2012 .

[3]  Aldo Gangemi,et al.  Ontology Design Patterns , 2005 .

[4]  Simon Cox,et al.  Time Ontology in OWL , 2017 .

[5]  Dimitris Kontokostas,et al.  Validating RDF Data , 2017, Validating RDF Data.

[6]  Armin Haller,et al.  Semantic Sensor Network Ontology , 2017 .

[7]  Declan O'Sullivan,et al.  Using Ontology Design Patterns To Define SHACL Shapes , 2018, WOP@ISWC.

[8]  Smartm2m; Smart Appliances; Reference Ontology and Onem2m Mapping , .

[9]  Andrea Maurino,et al.  Towards Improving the Quality of Knowledge Graphs with Data-driven Ontology Patterns and SHACL , 2018, WOP@ISWC.

[10]  Marco Torchiano,et al.  RDF shape induction using knowledge base profiling , 2018, SAC.

[11]  José Emilio Labra Gayo,et al.  Shape Designer for ShEx and SHACL constraints , 2019, ISWC Satellites.

[12]  Christoph Lange,et al.  Evaluating the quality of the LOD cloud: An empirical investigation , 2018, Semantic Web.

[13]  Andrea Maurino,et al.  Topic profiling benchmarks in the linked open data cloud: Issues and lessons learned , 2019, Semantic Web.

[14]  Armin Haller,et al.  The modular SSN ontology: A joint W3C and OGC standard specifying the semantics of sensors, observations, sampling, and actuation , 2018, Semantic Web.

[15]  Daniel Fernández-Álvarez,et al.  Challenges in RDF Validation , 2019, Studies in Computational Intelligence.

[16]  Harold R. Solbrig,et al.  Validating RDF with Shape Expressions , 2014, ArXiv.