Challenges in RDF Validation

The RDF data model forms a cornerstone of the Semantic Web technology stack. Although there have been different proposals for RDF serialization syntaxes, the underlying simple data model enables great flexibility which allows it to be successfully employed in many different scenarios and to form the basis on which other technologies are developed. In order to apply an RDF-based approach in practice it is necessary to communicate the structure of the data that is being stored or represented. Data quality is of paramount importance for the acceptance of RDF as a data representation language and it must be enabled by the use of tools that can check if some data conforms to some specific structure. There have been several recent proposals for RDF validation languages like ShEx and SHACL. In this chapter, we describe both proposals and enumerate some challenges and trends that we foresee with regards to RDF validation. We devote more space to what we consider one of the main challenges, which is to compare ShEx and SHACL and to understand their underlying foundations. To that end, we propose an intermediate language and show how ShEx and SHACL can be converted to it.

[1]  Harold R. Solbrig,et al.  Shape expressions: an RDF validation and transformation language , 2014, SEM '14.

[2]  Dimitris Kontokostas,et al.  Validating RDF Data , 2017, Validating RDF Data.

[3]  Rik Van de Walle,et al.  RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data , 2014, LDOW.

[4]  Irena Holubová,et al.  jInfer: A Framework for XML Schema Inference , 2015, Computer/law journal.

[5]  Ruben Verborgh,et al.  Declarative Rules for Linked Data Generation at Your Fingertips! , 2018, ESWC.

[6]  S. Mikulionienė Timing of Transition to Adulthood in Lithuania: the Experience of Four Cohorts , 2020 .

[7]  Michael Hausenblas,et al.  Describing Linked Datasets , 2009, LDOW.

[8]  Young-Koo Lee,et al.  Transforming Valid XML Documents into RDF via RDF Schema , 2007, Third International Conference on Next Generation Web Services Practices (NWeSP'07).

[9]  Aidan Hogan,et al.  SPORTAL: Profiling the Content of Public SPARQL Endpoints , 2016, Int. J. Semantic Web Inf. Syst..

[10]  Arthur G. Ryman,et al.  OSLC Resource Shape: A language for defining constraints on Linked Data , 2013, LDOW.

[11]  F. Breitling A standard transformation from XML to RDF via XSLT , 2009, ArXiv.

[12]  David Pearce,et al.  Stable reasoning , 2017, J. Appl. Non Class. Logics.

[13]  Rinke Hoekstra,et al.  Meta-data for a lot of LOD , 2017, Semantic Web.

[14]  E. Mannens,et al.  XML to RDF Conversion: A Generic Approach , 2008, 2008 International Conference on Automated Solutions for Cross Media Content and Multi-Channel Distribution.

[15]  A. Gray Validata : A tool for testing profile conformance , 2016 .

[16]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[17]  Egor V. Kostylev,et al.  SPARQL with Property Paths , 2015, SEMWEB.

[18]  Daniele Braga,et al.  C-SPARQL: a Continuous Query Language for RDF Data Streams , 2010, Int. J. Semantic Comput..

[19]  Stefan Decker,et al.  Mapping between RDF and XML with XSPARQL , 2012, Journal on Data Semantics.

[20]  François Goasdoué,et al.  Query-Oriented Summarization of RDF Graphs , 2015, BICOD.

[21]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[22]  Iovka Boneva Comparative expressiveness of ShEx and SHACL (Early working draft) , 2016 .

[23]  Raphael Volz,et al.  A Comparison of RDF Query Languages , 2004, SEMWEB.

[24]  Yo-Sub Han,et al.  Inferring a Relax NG Schema from XML Documents , 2016, LATA.

[25]  Iovka Boneva,et al.  Complexity and Expressiveness of ShEx for RDF , 2015, ICDT.

[26]  Ivan Herman,et al.  XSLT+SPARQL : Scripting the semantic web with SPARQL embedded into XSLT stylesheets , 2008 .

[27]  Heiko Paulheim,et al.  Detection of Relation Assertion Errors in Knowledge Graphs , 2017, K-CAP.

[28]  Peter A. Boncz,et al.  Exploiting Emergent Schemas to Make RDF Systems More Efficient , 2016, SEMWEB.

[29]  Juan L. Reutter,et al.  Semantics and Validation of Recursive SHACL [Extended Version] , 2018 .

[30]  Eric Miller,et al.  World Wide Web Consortium , 2004 .

[31]  Eric van der Vlist RELAX NG - a simpler schema language for XML , 2004 .

[32]  Sebastian Hellmann,et al.  Inference of Latent Shape Expressions Associated to DBpedia Ontology , 2018, International Semantic Web Conference.

[33]  Thomas Eiter,et al.  LARS: A Logic-Based Framework for Analytic Reasoning over Streams - (Extended Abstract) , 2015, SOFSEM.

[34]  Asunción Gómez-Pérez,et al.  Loupe - An Online Tool for Inspecting Datasets in the Linked Data Cloud , 2015, SEMWEB.

[35]  Axel Polleres,et al.  Everything you always wanted to know about blank nodes , 2014, J. Web Semant..

[36]  José Emilio Labra Gayo,et al.  XMLSchema2ShEx: Converting XML validation to RDF validation , 2020, Semantic Web.

[37]  Young-Koo Lee,et al.  Exploiting XML Schema for Interpreting XML Documents as RDF , 2008, 2008 IEEE International Conference on Services Computing.

[38]  Fajar J. Ekaputra,et al.  SHACL4P: SHACL constraints validation within Protégé ontology editor , 2016, 2016 International Conference on Data and Software Engineering (ICoDSE).

[39]  Karl Aberer,et al.  Enabling Query Technologies for the Semantic Sensor Web , 2012, Int. J. Semantic Web Inf. Syst..

[40]  Aidan Hogan,et al.  Modelling Dynamics in Semantic Web Knowledge Graphs with Formal Concept Analysis , 2018, WWW.

[41]  Jose María Álvarez Rodríguez,et al.  Validating and Describing Linked Data Portals using RDF Shape Expressions , 2014, LDQ@SEMANTICS.

[42]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[43]  Dan Brickley,et al.  Resource Description Framework (RDF) Model and Syntax Specification , 2002 .

[44]  Steve Battle Gloze : XML to RDF and back again , 2006 .

[45]  Dan Brickley,et al.  Resource Description Framework (RDF) , 2017, Encyclopedia of GIS.

[46]  C. M. Sperberg-McQueen,et al.  W3C XML Schema Definition Language (XSD) 1.1 Part 1: Structures , 2012 .

[47]  Dan Grossman,et al.  Probability type inference for flexible approximate programming , 2015, OOPSLA.

[48]  Ye Qi,et al.  Service retrieval Based on Behavior model in Pervasive Computing Environment , 2007, Third International Conference on Next Generation Web Services Practices (NWeSP'07).

[49]  José Emilio Labra Gayo,et al.  Semantics and Validation of Shapes Schemas for RDF , 2014, SEMWEB.

[50]  Danh Le Phuoc,et al.  A Native and Adaptive Approach for Unified Processing of Linked Streams and Linked Data , 2011, SEMWEB.

[51]  Thomas Eiter,et al.  Ticker: A system for incremental ASP-based stream reasoning* , 2017, Theory and Practice of Logic Programming.

[52]  Ognjen Savkovic,et al.  Semantics and Validation of Recursive SHACL , 2018, SEMWEB.