SPBv: Benchmarking Linked Data Archiving Systems

As Linked Open Data (LOD) datasets are constantly evolving, both at schema and instance level, there is a need for systems that e ciently support storing and querying of such evolving data. However, there is a limited number of such systems and even fewer benchmarks that test their performance. In this paper, we describe in detail the rst version of the SPBv benchmark developed in the context of the HOBBIT EU H2020 project. SPBv aims to test the ability of archiving systems to e ciently manage evolving Linked Data datasets and queries evaluated across multiple versions of these datasets. We discuss the benchmark data generator and the query workload, and we describe a set of experiments we conducted with Virtuoso and R43ples systems.

[1]  Tudor Groza,et al.  SemVersion: RDF-based ontology versioning system , 2006 .

[2]  Steve Cassidy,et al.  Version Control for RDF Triple Stores , 2007, ICSOFT.

[3]  Jeff Heflin,et al.  LUBM: A benchmark for OWL knowledge base systems , 2005, J. Web Semant..

[4]  Christos Pateritsas,et al.  A query language for multi-version data web archives , 2016, Expert Syst. J. Knowl. Eng..

[5]  Rik Van de Walle,et al.  R&Wbase: git for triples , 2013, LDOW.

[6]  Sang-Won Lee,et al.  A Version Management Framework for RDF Triple Stores , 2012, Int. J. Softw. Eng. Knowl. Eng..

[7]  Kostas Stefanidis,et al.  On Designing Archiving Policies for Evolving RDF Datasets on the Web , 2014, ER.

[8]  Jürgen Umbrich,et al.  Evaluating query and storage strategies for RDF archives , 2019, Semantic Web.

[9]  Jürgen Umbrich,et al.  Towards Dataset Dynamics: Change Frequency of Linked Open Data Sources , 2010, LDOW.

[10]  Jürgen Umbrich,et al.  BEAR: Benchmarking the Efficiency of RDF Archiving , 2015 .

[11]  Kostas Stefanidis,et al.  Versioning for Linked Data: Archiving Systems and Benchmarks , 2016, BLINK@ISWC.

[12]  George Papastefanatos,et al.  The EvoGen Benchmark Suite for Evolving RDF Data , 2016, MEPDaW/LDQ@ESWC.

[13]  Harald Sack,et al.  TailR: a platform for preserving history on the web of data , 2015, SEMANTICS.

[14]  Enrico Motta,et al.  Ontology evolution: a process-centric survey , 2013, The Knowledge Engineering Review.

[15]  Thomas Neumann,et al.  TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark , 2013, TPCTC.

[16]  Atanas Kiryakov,et al.  Benchmarking RDF Query Engines: The LDBC Semantic Publishing Benchmark , 2016, BLINK@ISWC.

[17]  Dimitris Plexousakis,et al.  Ontology evolution without tears , 2013, J. Web Semant..

[18]  Jürgen Umbrich,et al.  Observing Linked Data Dynamics , 2013, ESWC.

[19]  Leon Urbas,et al.  R43ples: Revisions for Triples - An Approach for Version Control in the Semantic Web , 2014, LDQ@SEMANTICS.

[20]  Gerhard Weikum,et al.  x-RDF-3X , 2010, Proc. VLDB Endow..