Linked Data-as-a-Service: The Semantic Web Redeployed

Ad-hoc querying is crucial to access information from Linked Data, yet publishing queryable RDF datasets on the Web is not a trivial exercise. The most compelling argument to support this claim is that the Web contains hundreds of thousands of data documents, while only 260 queryable SPARQL endpoints are provided. Even worse, the SPARQL endpoints we do have are often unstable, may not comply with the standards, and may differ in supported features. In other words, hosting data online is easy, but publishing Linked Data via a queryable API such as SPARQL appears to be too difficult. As a consequence, in practice, there is no single uniform way to query the LOD Cloud today. In this paper, we therefore combine a large-scale Linked Data publication project LOD Laundromat with a low-cost server-side interface Triple Pattern Fragments, in order to bridge the gap between the Web of downloadable data documents and the Web of live queryable data. The result is a repeatable, low-cost, open-source data publication process. To demonstrate its applicability, we made over 650,000 data documents available as data APIs, consisting of 30 billion triples.
