Provenance Information in the Web of Data

The openness of the Web and the ease to combine linked data from dierent sources creates new challenges. Systems that consume linked data must evaluate quality and trustworthiness of the data. A common approach for data quality assessment is the analysis of provenance information. For this reason, this paper discusses provenance of data on the Web and proposes a suitable provenance model. While traditional provenance research usually addresses the creation of data, our provenance model also represents data access, a dimension of provenance that is particularly relevant in the context of Web data. Based on our model we identify options to obtain provenance information and we raise open questions concerning the publication of provenance-related metadata for linked data on the Web.

[1]  Bertram Ludäscher,et al.  Provenance in Scientific Workflow Systems , 2007, IEEE Data Eng. Bull..

[2]  Previous version: , 2004 .

[3]  Michael Hausenblas,et al.  A Performance and Scalability Metric for Virtual RDF Graphs , 2007, SFSW.

[4]  Luc Moreau,et al.  The Open Provenance Model , 2007 .

[5]  Stefan Decker,et al.  Semantic Sitemaps: Efficient and Flexible Access to Datasets on the Semantic Web , 2008, ESWC.

[6]  Wang Chiew Tan Provenance in Databases: Past, Current, and Future , 2007, IEEE Data Eng. Bull..

[7]  Peter Haase,et al.  OMV – Ontology Metadata Vocabulary , 2005 .

[8]  Eyal Oren,et al.  Sindice.com: a document-oriented lookup index for open linked data , 2008, Int. J. Metadata Semant. Ontologies.

[9]  Sanjeev Khanna,et al.  Data Provenance: Some Basic Issues , 2000, FSTTCS.

[10]  Val Tannen,et al.  Provenance semirings , 2007, PODS.

[11]  Deborah L. McGuinness,et al.  Knowledge Provenance Infrastructure , 2003, IEEE Data Eng. Bull..

[12]  Yolanda Gil,et al.  Towards content trust of web resources , 2006, WWW '06.

[13]  Leslie Daigle,et al.  WHOIS Protocol Specification , 2004, RFC.

[14]  Bijan Parsia,et al.  Laconic and Precise Justifications in OWL , 2008, SEMWEB.

[15]  Silvio Micali,et al.  A Digital Signature Scheme Secure Against Adaptive Chosen-Message Attacks , 1988, SIAM J. Comput..

[16]  Jun Zhao,et al.  Describing Linked Datasets On the Design and Usage of voiD, the "Vocabulary Of Interlinked Datasets" , 2009 .

[17]  Andreas Harth,et al.  Towards a social provenance model for the Web , 2007 .

[18]  James Frew,et al.  Lineage retrieval for scientific data processing: a survey , 2005, CSUR.

[19]  Jeremy J. Carroll,et al.  Named graphs, provenance and trust , 2005, WWW '05.

[20]  Deborah L. McGuinness,et al.  A proof markup language for Semantic Web services , 2006, Inf. Syst..

[21]  John G. Breslin,et al.  An Architecture to Discover and Query Decentralized RDF Data , 2007, SFSW.

[22]  Dan Brickley,et al.  FOAF Vocabulary Specification , 2004 .

[23]  V. Vianu,et al.  Edinburgh Why and Where: A Characterization of Data Provenance , 2017 .

[24]  Deborah L. McGuinness,et al.  Tracking RDF Graph Provenance using RDF Molecules , 2005 .

[25]  Amit P. Sheth,et al.  Semantically Annotating a Web Service , 2007, IEEE Internet Computing.

[26]  Yogesh L. Simmhan,et al.  A survey of data provenance in e-science , 2005, SGMD.

[27]  Roy T. Fielding,et al.  Hypertext Transfer Protocol - HTTP/1.0 , 1996, RFC.

[28]  Christian Bizer,et al.  D2R Server - Publishing Relational Databases on the Semantic Web , 2004 .