A Semantic Framework for the Management of Enriched Provenance Logs

Provenance is a key metadata for assessing electronic documents trustworthiness. It gives an indicator on the reliability and the quality of the document content. Most of the applications exchanging and processing documents on the web or in clouds become provenance aware and provides heterogeneous, decentralized and not interoperable provenance data. Most of provenance management systems are either dedicated to a specific application (workflow, database) or a specific data type. Those systems were not conceived to support provenance over distributed and open sources. Hence, collecting and querying provenance from heterogeneous sources is always a challenging task. This work presents a new provenance management framework based on semantic web technologies. It allows to import provenance sources, to enrich them semantically to obtain high level representation of provenance. It supports semantic correlation between different provenance sources and allows the use of a high level semantic query language.

[1]  Sudha Ram,et al.  A New Perspective on Semantics of Data Provenance , 2009, SWPM.

[2]  Yogesh L. Simmhan,et al.  The Open Provenance Model core specification (v1.1) , 2011, Future Gener. Comput. Syst..

[3]  Margo Seltzer,et al.  Foundations for provenance-aware systems , 2010 .

[4]  Marianne Winslett,et al.  Preventing history forgery with secure provenance , 2009, TOS.

[5]  Juliana Freire,et al.  Provenance and scientific workflows: challenges and opportunities , 2008, SIGMOD Conference.

[6]  Amit P. Sheth,et al.  Semantic Provenance for eScience: Managing the Deluge of Scientific Data , 2008, IEEE Internet Computing.

[7]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[8]  Seán O'Riain,et al.  Prov4J: A Semantic Web Framework for Generic Provenance Management , 2010, SWPM@ISWC.

[9]  Yurdaer N. Doganata,et al.  Business Provenance - A Technology to Increase Traceability of End-to-End Operations , 2008, OTM Conferences.

[10]  Paul T. Groth,et al.  An Architecture for Provenance Systems , 2006 .

[11]  Krishnaprasad Thirunarayan,et al.  PrOM: A Semantic Web Framework for Provenance Management in Science , 2009 .

[12]  Luc Moreau,et al.  The Foundations for Provenance on the Web , 2010, Found. Trends Web Sci..

[13]  Bruno Defude,et al.  Document Provenance in the Cloud: Constraints and Challenges , 2010, EUNICE.

[14]  Margo I. Seltzer,et al.  Provenance as first class cloud data , 2010, OPSR.

[15]  Cláudio T. Silva,et al.  Provenance for Computational Tasks: A Survey , 2008, Computing in Science & Engineering.

[16]  Parag Agrawal,et al.  Trio: a system for data, uncertainty, and lineage , 2006, VLDB.

[17]  Yogesh L. Simmhan,et al.  A survey of data provenance in e-science , 2005, SGMD.

[18]  Olaf Hartig Provenance Information in the Web of Data , 2009, LDOW.