On the Use of Semantic Annotations for Supporting Provenance in Grids

There has seen a strong demand for provenance in grid applications, which enables users to trace how a particular result has been arrived at by identifying the resources, configurations and execution settings. In this paper we analyses the requirements of provenance support and discusses the nature and characteristics of provenance data on the Grid. We define a new conception called augmented provenance that enhances conventional provenance data with extensive metadata and semantics. A hybrid approach is proposed for the creation and management of augmented provenance in which semantic annotation is used to generate semantic provenance data and the database management system is used for execution data management. The approach has been applied to a real world application, and tools and GUIs are developed to facilitate provenance management and exploitation.

[1]  York Sure-Vetter,et al.  Evaluation of Ontology-based Tools (EON 2003) : Proceedings of the 2nd International Workshop on Evaluation of Ontology-based Tools, held at the 2nd International Semantic Web Conference ISWC 2003, 20th October 2003 (Workshop day), Sundial Resort, Sanibel Island, Florida, USA , 2003 .

[2]  D. Lanter Design of a Lineage-Based Meta-Data Base for GIS , 1991 .

[3]  Yong Zhao,et al.  Chimera: a virtual data system for representing, querying, and automating data derivation , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[4]  Volker Haarslev,et al.  Racer: A Core Inference Engine for the Semantic Web , 2003, EON.

[5]  Simon J. Cox,et al.  Empowering Resource Providers to Build the Semantic Grid , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[6]  Carole A. Goble,et al.  Using Semantic Web Technologies for Representing E-science Provenance , 2004, SEMWEB.

[7]  Simon J. Cox,et al.  Tools and support for deploying applications on the grid , 2004, IEEE International Conference onServices Computing, 2004. (SCC 2004). Proceedings. 2004.

[8]  Marco Danelutto,et al.  Euro-Par 2004 Parallel Processing , 2004, Lecture Notes in Computer Science.

[9]  Robert Meersman,et al.  On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE , 2003, Lecture Notes in Computer Science.

[10]  James Frew,et al.  Earth System Science Workbench: a data management infrastructure for earth science products , 2001, Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001.

[11]  Jeffrey M. Bradshaw,et al.  Applying KAoS Services to Ensure Policy Compliance for Semantic Web Services Workflow Composition and Enactment , 2004, SEMWEB.

[12]  Simon J. Cox,et al.  Databases, Workflows and the Grid in a Service Oriented Environment , 2004, Euro-Par.

[13]  Rajendra Bose A conceptual framework for composing and managing scientific data lineage , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[14]  Michael Luck,et al.  Logical architecture strawman for provenance systems , 2005 .

[15]  James D. Myers,et al.  Re-integrating the research record , 2003, Comput. Sci. Eng..

[16]  Carole A. Goble,et al.  Managing Semantic Metadata for Web/Grid Services , 2006, Int. J. Web Serv. Res..

[17]  I. Horrocks,et al.  The Instance Store: DL Reasoning with Large Numbers of Individuals , 2004, Description Logics.

[18]  Luc Moreau,et al.  Recording and Reasoning over Data Provenance in Web and Grid Services , 2003, OTM.

[19]  Nicholas Gibbins,et al.  3store: Efficient Bulk RDF Storage , 2003, PSSS.

[20]  Sanjeev Khanna,et al.  Why and Where: A Characterization of Data Provenance , 2001, ICDT.

[21]  Bharat K. Bhargava,et al.  E-notebook Middleware for Accountability and Reputation Based Trust in Distributed Data Sharing Communities , 2004, iTrust.

[22]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.