Analysis of approaches for supporting the Open Provenance Model: A case study of the Trident workflow workbench

The Trident workbench is a platform for composing, executing, and managing scientific workflows. While Trident collects provenance in its native provenance model, the Third Provenance Challenge was an opportunity to build support for the Open Provenance Model (OPM) into Trident. There are several possible approaches to harmonizing our native model with OPM, and the same choices are available to other existing provenance and workflow systems working towards OPM compatibility. We identify and analyze the relative merits of these approaches in an effort to inform practitioners planning to support OPM in their existing provenance/workflow systems. Further, we describe our experience using the integration approach we chose to interoperate with other teams as part of the challenge.
