Mining Taverna's semantic web of provenance

Taverna is a workflow workbench developed as part of the UK's myGrid project. Taverna's provenance model captures both internal provenance, generated locally in Taverna, and external provenance, gathered from third-party data providers. The model also supports overlaying secondary provenance on the primary logs and lineage. This design is motivated by the particular properties of the bioinformatics data and services used in Taverna. A Semantic Web of provenance, Ouzo, combines these different kinds of provenance by means of semantic annotations. This paper shows how Ouzo can be mined by a provenance usage component, Provenance Query and Answer (ProQA). ProQA supports provenance retrieval as well as provenance abstraction, aggregation, and semantic reasoning. ProQA is implemented as a suite of APIs that can be deployed as provenance services to compose system provenance workflows, which analyse experiment results using the provenance records. We show how these features of Taverna's provenance help us answer the questions from the provenance challenge workshop and a set of additional provenance queries. Copyright © 2007 John Wiley & Sons, Ltd.
