Knowledge Annotations in Scientific Workflows: An Implementation in Kepler

Scientific research products are the result of long-term collaborations between teams. Scientific workflows are capable of helping scientists in many ways including collecting information about how research was conducted (e.g., scientific workflow tools often collect and manage information about datasets used and data transformations). However, knowledge about why data was collected is rarely documented in scientific workflows. In this paper we describe a prototype system built to support the collection of scientific expertise that influences scientific analysis. Through evaluating a scientific research effort underway at the Pacific Northwest National Laboratory, we identified features that would most benefit PNNL scientists in documenting how and why they conduct their research, making this information available to the entire team. The prototype system was built by enhancing the Kepler Scientific Workflow System to create knowledge-annotated scientific workflows and to publish them as semantic annotations.

[1]  Dan Brickley,et al.  Resource Description Framework (RDF) Model and Syntax Specification , 2002 .

[2]  Paulo Pinheiro da Silva,et al.  e-Science 2006 - Second IEEE International Conference on e-Science and Grid Computing , 2006 .

[3]  Wendy Hall,et al.  The Semantic Web Revisited , 2006, IEEE Intelligent Systems.

[4]  Carole A. Goble,et al.  The design and realisation of the myExperiment Virtual Research Environment for social sharing of workflows , 2009, Future Gener. Comput. Syst..

[5]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[6]  Ann Q. Gates,et al.  On the Use of Abstract Workflows to Capture Scientific Process Provenance , 2010, TaPP.

[7]  Ian J. Taylor,et al.  Workflows and e-Science: An overview of workflow system features and capabilities , 2009, Future Gener. Comput. Syst..

[8]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[9]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[10]  Bertram Ludäscher,et al.  Scientific workflow management and the Kepler system: Research Articles , 2006 .

[11]  James D. Myers,et al.  Adapting the electronic laboratory notebook for the semantic era , 2005, Proceedings of the 2005 International Symposium on Collaborative Technologies and Systems, 2005..

[12]  Ann Q. Gates,et al.  Workflow-Driven Ontologies: An Earth Sciences Case Study , 2006, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06).

[13]  Martinus Oostrom,et al.  STOMP Subsurface Transport Over Multiple Phases, Version 4.0, User’s Guide , 2006 .

[14]  White,et al.  STOMP. Subsurface Transport Over Multiple Phases , 1997 .

[15]  Bertram Ludäscher,et al.  A Calculus for Propagating Semantic Annotations Through Scientific Workflow Queries , 2006, EDBT Workshops.

[16]  Jacqueline Senker,et al.  The contribution of tacit knowledge to innovation , 1993, AI & SOCIETY.