Options for a Federated, Cloud-based Sy Knowledgebase

Systems biology is characterized by a larg scientists who use a wide variety of fragmente data sets and computational tools of all scales research. In order to provide a more coheren environment for systems biology, we are wo the Department of Energy Systems Biology (Kbase) project to define a federated clou architecture. The Kbase will eventually host m of biological data, provide high performan computational resources, and support a large with tools and services to enable them to u resources. In this paper, we describe the investigations into the design of a workflo suitable for use in the Kbase. The approach u based workflow description and open so technologies, and incorporates a data a execution layer for exploiting data locality architecture. We describe a use case and the implementation we have built that demonstrat of our approach.

[1]  Ian T. Foster,et al.  Building Scientific Workflow with Taverna and BPEL: A Comparative Study in caGrid , 2009, ICSOC Workshops.

[2]  Carole A. Goble,et al.  myExperiment: a repository and social network for the sharing of bioinformatics workflows , 2010, Nucleic Acids Res..

[3]  Sam Ruby,et al.  RESTful Web Services , 2007 .

[4]  Manish Parashar,et al.  Latency Performance of SOAP Implementations , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[5]  Norman W. Paton,et al.  Adaptive workflow processing and execution in Pegasus , 2009 .

[6]  William R. Cannon,et al.  Physicochemical/Thermodynamic Framework for the Interpretation of Peptide Tandem Mass Spectra† , 2010 .

[7]  Jano I. van Hemert,et al.  Orchestrating Data-Centric Workflows , 2008, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID).