Workflows for Metabolic Flux Analysis: Data Integration and Human Interaction

Software frameworks implementing scientific workflow applications have become ubiquitous in many research fields. The most beneficial advantages of workflow-enabled applications involve automation of routine operations and distributed computing on heterogeneous systems. Particular challenges in scientific applications include grid-scale orchestration of complex tasks with interactive workflows and data management allowing for integration of heterogeneous data sets. We present a workflow for the 13C isotope-based Metabolic Flux Analysis (13C-MFA). The core of any 13C-MFA study is the metabolic network modeling workflow. It consists of sub-tasks involving model set-up and acquisition of measurement data sets within a graphical environment, the evaluation of the model equations and, finally, the visualization of data and simulation results. Human intervention and the integration of various knowledge and data sources is crucial in each step of the modeling workflow. A scientific workflow framework is presented that serves for organization and automation of complex analysis processes involved in 13C-MFA applications. By encapsulating technical details and avoiding recurrent issues, sources for errors are minimized, the evaluation procedure for 13C labeling experiments is accelerated and, moreover, becomes documentable.

[1]  Carole A. Goble,et al.  A comparison of using Taverna and BPEL in building scientific workflows: the case of caGrid , 2010, Concurr. Comput. Pract. Exp..

[2]  H. Kitano Systems Biology: A Brief Overview , 2002, Science.

[3]  Bernd Freisleben,et al.  Fault-Tolerant BPEL Workflow Execution via Cloud-Aware Recovery Policies , 2009, 2009 35th Euromicro Conference on Software Engineering and Advanced Applications.

[4]  W. Wiechert 13C metabolic flux analysis. , 2001, Metabolic engineering.

[5]  Jens Nielsen,et al.  Impact of systems biology on metabolic engineering of Saccharomyces cerevisiae. , 2008, FEMS yeast research.

[6]  W Wiechert,et al.  Bidirectional reaction steps in metabolic networks: IV. Optimal design of isotopomer labeling experiments. , 1999, Biotechnology and bioengineering.

[7]  W Wiechert,et al.  Visual exploration of isotope labeling networks in 3D , 2008, Bioprocess and biosystems engineering.

[8]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[9]  Valentin E. Brimkov,et al.  Computational modeling of objects represented in images , 2011, Graph. Model..

[10]  Jens Stoye,et al.  MeltDB: a software platform for the analysis and integration of metabolomics experiment data , 2008, Bioinform..

[11]  Wil M. P. van der Aalst,et al.  Transactions on Petri Nets and Other Models of Concurrency II, Special Issue on Concurrency in Process-Aware Information Systems , 2009, Trans. Petri Nets and Other Models of Concurrency.

[12]  Anne H. H. Ngu,et al.  Flexible Scientific Workflow Modeling Using Frames, Templates, and Dynamic Embedding , 2008, SSDBM.

[13]  U. Sauer,et al.  13C-based metabolic flux analysis , 2009, Nature Protocols.

[14]  Frank Leymann,et al.  Business Grid: Combining Web Services and the Grid , 2009, Trans. Petri Nets Other Model. Concurr..

[15]  Daniel Crawl,et al.  Workflows and extensions to the Kepler scientific workflow system to support environmental sensor data access and analysis , 2010, Ecol. Informatics.

[16]  Alexander Sczyrba,et al.  GeneFisher-P: variations of GeneFisher as processes in Bio-jETI , 2008, BMC Bioinformatics.

[17]  Thomas Friese,et al.  Flex-SwA: Flexible Exchange of Binary Data Based on SOAP Messages with Attachments , 2006, 2006 IEEE International Conference on Web Services (ICWS'06).

[18]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[19]  Yinjie J. Tang,et al.  Advances in analysis of microbial metabolic fluxes via (13)C isotopic labeling. , 2009, Mass spectrometry reviews.

[20]  Dennis Gannon,et al.  Workflows for e-Science, Scientific Workflows for Grids , 2014 .

[21]  Bernd Freisleben,et al.  LCDL: an extensible framework for wrapping legacy code , 2009, iiWAS.

[22]  Bernd Freisleben,et al.  On-Demand Resource Provisioning for BPEL Workflows Using Amazon's Elastic Compute Cloud , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[23]  Marianne Winslett,et al.  Scientific and Statistical Database Management, 21st International Conference, SSDBM 2009, New Orleans, LA, USA, June 2-4, 2009, Proceedings , 2009, SSDBM.

[24]  W. Wiechert,et al.  Bidirectional reaction steps in metabolic networks: II. Flux estimation and statistical analysis. , 1997, Biotechnology and bioengineering.

[25]  Jianwu Wang,et al.  Kepler + Hadoop: a general architecture facilitating data-intensive applications in scientific workflow systems , 2009, WORKS '09.

[26]  Wolfgang Wiechert,et al.  Customizable Visualization of Multi-omics Data in the Context of Biochemical Networks , 2009, 2009 Second International Conference in Visualisation.

[27]  Arie Shoshani,et al.  Scientific Data Management - Challenges, Technology, and Deployment , 2009, Scientific Data Management.

[28]  Tiziana Margaria,et al.  Bio-jETI: a framework for semantics-based service composition , 2009, BMC Bioinformatics.

[29]  Wolfgang Wiechert,et al.  Customizable Visualization on Demand for Hierarchically Organized Information in Biochemical Networks , 2010, CompIMAGE.