Work flows in life science

The introduction of computer science technology in the life science domain has resulted in a new life science discipline called bioinformatics. Bioinformaticians are biologists who know how to apply computer science technology to perform computer based experiments, also known as in-silico or dry lab experiments. Various tools, such as databases, web applications and scripting languages, are used to design and run in-silico experiments. As the size and complexity of these experiments grow, new types of tools are required to design and execute the experiments and to analyse the results. Workflow systems promise to fulfill this role. The bioinformatician composes an experiment by using tools and web services as building blocks, and connecting them, often through a graphical user interface. Workflow systems, such as Taverna, provide access to up to a few thousand resources in a uniform way. Although workflow systems are intended to make the bioinformaticians' work easier, bioinformaticians experience difficulties in using them. This thesis is devoted to find out which problems bioinformaticians experience using workflow systems and to provide solutions for these problems.

[1]  Robert Stevens,et al.  Are bioinformaticians doing e-business? , 2002 .

[2]  Thomas M. Oinn,et al.  The Taverna Interaction Service: enabling manual interaction in workflows , 2008, Bioinform..

[3]  Carole Goble,et al.  Curating Scientific Web Services and Workflows , 2008 .

[4]  Wil M. P. van der Aalst,et al.  The Application of Petri Nets to Workflow Management , 1998, J. Circuits Syst. Comput..

[5]  Abhishek Tiwari,et al.  Workflow based framework for life science informatics , 2007, Comput. Biol. Chem..

[6]  James G. R. Gilbert,et al.  The vertebrate genome annotation (Vega) database , 2004, Nucleic Acids Res..

[7]  Juliana Freire,et al.  Provenance and scientific workflows: challenges and opportunities , 2008, SIGMOD Conference.

[8]  C. Steinbeck,et al.  Recent developments of the chemistry development kit (CDK) - an open-source java library for chemo- and bioinformatics. , 2006, Current pharmaceutical design.

[9]  Carol Prior Workflow and Process Management , 2003 .

[10]  Heiko Schoof,et al.  BioMOBY Successfully Integrates Distributed Heterogeneous Bioinformatics Web Services. The PlaNet Exemplar Case1 , 2005, Plant Physiology.

[11]  Carole A. Goble,et al.  Semantically Linking and Browsing Provenance Logs for E-science , 2004, ICSNW.

[12]  L. Stein Creating a bioinformatics nation , 2002, Nature.

[13]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[14]  Jan Krüger,et al.  Playing with pesticides. , 1998, BMC Bioinformatics.

[15]  M. Ghanem,et al.  Web services in the life sciences , 2005, Drug Discovery Today.

[16]  Jacques Wainer,et al.  Scientific Workflow Systems , 1996 .

[17]  David Meredith,et al.  Evaluation of BPEL to Scientific Workflows , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[18]  Ingo H. C. Wassink,et al.  Designing Workflows on the Fly Using e-BioFlow , 2009, ICSOC/ServiceWave.

[19]  Sameer Velankar,et al.  SOAP-based services provided by the European Bioinformatics Institute , 2005, Nucleic Acids Res..

[20]  Thorsten Meinl,et al.  KNIME: The Konstanz Information Miner , 2007, GfKl.

[21]  Ian J. Taylor,et al.  Workflows and e-Science: An overview of workflow system features and capabilities , 2009, Future Gener. Comput. Syst..

[22]  Ian T. Foster,et al.  Virtual Data Language: A Typed Workflow Notation for Diversely Structured Scientific Data , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[23]  Peter Wegner,et al.  Why interaction is more powerful than algorithms , 1997, CACM.

[24]  Matthew S. Shields Control- Versus Data-Driven Workflows , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[25]  John Shalf,et al.  Enabling Applications on the Grid: A Gridlab Overview , 2003, Int. J. High Perform. Comput. Appl..

[26]  Marian Petre,et al.  Usability Analysis of Visual Programming Environments: A 'Cognitive Dimensions' Framework , 1996, J. Vis. Lang. Comput..

[27]  Carole Goble,et al.  Discovering Scientific Workflows: The myExperiment Benchmarks , 2008 .

[28]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[29]  Layna Fischer New Tools for New Times: The Workflow Paradigm , 1994 .

[30]  Anil Wipat,et al.  Experiences with e-Science workflow specification and enactment in bioinformatics , 2003 .

[31]  Karen Schuchardt,et al.  New paradigms in problem solving environments for scientific computing , 2002, IUI '02.

[32]  Karen Holtzblatt,et al.  Contextual design , 1997, INTR.

[33]  Anne E. Trefethen,et al.  e-Science and its implications , 2003, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[34]  Kees M. van Hee,et al.  Resource-Constrained Workflow Nets , 2006, Fundam. Informaticae.

[35]  Han Rauwerda,et al.  OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity , 2009, BMC proceedings.

[36]  Yaron Goland,et al.  Web Services Business Process Execution Language , 2009, Encyclopedia of Database Systems.

[37]  Cees T. A. M. de Laat,et al.  WS-VLAM: A GT4 Based Workflow Management System , 2007, International Conference on Computational Science.

[38]  Kees M. van Hee,et al.  Soundness of Resource-Constrained Workflow Nets , 2005, ICATPN.

[39]  Robert Stevens,et al.  Treating Shimantic Web Syndrome with Ontologies , 2004 .

[40]  Iain E. Buchan,et al.  Requirements Engineering for E-science: Experiences in Epidemiology , 2009, IEEE Software.

[41]  Anton Nijholt,et al.  E-BioFlow: Different Perspectives on Scientific Workflows , 2008, BIRD.

[42]  W.M.P. van der Aalst,et al.  Towards a taxonomy of process flexibility (extended version) , 2007 .

[43]  Paul T. Groth,et al.  Provenance-based validation of e-science experiments , 2005, J. Web Semant..

[44]  Cees T. A. M. de Laat,et al.  VLAM-G: a grid-based virtual laboratory , 2002, Future Gener. Comput. Syst..

[45]  Norman W. Paton,et al.  On Characterising and Identifying Mismatches in Scientific Workflows , 2006, DILS.

[46]  Bertram Ludäscher,et al.  Actor-Oriented Design of Scientific Workflows , 2005, ER.

[47]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[48]  Gustavo Alonso,et al.  Functionality and Limitations of Current Workflow Management Systems , 1997, unpublished.

[49]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[50]  Gerrit C. van der Veer,et al.  Situational awareness support to enhance teamwork in collaborative environments , 2008, ECCE '08.

[51]  Ross D King,et al.  Are the current ontologies in biology good ontologies? , 2005, Nature Biotechnology.

[52]  Paul Cairns,et al.  InfoVis Experience Enhancement though Mediated Interaction , 2005 .

[53]  Mathias Weske,et al.  Modeling samples processing in laboratory environments as scientific workflows , 1997, Database and Expert Systems Applications. 8th International Conference, DEXA '97. Proceedings.

[54]  Haruki Nakamura,et al.  Remediation of the protein data bank archive , 2007, Nucleic Acids Res..

[55]  Mathias Weske,et al.  WASA: A Workflow-Based Architecture to Support Scientific Database Applications (Extended Abstract) , 1995, DEXA.

[56]  James Frew,et al.  Lineage retrieval for scientific data processing: a survey , 2005, CSUR.

[57]  Olga Kulyk,et al.  Applying a User-centred Approach to Interactive Visualization Design , 2008 .

[58]  Christoph W. Sensen,et al.  Semantic Web Service provision: a realistic framework for Bioinformatics programmers , 2007, Bioinform..

[59]  Cláudio T. Silva,et al.  VisTrails: visualization meets data management , 2006, SIGMOD Conference.

[60]  Fabrizio Silvestri,et al.  Biological Experiments on the Grid: A Novel Workflow Management Platform , 2007, Twentieth IEEE International Symposium on Computer-Based Medical Systems (CBMS'07).

[61]  Carole A. Goble,et al.  Recycling workflows and services through discovery and reuse , 2007, Concurr. Comput. Pract. Exp..

[62]  Wil M. P. van der Aalst,et al.  Design and Implementation of the YAWL System , 2004, CAiSE.

[63]  Anton Nijholt,et al.  In The Truman Show: Generating Dynamic Scenarios in a Driving Simulator , 2006, IEEE Intelligent Systems.

[64]  Michael Luck,et al.  A Protocol for Recording Provenance in Service-Oriented Grids , 2004, OPODIS.

[65]  Jack A. M. Leunissen,et al.  Evolution of web services in bioinformatics , 2005, Briefings Bioinform..

[66]  Jan Van den Bussche,et al.  Mapping the NRC Dataflow Model to the Open Provenance Model , 2008, IPAW.

[67]  Norman W. Paton,et al.  Contextualised Workflow Execution in MyGrid , 2005, EGC.

[68]  Wil M. P. van der Aalst,et al.  Formal semantics and analysis of control flow in WS-BPEL , 2007, Sci. Comput. Program..

[69]  Ilkay Altintas,et al.  Provenance Collection Support in the Kepler Scientific Workflow System , 2006, IPAW.

[70]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[71]  Paul T. Groth,et al.  Analyzing the Gap between Workflows and their Natural Language Descriptions , 2009, 2009 Congress on Services - I.

[72]  Carole A. Goble,et al.  Automating experiments using semantic data in a bioinformatics grid , 2004, IEEE Intelligent Systems.

[73]  Paul Dourish,et al.  Process descriptions as organisational accounting devices: the dual use of workflow technologies , 2001, GROUP.

[74]  Mari Carmen Puerta Melguizo,et al.  Assessing users mental models in designing complex systems , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[75]  Jérôme Gouzy,et al.  REMORA: a pilot in the ocean of BioMoby web-services , 2006, Bioinform..

[76]  Laura Downey Group usability testing: evolution in usability techniques , 2007 .

[77]  M. Kappler Software for rapid prototyping in the pharmaceutical and biotechnology industries. , 2008, Current opinion in drug discovery & development.

[78]  Olga Kulyk,et al.  Sticks, balls or a ribbon? Results of a formative user study with bioinformaticians , 2008 .

[79]  Donald F. Ferguson,et al.  The WS-Resource Framework , 2004 .

[80]  Pieter B. T. Neerincx Web services for transcriptomics , 2009 .

[81]  Wilkinson,et al.  The BioMOBY Project Explores Open-Source , Simple , Extensible Protocols for Enabling Biological Database Interoperability , 2004 .

[82]  Edward A. Lee,et al.  CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper. 2000; 00:1–7 Prepared using cpeauth.cls [Version: 2002/09/19 v2.02] Taverna: Lessons in creating , 2022 .

[83]  Arthur M. Lesk,et al.  Introduction to bioinformatics , 2002 .

[84]  Anton Nijholt,et al.  Using R in Taverna: RShell v1.2 , 2009, BMC Research Notes.

[85]  John F. Sowa,et al.  Knowledge representation: logical, philosophical, and computational foundations , 2000 .

[86]  V. Vianu,et al.  Edinburgh Why and Where: A Characterization of Data Provenance , 2017 .

[87]  K. Dunbar How scientists think: On-line creativity and conceptual change in science. , 1997 .

[88]  Wolfgang Reisig,et al.  A Primer in Petri Net Design , 1992, Springer Compass International.

[89]  Mark D. Wilkinson Gbrowse Moby: a Web-based browser for BioMoby Services , 2006, Source Code for Biology and Medicine.

[90]  Carole A. Goble,et al.  Workflow discovery: the problem, a case study from e-Science and a graph-based solution , 2006, 2006 IEEE International Conference on Web Services (ICWS'06).

[91]  C Fields Data exchange and inter-database communication in genome projects. , 1992, Trends in biotechnology.

[92]  Romain Robbes,et al.  How Program History Can Improve Code Completion , 2008, 2008 23rd IEEE/ACM International Conference on Automated Software Engineering.

[93]  Bienvenido Vélez,et al.  Lynx: an open architecture for catalyzing the deployment of interactive digital government workflow-based systems , 2006, DG.O.

[94]  Gerrit C. van der Veer,et al.  New interactions with workflow systems , 2009, ECCE.

[95]  Amit P. Sheth,et al.  Managing heterogeneous multi-system tasks to support enterprise-wide operations , 1995, Distributed and Parallel Databases.

[96]  Cesare Pautasso,et al.  The JOpera visual composition language , 2005, J. Vis. Lang. Comput..

[97]  Olga Kulyk,et al.  Smart Environments for Collaborative Design, Implementation, and Interpretation of Scientific Experiments , 2007 .

[98]  Kees M. van Hee,et al.  Workflow Management: Models, Methods, and Systems , 2002, Cooperative information systems.

[99]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[100]  Susan B. Davidson,et al.  A User-Centric Framework for Accessing Biological Sources and Tools , 2005, DILS.

[101]  George Pavlou,et al.  On the Evolution of Management Approaches, Frameworks and Protocols: A Historical Perspective , 2007, Journal of Network and Systems Management.

[102]  Cees T. A. M. de Laat,et al.  WS-VLAM: towards a scalable workflow system on the grid , 2007, WORKS '07.

[103]  Mark D. Wilkinson,et al.  BioMOBY: An Open Source Biological Web Services Proposal , 2002, Briefings Bioinform..

[104]  E. L. Koua,et al.  A usability framework for the design and evaluation of an exploratory geovisualization environment , 2004 .

[105]  Jaime Urquiza-Fuentes,et al.  Human-Centered Aspects , 2006, Human-Centered Visualization Environments.

[106]  Omran A. Bukhres,et al.  A Dynamic Workflow Approach for the Integration of Bioinformatics Services , 2005, Cluster Computing.

[107]  Carole A. Goble,et al.  Data curation + process curation=data integration + science , 2008, Briefings Bioinform..

[108]  Amit P. Sheth,et al.  IntelliGEN: A Distributed Workflow System for Discovering Protein-Protein Interactions , 2004, Distributed and Parallel Databases.

[109]  Peter Johnson,et al.  Interaction in creative tasks , 2006, CHI.

[110]  Carole A. Goble,et al.  Delivering web service coordination capability to users , 2004, WWW Alt. '04.

[111]  Wil M.P. van der Aalst,et al.  YAWL: yet another workflow language , 2005, Inf. Syst..

[112]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[113]  Jacques Wainer,et al.  WorkFlow systems: a few definitions and a few suggestions , 1995, COCS '95.

[114]  Adam Arbree,et al.  Mapping Abstract Complex Workflows onto Grid Environments , 2003, Journal of Grid Computing.

[115]  Heidi J. C. Ellis,et al.  Conceptual-level workflow modeling of scientific experiments using NMR as a case study , 2007, BMC Bioinformatics.

[116]  Andreas Prlic,et al.  Ensembl 2008 , 2007, Nucleic Acids Res..

[117]  Egon L. Willighagen,et al.  The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemo-and Bioinformatics , 2003, J. Chem. Inf. Comput. Sci..

[118]  Mark D. Wilkinson,et al.  Moby and Moby 2: Creatures of the Deep (Web) , 2009, Briefings Bioinform..

[119]  Jacques Cohen,et al.  Bioinformatics—an introduction for computer scientists , 2004, CSUR.

[120]  Kirsten N. Whitley Visual Programming Languages and the Empirical Evidence For and Against , 1997, J. Vis. Lang. Comput..

[121]  Gregory R. Madey,et al.  Improving the Reuse of Scientific Workflows and Their By-products , 2007 .

[122]  Ekaterina Pilicheva,et al.  LIMaS: the JAVA-based application and database for microarray experiment tracking , 2004, Mammalian Genome.

[123]  Lloyd Treinish,et al.  An extended data-flow architecture for data analysis and visualization , 1995, COMG.

[124]  Benjamin M. Good,et al.  DataBiNS: a BioMoby-based data-mining workflow for biological pathways and non-synonymous SNPs , 2007, Bioinform..

[125]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[126]  Brad A. Myers,et al.  Workflow by Example: Automating Database Interactions via Deductions , 2006 .

[127]  Wil M. P. van der Aalst,et al.  Workflow Patterns: On the Expressive Power of (Petri-net-based) Workflow Languages. , 2002 .

[128]  Hiraku Morisawa,et al.  Development of an open source laboratory information management system for 2-D gel electrophoresis-based proteomics workflow , 2006, BMC Bioinformatics.

[129]  C. Petri Kommunikation mit Automaten , 1962 .

[130]  Gert Vriend,et al.  MRS: a fast and compact retrieval system for biological data , 2005, Nucleic Acids Res..

[131]  Wil M. P. van der Aalst,et al.  Worklets: A Service-Oriented Implementation of Dynamic Flexibility in Workflows , 2006, OTM Conferences.

[132]  Dan Wu,et al.  Priorities for nucleotide trace, sequence and annotation data capture at the Ensembl Trace Archive and the EMBL Nucleotide Sequence Database , 2007, Nucleic Acids Res..

[133]  Olga Kulyk,et al.  Getting to know bioinformaticians : Results of an exploratory user study , 2006 .

[134]  M. Bittner,et al.  Expression profiling using cDNA microarrays , 1999, Nature Genetics.

[135]  Paul T. Groth,et al.  PReServ: Provenance Recording for Services , 2005 .

[136]  Amit P. Sheth,et al.  Using SAWSDL for Semantic Service Interoperability , 2007 .

[137]  Vito Perrone,et al.  Better bioinformatics through usability analysis , 2009, Bioinform..

[138]  Dennis Gannon,et al.  Scientific versus Business Workflows , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[139]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[140]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[141]  Anne E. Trefethen,et al.  The UK e-Science Core Programme and the Grid , 2002, Future Gener. Comput. Syst..

[142]  Steve Pettifer,et al.  An active registry for bioinformatics web services , 2009, Bioinform..

[143]  Martin Senger,et al.  BioMoby extensions to the Taverna workflow management and enactment software , 2006, BMC Bioinformatics.

[144]  Carole A. Goble,et al.  Software Design for Empowering Scientists , 2009, IEEE Software.

[145]  J. Thompson,et al.  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. , 1997, Nucleic acids research.

[146]  Andrea Klug Workflow Handbook 1997 , 1997 .

[147]  Tessa A. Lau,et al.  Programming by demonstration: an inductive learning formulation , 1998, IUI '99.

[148]  Amit P. Sheth,et al.  Specification and Execution of Transactional Workflows , 1995, Modern Database Systems.

[149]  Bart De Moor,et al.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis , 2005, Bioinform..

[150]  Anton Nijholt,et al.  Bringing Hollywood to the Driving School: Dynamic Scenario Generation in Simulations and Games , 2005, INTETAIN.

[151]  Kevin Crowston,et al.  Social Dynamics of FLOSS Team Communication Across Channels , 2008, OSS.

[152]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[153]  César Garita,et al.  VLAM-G: A Grid-based virtual laboratory , 2002 .

[154]  Edward A. Lee,et al.  The ptolemy II framework for visual languages , 2001, Proceedings IEEE Symposia on Human-Centric Computing Languages and Environments (Cat. No.01TH8587).

[155]  Erhard Rahm,et al.  Analyzing the Evolution of Life Science Ontologies and Mappings , 2008, DILS.

[156]  W. John MacMullen,et al.  Information problems in molecular biology and bioinformatics: Research Articles , 2005 .

[157]  Hamideh Afsarmanesh,et al.  Modelling Multi-disciplinary Scientific Experiments and Information , 2003, ISCIS.

[158]  Yi Chen,et al.  Efficiently discovering critical workflows in scientific explorations , 2009, Future Gener. Comput. Syst..

[159]  Jizhong Zhou,et al.  Empirical Establishment of Oligonucleotide Probe Design Criteria , 2005, Applied and Environmental Microbiology.

[160]  Bertram Ludäscher,et al.  An Ontology-Driven Framework for Data Transformation in Scientific Workflows , 2004, DILS.

[161]  Marco Roos,et al.  Analysing Scientific Workflows: Why Workflows Not Only Connect Web Services , 2009, 2009 Congress on Services - I.

[162]  Ann E. Stapleton,et al.  Codifying bioinformatics processes without programming , 2004 .

[163]  Carole A. Goble,et al.  Taverna Workflows: Syntax and Semantics , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).

[164]  Carole A. Goble,et al.  A classification of tasks in bioinformatics , 2001, Bioinform..

[165]  Ian J. Taylor,et al.  The Triana Workflow Environment: Architecture and Applications , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[166]  Xuan Shi Semantic Web Services: An Unfulfilled Promise , 2007, IT Professional.

[167]  Ilya Shmulevich,et al.  Systems biology driven software design for the research enterprise , 2007, BMC Bioinformatics.

[168]  Carole A. Goble,et al.  Feta: A Light-Weight Architecture for User Oriented Semantic Service Discovery , 2005, ESWC.

[169]  Dov Stekel,et al.  Microarray Bioinformatics: Appendix: MIAME Glossary , 2003 .

[170]  Ian J. Taylor,et al.  Triana: a graphical Web service composition and execution toolkit , 2004, Proceedings. IEEE International Conference on Web Services, 2004..

[171]  Yogesh L. Simmhan,et al.  A survey of data provenance in e-science , 2005, SGMD.

[172]  K. Dunbar HOW SCIENTISTS REALLY REASON: SCIENTIFIC REASONING IN REAL-WORLD LABORATORIES , 1995 .

[173]  Morris A. Swertz,et al.  Beyond standardization: dynamic software infrastructures for systems biology , 2007, Nature Reviews Genetics.

[174]  Chris F. Taylor,et al.  The MGED Ontology: a resource for semantics-based description of microarray experiments , 2006, Bioinform..

[175]  Cesare Pautasso JOpera: Visual Composition of Grid Services , 2004 .

[176]  Susan B. Davidson,et al.  Towards a Model of Provenance and User Views in Scientific Workflows , 2006, DILS.

[177]  D. Kell,et al.  Here is the evidence, now what is the hypothesis? The complementary roles of inductive and hypothesis-driven science in the post-genomic era. , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.

[178]  Carole A. Goble,et al.  Ontology-based Knowledge Representation for Bioinformatics , 2000, Briefings Bioinform..

[179]  Peter Dadam,et al.  Enabling Adaptive Process-Aware Information Systems with ADEPT2 , 2008, Handbook of Research on Business Process Modeling.

[180]  Carole A. Goble,et al.  Mining Taverna's semantic web of provenance , 2008, Concurr. Comput. Pract. Exp..

[181]  Roy T. Fielding,et al.  Principled design of the modern Web architecture , 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium.

[182]  Simon Urbanek,et al.  Rserve A fast way to provide R functionality to applications , 2003 .

[183]  Bertram Ludäscher,et al.  Kepler: an extensible system for design and execution of scientific workflows , 2004, Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004..

[184]  Carole A. Goble,et al.  Performing statistical analyses on quantitative data in Taverna workflows: An example using R and maxdBrowse to identify differentially-expressed genes from microarray data , 2008, BMC Bioinformatics.

[185]  Luc Moreau,et al.  The Open Provenance Model: An Overview , 2008, IPAW.

[186]  B. Jayashree,et al.  Laboratory Information Management Software for genotyping workflows: applications in high throughput crop genotyping , 2006, BMC Bioinformatics.

[187]  Elaine Toms,et al.  Developing a protocol for bioinformatics analysis: An integrated information behavior and task analysis approach , 2005, J. Assoc. Inf. Sci. Technol..

[188]  Moe Thandar Wynn,et al.  Workflow simulation for operational decision support , 2009, Data Knowl. Eng..

[189]  Geoffrey C. Fox,et al.  Examining the Challenges of Scientific Workflows , 2007, Computer.

[190]  Gilles Kahn,et al.  Coroutines and Networks of Parallel Processes , 1977, IFIP Congress.

[191]  Christoph Wilhelm Sensen,et al.  A Pilot Study into the Usability of a Scientific Workflow Construction Tool , 2007 .

[192]  Daniela Fogli,et al.  Visual Interactive Systems for End-User Development: A Model-Based Design Methodology , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[193]  Roger S. Barga,et al.  Automatic Generation of Workflow Provenance , 2006, IPAW.

[194]  P. Argos,et al.  SRS: information retrieval system for molecular biology data banks. , 1996, Methods in enzymology.

[195]  Anne H. H. Ngu,et al.  Flexible Scientific Workflow Modeling Using Frames, Templates, and Dynamic Embedding , 2008, SSDBM.

[196]  Khalid Belhajjame Addressing the Issue of Service Volatility in Scientific Workflows , 2007, ICSOC.

[197]  Arie Rip,et al.  The Computer Revolution in Science: Steps Towards the Realization of Computer-Supported Discovery Environments , 1997, Artif. Intell..

[198]  Carole A. Goble,et al.  The Data Playground: An Intuitive Workflow Specification Environment , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).

[199]  Hitomi Ohkawa,et al.  Scientific workflows as productivity tools for drug discovery. , 2008, Current opinion in drug discovery & development.

[200]  Andreas D Baxevanis,et al.  Searching the NCBI Databases Using Entrez , 2006, Current protocols in human genetics.

[201]  Ian T. Foster Globus Toolkit Version 4: Software for Service-Oriented Systems , 2005, NPC.

[202]  Richard R. Mahaffey LIMS: Applied Information Technology for the Laboratory , 1990 .

[203]  Amit P. Sheth,et al.  An overview of workflow management: From process modeling to workflow automation infrastructure , 1995, Distributed and Parallel Databases.

[204]  David Charles De Roure,et al.  myExperiment: social networking for workflow-using e-scientists , 2007, WORKS '07.

[205]  Christoph Bussler,et al.  Workflow Management: Modeling Concepts, Architecture and Implementation , 1996 .

[206]  Robert Ramberg,et al.  Backdoor Creativity: Collaborative Creativity in Technology Supported Teams , 2004, COOP.

[207]  Emmanuel Barillot,et al.  XML, bioinformatics and data integration , 2001, Bioinform..

[208]  Catherine Brooksbank,et al.  The European Bioinformatics Institute's data resources: towards systems biology , 2004, Nucleic Acids Res..

[209]  Yogesh L. Simmhan,et al.  The Open Provenance Model (v1.01) , 2008 .

[210]  T. Oinn,et al.  Soaplab - a unified Sesame door to analysis tools , 2003 .

[211]  Carole A. Goble,et al.  Applying Semantic Web Services to Bioinformatics: Experiences Gained, Lessons Learnt , 2004, SEMWEB.

[212]  Peter Gregor,et al.  Usability and User-Centered Design in Scientific Software Development , 2009, IEEE Software.

[213]  Carole A. Goble,et al.  Seven Bottlenecks to Workflow Reuse and Repurposing , 2005, International Semantic Web Conference.

[214]  Marco Roos,et al.  The promise of a virtual lab in drug discovery. , 2006, Drug discovery today.

[215]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[216]  Jano I. van Hemert,et al.  Scientific Workflow: A Survey and Research Directions , 2007, PPAM.

[217]  Christopher D. Hundhausen,et al.  An experimental study of the impact of visual semantic feedback on novice programming , 2007, J. Vis. Lang. Comput..

[218]  Christoph W. Sensen,et al.  Seahawk: moving beyond HTML in Web-based bioinformatics analysis , 2007, BMC Bioinformatics.

[219]  Chris North,et al.  An Evaluation of Microarray Visualization Tools for Biological Insight , 2004 .

[220]  Gary D. Bader,et al.  SeqHound: biological sequence and structure database as a platform for bioinformatics research , 2002, BMC Bioinformatics.

[221]  S. Gruvberger,et al.  BioArray Software Environment (BASE): a platform for comprehensive management and analysis of microarray data , 2002, Genome Biology.

[222]  Luc Moreau,et al.  Provenance of e-Science Experiments - Experience from Bioinformatics , 2003 .

[223]  Carole A. Goble,et al.  Taverna/myGrid: Aligning a Workflow System with the Life Sciences Community , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[224]  Carole A. Goble,et al.  Using Semantic Web Technologies for Representing E-science Provenance , 2004, SEMWEB.

[225]  Antonio Brogi,et al.  From BPEL Processes to YAWL Workflows , 2006, WS-FM.

[226]  Alfons Kemper,et al.  Semantic Caching for Web Services , 2005, ICSOC.