The Data Playground: An Intuitive Workflow Specification Environment

Workflows systems are steadily finding their way into the work practices of scientists. This is particularly true in the in silico science of bioinformatics, where biological data can be processed by Web services. In this paper we investigate the potential of evolving the users' interaction with workflow environments so that it more closely relates to the mode in which their day to day work is carried out. We present the Data Playground, an environment designed to encourage the uptake of workflow systems in bioinformatics through more intuitive interaction by focusing the user on their data rather than on the processes. We implement a prototype plug-in for the Taverna workflow environment and show how this can promote the creation of workflow fragments by automatically converting the users' interactions with data and Web services into a more conventional workflow specification.

[1]  Dan R. Olsen,et al.  Macros by example in a graphical UIMS , 1988, IEEE Computer Graphics and Applications.

[2]  James C. Spohrer,et al.  KidSim: programming agents without a programming language , 1994, CACM.

[3]  Atsushi Sugiura,et al.  Internet scrapbook: automating Web browsing tasks by demonstration , 1998, UIST '98.

[4]  Tessa A. Lau,et al.  Programming by demonstration: an inductive learning formulation , 1998, IUI '99.

[5]  H. Lieberman Your Wish is My Command: Programming By Example , 2001 .

[6]  Anthony Rowe,et al.  The discovery net system for high throughput bioinformatics , 2003, ISMB.

[7]  L. Stein Integrating biological databases , 2003, Nature Reviews Genetics.

[8]  Carole A. Goble,et al.  Exploring Williams-Beuren syndrome using myGrid , 2004, ISMB/ECCB.

[9]  Carole A. Goble,et al.  Applying Semantic Web Services to Bioinformatics: Experiences Gained, Lessons Learnt , 2004, SEMWEB.

[10]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[11]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[12]  Marc Spraragen,et al.  An intelligent assistant for interactive workflow composition , 2004, IUI '04.

[13]  Kerry K Kakazu,et al.  The Cancer Biomedical Informatics Grid (caBIG): pioneering an expansive network of information and tools for collaborative cancer research. , 2004, Hawaii medical journal.

[14]  Bertram Ludäscher,et al.  Kepler: an extensible system for design and execution of scientific workflows , 2004 .

[15]  Heiko Schoof,et al.  BioMOBY Successfully Integrates Distributed Heterogeneous Bioinformatics Web Services. The PlaNet Exemplar Case1 , 2005, Plant Physiology.

[16]  Rob Miller,et al.  Automation and customization of rendered web pages , 2005, UIST.

[17]  Daniel J. Blankenberg,et al.  Galaxy: a platform for interactive large-scale genome analysis. , 2005, Genome research.

[18]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[19]  Brad A. Myers,et al.  Workflow by Example: Automating Database Interactions via Deductions , 2006 .

[20]  Jason Maassen,et al.  Programming Scientific and Distributed Workflow with Triana Services , 2004 .

[21]  Carole A. Goble,et al.  Workflow discovery: the problem, a case study from e-Science and a graph-based solution , 2006, 2006 IEEE International Conference on Web Services (ICWS'06).

[22]  Carole A. Goble,et al.  A Semantic Web-Based Approach to Knowledge Management for Grid Applications , 2007, IEEE Transactions on Knowledge and Data Engineering.

[23]  Carole A. Goble,et al.  The myGrid ontology: bioinformatics service discovery , 2007, Int. J. Bioinform. Res. Appl..

[24]  David Charles De Roure,et al.  myExperiment: social networking for workflow-using e-scientists , 2007, WORKS '07.

[25]  Carole A. Goble,et al.  Automatic annotation of Web services based on workflow definitions , 2006, TWEB.