myTea: Connecting the Web to Digital Science on the Desktop

Bioinformaticians regularly access the hundreds of databases and tools that are available to them on the Web. None of these tools communicate with each other, causing the scientist to copy results manually from a Web site into a spreadsheet or word processor. myGrids' Taverna has made it possible to create templates (workflows) that automatically run searches using these databases and tools, cutting down what previously took days of work into hours, and enabling the automated capture of experimental details. What is still missing in the capture process, however, is the details of work done on that material once it moves from the Web to the desktop: if a scientist runs a process on some data, there is nothing to record why that action was taken; it is likewise not easy to publish a record of this process back to the community on the Web. In this paper, we present a novel interaction framework, built on Semantic Web technologies, and grounded in usability design practice, in particular the Making Tea method. Through this work, we introduce a new model of practice designed specifically to (1) support the scientists' interactions with data from the Web to the desktop, (2) provide automatic annotation of process to capture what has previously been lost and (3) associate provenance services automatically with that data in order to enable meaningful interrogation of the process and controlled sharing of the results.

[1]  Carole A. Goble,et al.  Feta: A Light-Weight Architecture for User Oriented Semantic Service Discovery , 2005, ESWC.

[2]  Frank van Harmelen,et al.  Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema , 2002, SEMWEB.

[3]  Sean Martin,et al.  Globally distributed object identification for biological knowledgebases , 2004, Briefings Bioinform..

[4]  Robert Stevens,et al.  Annotating, Linking and Browsing Provenance Logs for {e-Science} , 2003 .

[5]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[6]  Dat Tran,et al.  Applying Task Analysis to Describe and Facilitate Bioinformatics Tasks , 2004, MedInfo.

[7]  J. G. Hollands,et al.  Engineering Psychology and Human Performance , 1984 .

[8]  Bruno W. S. Sobral,et al.  A Life Scientist's Gateway to Distributed Data Management and Computing: The PathPort/ToolBus Framework , 2003, OMICS.

[9]  N. Kaminski,et al.  Bioinformatics. A user's perspective. , 2000, American journal of respiratory cell and molecular biology.

[10]  P. Argos,et al.  SRS: information retrieval system for molecular biology data banks. , 1996, Methods in enzymology.

[11]  Limsoon Wong,et al.  BioKleisli: a digital library for biomedical researchers , 1997, International Journal on Digital Libraries.

[12]  S. R. Pettifer,et al.  UTOPIA—User-Friendly Tools for Operating Informatics Applications , 2004, Comparative and functional genomics.

[13]  Carole A. Goble,et al.  Using Semantic Web Technologies for Representing E-science Provenance , 2004, SEMWEB.

[14]  Carole A. Goble,et al.  A Suite of Daml+Oil Ontologies to Describe Bioinformatics Web Services and Data , 2003, Int. J. Cooperative Inf. Syst..

[15]  Peter Buneman,et al.  Challenges in Integrating Biological Data Sources , 1995, J. Comput. Biol..

[16]  Pedro Mendes,et al.  ISYS: a decentralized, component-based approach to the integration of heterogeneous bioinformatics resources , 2001, Bioinform..

[17]  Anthony Rowe,et al.  The discovery net system for high throughput bioinformatics , 2003, ISMB.

[18]  Carole A. Goble,et al.  Exploring Williams-Beuren syndrome using myGrid , 2004, ISMB/ECCB.

[19]  Naftali Kaminski,et al.  A User ’ s Perspective , 2000 .

[20]  Monica M. C. Schraefel,et al.  Making tea: iterative design through analogy , 2004, DIS '04.

[21]  Carole A. Goble,et al.  A classification of tasks in bioinformatics , 2001, Bioinform..

[22]  Shahrokh Saeednia,et al.  How to maintain both privacy and authentication in digital libraries , 2000 .