Enhancing Data Sharing in Collaborative Research Projects with DASH

We describe a software framework, called DASH, that enables the facile access, maintenance, curation and sharing of computational biology data among collaborating research scientists. The DASH event-based framework enables members of team-based research projects to describe the multistep computational processing pipelines frequently required to generate data for sharing, monitors multiple distributed data stores for changes, and will then automatically invoke the appropriate processing pipeline(s). These pipelines can be used to communicate the results of data analyses to collaborators using mechanisms such as Web Services. We describe the overall design of the DASH system and the application of a simple DASH prototype to a collaborative pharmacogenomics research project involving several dozen researchers located at several different sites--the UCSF Pharmacogenetics of Membrane Transporters project.

[1]  Gail E. Kaiser,et al.  A paradigm for decentralized process modeling and its realization in the OZ environment , 1994, Proceedings of 16th International Conference on Software Engineering.

[2]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[3]  Kenneth M. Anderson,et al.  Metis: lightweight, flexible, and Web-based workflow services for digital libraries , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[4]  Robert J. K. Jacob,et al.  A software model and specification language for non-WIMP user interfaces , 1999, TCHI.

[5]  David S. Rosenblum,et al.  Design and evaluation of a wide-area event notification service , 2001, TOCS.

[6]  Chris Gane,et al.  Structured Systems Analysis: Tools and Techniques , 1977 .

[7]  Carole A. Goble,et al.  myGrid: personalised bioinformatics on the information grid , 2003, ISMB.

[8]  E. Zerhouni The NIH Roadmap , 2003, Science.

[9]  Alfonso Fuggetta,et al.  Exploiting an event-based infrastructure to develop complex distributed systems , 1998, Proceedings of the 20th International Conference on Software Engineering.

[10]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[11]  Inder M. Verma,et al.  The NIH Roadmap , 2004 .

[12]  Conrad C. Huang,et al.  SNP Analysis and Presentation in the Pharmacogenetics of Membrane Transporters Project , 2003, Pacific Symposium on Biocomputing.

[13]  Conrad C. Huang,et al.  Natural variation in human membrane transporter genes reveals evolutionary and functional constraints , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Russ B. Altman,et al.  PharmGKB: the Pharmacogenetics Knowledge Base , 2002, Nucleic Acids Res..