PsyGrid: Applying e-Science to Epidemiology

The process of hypothesis-driven epidemiological research has three phases: the establishment and characterisation of a large, representative cohort from a geographically distributed population; the integration of the cohort data with other data sources to provide additional characterisation; and the formulation of a hypothesis and generation of the corresponding predictions. Grid-computing technologies enable secure, distributed collaboration and the sharing of data sources, computational resources and storage resources across administrative boundaries. PsyGrid is an e-Science project established to apply grid-computing technologies to each of the three phases, with the aim of eliminating the obstacles that hinder epidemiological research. We describe a system for distributed cohort characterisation and its first application, to the study of first-episode psychosis.
