Grid-enabled Virtual Screening Against Malaria

WISDOM is an international initiative to enable a virtual screening pipeline on a Grid infrastructure. Its first attempt was to deploy large scale in silico docking on a public Grid infrastructure. Protein–ligand docking is about computing the binding energy of a protein target to a library of potential drugs using a scoring algorithm. Previous deployments were either limited to one cluster, to Grids of clusters in the tightly protected environment of a pharmaceutical laboratory or to desktop Grids. The first large scale docking experiment ran on the EGEE Grid production service from 11 July 2005 to 19 August 2005 against targets relevant to research on malaria and saw over 41 million compounds docked for the equivalent of 80 years of CPU time. Up to 1,700 computers were simultaneously used in 15 countries around the world. Issues related to the deployment and the monitoring of the in silico docking experiment as well as experience with Grid operation and services are reported in the paper. The main problem encountered for such a large scale deployment was the Grid infrastructure stability. Although the overall success rate was above 80%, a lot of monitoring and supervision was still required at the application level to resubmit the jobs that failed. But the experiment demonstrated how Grid infrastructures have a tremendous capacity to mobilize very large CPU resources for well targeted goals during a significant period of time. This success leads to a second computing challenge targeting avian flu neuraminidase N1.

[1]  Rajesh Raman,et al.  Matchmaking: distributed resource management for high throughput computing , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[2]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[3]  Paul D Lyne,et al.  Structure-based virtual screening: an overview. , 2002, Drug discovery today.

[4]  Laurence Loewe,et al.  Global Computing for Bioinformatics , 2002, Briefings Bioinform..

[5]  L. Muñoz,et al.  Virtual Laboratory , 2002 .

[6]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[7]  Vincent Breton,et al.  From Grid to Healthgrid - Proceedings of Healthgrid 2005, Oxford, UK, 7-9 April 2005 , 2005, HealthGrid.

[8]  Krishnarao Appasani Continuing the 'ome' revolution. , 2002, Drug discovery today.

[9]  Flavia Donno,et al.  Analysis of the ATLAS Rome production experience on the LHC computing grid , 2005, First International Conference on e-Science and Grid Computing (e-Science'05).

[10]  Martin Hofmann,et al.  Grid Added Value to Address Malaria , 2006, CCGRID.

[11]  Andy Oram,et al.  Peer-to-Peer: Harnessing the Power of Disruptive Technologies , 2001 .

[12]  Vijay S. Pande,et al.  Folding@Home and Genome@Home: Using distributed computing to tackle previously intractable problem , 2009, 0901.0866.

[13]  Charles L. Brooks,et al.  Predictor@Home: a "protein structure prediction supercomputer" based on public-resource computing , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[14]  David Abramson,et al.  The Virtual Laboratory: a toolset to enable distributed molecular modelling for drug design on the World‐Wide Grid , 2003, Concurr. Comput. Pract. Exp..

[15]  A. Anderson The process of structure-based drug design. , 2003, Chemistry & biology.

[16]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[17]  I Bird,et al.  LHC computing grid : Technical design report , 2005 .

[18]  Ian Foster,et al.  Grid technologies empowering drug discovery. , 2002, Drug discovery today.

[19]  Ian T. Foster,et al.  GNARE: an environment for grid-based high-throughput genome analysis , 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..

[20]  J. Irwin,et al.  ZINC ? A Free Database of Commercially Available Compounds for Virtual Screening. , 2005 .

[21]  Fabrizio Gagliardi,et al.  Building an infrastructure for scientific Grid computing: status and goals of the EGEE project , 2005, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[22]  Jack R. Collins,et al.  Structure and inhibition of plasmepsin II, a hemoglobin-degrading enzyme from P. Falciparum , 1996 .

[23]  David Abramson,et al.  Application of grid computing to parameter sweeps and optimizations in molecular modeling , 2005, Future Gener. Comput. Syst..

[24]  R. Spencer,et al.  High-throughput screening of historic collections: observations on file size, biological targets, and file diversity. , 1998, Biotechnology and bioengineering.

[25]  Jochen Wiesner,et al.  New antimalarial drugs. , 2003, Angewandte Chemie.

[26]  David J. Garcia Aristegui,et al.  GROCK: high-throughput docking using LCG grid tools , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[27]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[28]  D. Goldberg,et al.  Aspartic proteases of Plasmodium falciparum and other parasitic protozoa as drug targets. , 2001, Trends in parasitology.

[29]  Hiroshi Nakamura,et al.  Grid as a bioinformatic tool , 2004, Parallel Comput..

[30]  Proceedings of Healthgrid 2005. , 2005, Studies in health technology and informatics.

[31]  Marc Zimmermann,et al.  Demonstration of In Silico Docking at a Large Scale on Grid Infrastructure , 2006, HealthGrid.

[32]  W. Graham Richards,et al.  Virtual screening using grid computing: the screensaver project , 2002, Nature Reviews Drug Discovery.

[33]  Charles L. Brooks,et al.  Predictor@Home: A "Protein Structure Prediction Supercomputer' Based on Global Computing , 2006, IEEE Transactions on Parallel and Distributed Systems.

[34]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[35]  D. Sullivan,et al.  Hemoglobin metabolism in the malaria parasite Plasmodium falciparum. , 1997, Annual review of microbiology.