High‐throughput virtual molecular docking with AutoDockCloud

Virtual molecular docking is a computational method used in computer‐aided drug discovery that calculates the binding affinity of a small molecule drug candidate to a target protein. High‐throughput virtual screenings calculate the binding affinities for a large number of molecules at once and ranks potential drug candidates to greatly reduces the time and cost of suggesting new potential pharmaceuticals. This high‐throughput screening is a task parallel process and therefore well‐suited for distributed computing. In this study, we use the open source Hadoop framework implementing the MapReduce paradigm for distributed computing on a cloud platform and the widely used molecular docking program, AutoDock. The initial implementation of AutoDockCloud showed a speed‐up of 450 on Kandinsky, a cloud computer located at Oak Ridge National Laboratory. Further modifications show promise for a greater speed‐up of large chemical library screenings and also incorporates and distributes the pre‐docking procedures. Copyright © 2012 John Wiley & Sons, Ltd.

[1]  David A. Agard,et al.  Structural characterization of a subtype-selective ligand reveals a novel mode of estrogen receptor antagonism , 2002, Nature Structural Biology.

[2]  Rick Mullin,et al.  EXELIXIS AFTER GSK: A six-year partnership ends, and a new era begins for a BIOTECHNOLOGY PIONEER , 2009 .

[3]  Yongbo Hu,et al.  Comparison of Several Molecular Docking Programs: Pose Prediction and Virtual Screening Accuracy , 2009, J. Chem. Inf. Model..

[4]  Paul J. Hergenrother,et al.  Structure-Based Design and In Silico Virtual Screening of Combinatorial Libraries. A Combined Chemical- Computational Project , 2005 .

[5]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[6]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[7]  Jaques Reifman,et al.  DOVIS 2.0: an efficient and easy to use parallel virtual screening tool based on AutoDock 4.0 , 2008, Chemistry Central journal.

[8]  Arthur J. Olson,et al.  Analysis of HIV Wild‐Type and Mutant Structures via in silico Docking Against Diverse Ligand Libraries. , 2007 .

[9]  J. Irwin,et al.  Benchmarking sets for molecular docking. , 2006, Journal of medicinal chemistry.

[10]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[11]  Stephen Trzaska digital briefs: NEW SOFTWARE AND WEBSITES FOR THE CHEMICAL ENTERPRISE , 2006 .

[12]  Jeremy C. Smith,et al.  Task‐parallel message passing interface implementation of Autodock4 for docking of very large databases of compounds using high‐performance super‐computers , 2011, J. Comput. Chem..

[13]  David S. Goodsell,et al.  A semiempirical free energy force field with charge‐based desolvation , 2007, J. Comput. Chem..

[14]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[15]  David P. Anderson,et al.  The Effectiveness of Threshold-Based Scheduling Policies in BOINC Projects , 2006, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06).

[16]  Na Ke,et al.  A retinoic acid binding cytochrome P450: CYP120A1 from Synechocystis sp. PCC 6803. , 2005, Archives of biochemistry and biophysics.

[17]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[18]  Johan Montagnat,et al.  Grid-enabled Virtual Screening Against Malaria , 2006, Journal of Grid Computing.

[19]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[20]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..