High-throughput Molecular Docking Now in Reach for a Wider Biochemical Community

In silico molecular docking is used to predict how a small molecule, the ligand, interacts with a target protein, its receptor. Together with experimental methods like NMR or X-ray crystallography, industrial and academic groups use it for their investigation of compounds with the potential to modulate the protein's function and become a lead molecule for drug development. The interpretation of raw data, from NMR, mass spectrometry or crystallography, is greatly assisted by computers. Biochemists can, perhaps better than anybody else, perform individual analyses for the computational modeling of interactions. However, an extension towards virtual screening of compound libraries, i.e. the computational docking of thousands or even millions of ligands to a target receptor, is often perceived as technically challenging and computationally too expensive in the biochemical community. Here we describe how to integrate spare resources of regular desktop computers on and off campus using the Berkeley Open Infrastructure for Network Computing (BOINC) infrastructure for volunteer grid computing. We have brought both the BOINC server and the Auto Dock software suite into the Debian and Ubuntu Linux distributions, and provide detailed instructions to help render the implementation of a large-scale high-throughput docking (HTD) project straightforward. Thus, this increased availability of computational resources, protocols and source code opens up the possibility of many new self-run projects and collaborations, for those who may be adopting the technology for the first time. On the technical level, we expect to observe more contributors from the Open Source and academic communities. On the biological side, we anticipate new and faster progress on commercially less interesting and so-called 'neglected' diseases.

[1]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[2]  Carole A. Goble,et al.  Community-driven computational biology with Debian Linux , 2010, BMC Bioinformatics.

[3]  Lydia E. Kavraki,et al.  Molecular docking: a problem with thousands of degrees of freedom , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[4]  Yi Pan,et al.  Distributed Sequence Alignment Applications for the Public Computing Architecture , 2008, IEEE Transactions on NanoBioscience.

[5]  Jianghong An,et al.  A large-scale computational approach to drug repositioning. , 2006, Genome informatics. International Conference on Genome Informatics.

[6]  Brian K. Shoichet,et al.  Virtual screening of chemical libraries , 2004, Nature.

[7]  J Andrew McCammon,et al.  Discovery of a novel binding trench in HIV integrase. , 2004, Journal of medicinal chemistry.

[8]  Jano I. van Hemert,et al.  Special Issue: Portals for life sciences—Providing intuitive access to bioinformatic tools , 2011, Concurr. Comput. Pract. Exp..

[9]  Martin Hofmann-Apitius,et al.  In silico drug discovery approaches on grid computing infrastructures. , 2010, Current clinical pharmacology.

[10]  Stefano Forli,et al.  Virtual screening with AutoDock: theory and practice , 2010, Expert opinion on drug discovery.

[11]  Thomas Lengauer,et al.  Improved cyclodextrin-based receptors for camptothecin by inverse virtual screening. , 2007, Chemistry.

[12]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[13]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[14]  Yanli Wang,et al.  PubChem: a public information system for analyzing bioactivities of small molecules , 2009, Nucleic Acids Res..

[15]  Michael P. Cummings,et al.  Computing the Tree of Life: Leveraging the Power of Desktop and Service Grids , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[16]  Garrett M Morris,et al.  A dynamic model of HIV integrase inhibition and drug resistance. , 2010, Journal of molecular biology.

[17]  D. Kliebenstein,et al.  The Complex Genetic Architecture of the Metabolome , 2010, PLoS genetics.

[18]  D. Goodsell,et al.  Visualization of macromolecular structures , 2010, Nature Methods.

[19]  Gianluigi Lauro,et al.  Inverse virtual screening of antitumor targets: pilot study on a small database of natural bioactive compounds. , 2011, Journal of natural products.

[20]  David S. Goodsell,et al.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility , 2009, J. Comput. Chem..

[21]  Jaques Reifman,et al.  DOVIS: an implementation for high-throughput virtual screening using AutoDock , 2008, BMC Bioinformatics.

[22]  Gilles Fedak,et al.  EDGeS: Bridging EGEE to BOINC and XtremWeb , 2009, Journal of Grid Computing.

[23]  Fergal P. Casey,et al.  Approved Drug Mimics of Short Peptide Ligands from Protein Interaction Motifs , 2008, J. Chem. Inf. Model..

[24]  L. Peltonen,et al.  Monogenic autoimmune diseases - lessons of self-tolerance. , 2005, Current opinion in immunology.

[25]  D. Goodsell,et al.  Automated prediction of ligand‐binding sites in proteins , 2007, Proteins.

[26]  Garrett M Morris,et al.  Using AutoDock for Ligand‐Receptor Docking , 2008, Current protocols in bioinformatics.

[27]  Péter Kacsuk,et al.  GenWrapper: A generic wrapper for running legacy applications on desktop grids , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.