VSDMIP: virtual screening data management on an integrated platform

A novel software (VSDMIP) for the virtual screening (VS) of chemical libraries integrated within a MySQL relational database is presented. Two main features make VSDMIP clearly distinguishable from other existing computational tools: (i) its database, which stores not only ligand information but also the results from every step in the VS process, and (ii) its modular and pluggable architecture, which allows customization of the VS stages (such as the programs used for conformer generation or docking), through the definition of a detailed workflow employing user-configurable XML files. VSDMIP, therefore, facilitates the storage and retrieval of VS results, easily adapts to the specific requirements of each method and tool used in the experiments, and allows the comparison of different VS methodologies. To validate the usefulness of VSDMIP as an automated tool for carrying out VS several experiments were run on six protein targets (acetylcholinesterase, cyclin-dependent kinase 2, coagulation factor Xa, estrogen receptor alpha, p38 MAP kinase, and neuraminidase) using nine binary (actives/inactive) test sets. The performance of several VS configurations was evaluated by means of enrichment factors and receiver operating characteristic plots.

[1]  Kun Wang,et al.  Gaussian mapping of chemical fragments in ligand binding sites , 2004, J. Comput. Aided Mol. Des..

[2]  S. Teague Implications of protein flexibility for drug discovery , 2003, Nature Reviews Drug Discovery.

[3]  B. Kuhn,et al.  Validation and use of the MM-PBSA approach for drug discovery. , 2005, Journal of medicinal chemistry.

[4]  K. Sharp,et al.  Accurate Calculation of Hydration Free Energies Using Macroscopic Solvent Models , 1994 .

[5]  A. Ortiz,et al.  A new implicit solvent model for protein–ligand docking , 2007, Proteins.

[6]  David Rogers,et al.  Cheminformatics analysis and learning in a data pipelining environment , 2006, Molecular Diversity.

[7]  C. E. Peishoff,et al.  A critical assessment of docking programs and scoring functions. , 2006, Journal of medicinal chemistry.

[8]  Niu Huang,et al.  Physics-Based Scoring of Protein-Ligand Complexes: Enrichment of Known Inhibitors in Large-Scale Virtual Screening , 2006, J. Chem. Inf. Model..

[9]  Lenwood S. Heath,et al.  H++: a server for estimating pKas and adding missing hydrogens to macromolecules , 2005, Nucleic Acids Res..

[10]  Jennifer A Townes,et al.  The development of monocyclic pyrazolone based cytokine synthesis inhibitors. , 2005, Bioorganic & medicinal chemistry letters.

[11]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[12]  David A. Agard,et al.  The Structural Basis of Estrogen Receptor/Coactivator Recognition and the Antagonism of This Interaction by Tamoxifen , 1998, Cell.

[13]  Brian K Shoichet,et al.  Prediction of protein-ligand interactions. Docking and scoring: successes and gaps. , 2006, Journal of medicinal chemistry.

[14]  Lahana,et al.  How many leads from HTS? , 1999, Drug discovery today.

[15]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings , 1997 .

[16]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[17]  Brian K. Shoichet,et al.  Virtual screening of chemical libraries , 2004, Nature.

[18]  B. Honig,et al.  Classical electrostatics in biology and chemistry. , 1995, Science.

[19]  Marta Murcia,et al.  Virtual screening with flexible docking and COMBINE-based models. Application to a series of factor Xa inhibitors. , 2004, Journal of medicinal chemistry.

[20]  Adam Smith,et al.  Screening for drug discovery: The leading question , 2002, Nature.

[21]  P. Kollman,et al.  How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? , 2000 .

[22]  Gregory D. Hawkins,et al.  Pairwise solute descreening of solute charges from a dielectric medium , 1995 .

[23]  E. Goldsmith,et al.  The structure of mitogen-activated protein kinase p38 at 2.1-A resolution. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[24]  J. A. Grant,et al.  Gaussian docking functions. , 2003, Biopolymers.

[25]  Luhua Lai,et al.  Further development and validation of empirical scoring functions for structure-based binding affinity prediction , 2002, J. Comput. Aided Mol. Des..

[26]  J. Pin,et al.  Virtual screening workflow development guided by the "receiver operating characteristic" curve approach. Application to high-throughput docking on metabotropic glutamate receptor subtype 4. , 2005, Journal of medicinal chemistry.

[27]  Paul Watson,et al.  A web-based platform for virtual screening. , 2003, Journal of molecular graphics & modelling.

[28]  Martin Almlöf,et al.  Free energy calculations and ligand binding. , 2003, Advances in protein chemistry.

[29]  A. Ortiz,et al.  Evaluation of docking functions for protein-ligand docking. , 2001, Journal of medicinal chemistry.

[30]  Walter Thiel,et al.  MINDO/3 Study of the Addition of Singlet Oxygen (1ΔgO2) to 1,3-Butadiene , 1977 .

[31]  D. Rognan,et al.  Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations. , 2000, Journal of medicinal chemistry.

[32]  S Cusack,et al.  Influenza B virus neuraminidase can synthesize its own inhibitor. , 1993, Structure.

[33]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[34]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[35]  Claudio N. Cavasotto,et al.  Protein flexibility in ligand docking and virtual screening to protein kinases. , 2004, Journal of molecular biology.

[36]  Emil Alexov,et al.  Rapid grid‐based construction of the molecular surface and the use of induced surface charge to calculate reaction field energies: Applications to the molecular systems and geometric objects , 2002, J. Comput. Chem..

[37]  J. Sussman,et al.  Structure of acetylcholinesterase complexed with E2020 (Aricept): implications for the design of new anti-Alzheimer drugs. , 1999, Structure.

[38]  James J. P. Stewart,et al.  MOPAC: A semiempirical molecular orbital program , 1990, J. Comput. Aided Mol. Des..

[39]  Maria A Miteva,et al.  Fast structure-based virtual ligand screening combining FRED, DOCK, and Surflex. , 2005, Journal of medicinal chemistry.

[40]  W Patrick Walters,et al.  A detailed comparison of current docking and scoring methods on systems of pharmaceutical relevance , 2004, Proteins.

[41]  David G. Lloyd,et al.  Considerations in Compound Database Preparation-"Hidden" Impact on Virtual Screening Results , 2005, J. Chem. Inf. Model..

[42]  Adrian A Canutescu,et al.  A graph‐theory algorithm for rapid protein side‐chain prediction , 2003, Protein science : a publication of the Protein Society.

[43]  D. Bashford,et al.  Use of 1H NMR spectroscopy and computer simulations To analyze histidine pKa changes in a protein tyrosine phosphatase: experimental and theoretical determination of electrostatic properties in a small protein. , 1997, Biochemistry.

[44]  Gregory D. Hawkins,et al.  Parametrized Models of Aqueous Free Energies of Solvation Based on Pairwise Descreening of Solute Atomic Charges from a Dielectric Medium , 1996 .

[45]  A H Calvert,et al.  Identification of novel purine and pyrimidine cyclin-dependent kinase inhibitors with distinct molecular interactions and tumor cell growth inhibition profiles. , 2000, Journal of medicinal chemistry.

[46]  Antonio Morreale,et al.  Structure-Based Discovery of Novel Non-nucleosidic DNA Alkyltransferase Inhibitors: Virtual Screening and in Vitro and in Vivo Activities , 2008, J. Chem. Inf. Model..

[47]  Jaques Reifman,et al.  DOVIS: an implementation for high-throughput virtual screening using AutoDock , 2008, BMC Bioinformatics.

[48]  P. Kollman,et al.  Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. , 2000, Accounts of chemical research.

[49]  A. Spada,et al.  Crystal structures of human factor Xa complexed with potent inhibitors. , 2000, Journal of medicinal chemistry.

[50]  E. Mehler,et al.  Electrostatic effects in proteins: comparison of dielectric and charge models. , 1991, Protein engineering.

[51]  A. Sali,et al.  Modeller: generation and refinement of homology-based protein structure models. , 2003, Methods in enzymology.

[52]  Stewart A. Adcock,et al.  Molecular dynamics: survey of methods for simulating the activity of proteins. , 2006, Chemical reviews.

[53]  Henrik Boström,et al.  Improving structure-based virtual screening by multivariate analysis of scoring data. , 2003, Journal of medicinal chemistry.

[54]  Christopher W. Murray,et al.  The sensitivity of the results of molecular docking to induced fit effects: Application to thrombin, thermolysin and neuraminidase , 1999, J. Comput. Aided Mol. Des..

[55]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[56]  P. Fischer,et al.  Protein structures in virtual screening: a case study with CDK2. , 2006, Journal of medicinal chemistry.

[57]  Tommi H. Nyrönen,et al.  SOMA - Workflow for Small Molecule Property Calculations on a Multiplatform Computing Grid , 2006, J. Chem. Inf. Model..

[58]  D. Frank Hsu,et al.  Consensus Scoring Criteria for Improving Enrichment in Virtual Screening , 2005, J. Chem. Inf. Model..

[59]  Ramesha,et al.  How many leads from HTS? - Comment. , 2000, Drug discovery today.

[60]  D. Case,et al.  Theory and applications of the generalized born solvation model in macromolecular simulations , 2000, Biopolymers.

[61]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[62]  Gerard Pujadas,et al.  BDT: an easy-to-use front-end application for automation of massive docking tasks and complex docking strategies with AutoDock , 2006, Bioinform..

[63]  Robert P. Sheridan,et al.  Enhanced Virtual Screening by Combined Use of Two Docking Methods: Getting the Most on a Limited Budget , 2005, J. Chem. Inf. Model..