HierVLS hierarchical docking protocol for virtual ligand screening of large-molecule databases.

To make it practical to scan the extensive experimental combinatorial chemistry libraries now available for high-throughput screening (HTS), computational virtual ligand screening (VLS) techniques are needed that rapidly identify all compounds in a large library that are active against a particular protein target. Toward this goal we developed HierVLS, a fast hierarchical docking approach that starts with a coarse-grained conformational search over a large number of configurations filtered with a fast but crude energy function, followed by a succession of finer-grained levels that use successively more accurate but more expensive descriptions of the ligand-protein-solvent interactions to filter successively fewer cases. The final step of this procedure optimizes one configuration of the ligand in the protein site using our most accurate energy expression and description of the solvent, which would be impractical for all conformations and sites sampled at the coarse level. HierVLS is based on the HierDock approach, but rather than allowing an hour or more to determine the best binding site and energy for each ligand (as in HierDock), we have adapted our procedure so that it gives reliable results while using only 4 min (866 MHz Pentium III processor) per ligand. To validate the accuracy of HierVLS in predicting the experimentally observed binding conformation, we considered 37 cocrystal structures comprising 11 target proteins. We find that HierVLS identifies the correct binding mode for all 37 cocrystals. In addition, the calculated binding energies correlate well with available experimental binding constants. To validate how well HierVLS can identify the correct ligand in an extensive library of decoys, we considered a library of over 10 000 molecules. HierVLS ranks the correct ligand in the top 2% by binding affinity among the 10 037 molecules in 26 of the 37 cases. The failures result from either metal-containing sites on the protein or water-mediated ligand-protein interactions, which we anticipate can be solved within the constraints of practical VLS. We then applied HierVLS to screen a 55 000-compound virtual library against the target protein-tyrosine phosphatase 1B (ptp1b). The top 250 compounds by binding affinity included all six ptp1b cocrystal ligands added to the library plus three other experimentally confirmed binders. The best (top 1) binder is an experimentally confirmed positive. We conclude that HierVLS is useful for selecting leads for a particular target out of large combinatorial databases.
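
The coarse-to-fine filtering idea described above can be illustrated with a minimal sketch. This is not the HierVLS implementation: the one-dimensional "poses", the three toy energy functions, and the number of survivors kept at each level are all assumptions chosen only to show the funnel structure, in which cheap scores prune many candidates and the most expensive score is applied to only a few.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical stand-ins: a "pose" is a single float, and each level is a
# (scoring function, number of survivors to keep) pair, ordered from the
# cheapest and crudest score to the most expensive and accurate one.
Pose = float
Level = Tuple[Callable[[Pose], float], int]


def hierarchical_filter(poses: Sequence[Pose], levels: Sequence[Level]) -> List[Pose]:
    """Apply each level in turn, keeping only the lowest-energy survivors."""
    survivors = list(poses)
    for score, keep in levels:
        # Lower score means more favorable; sorted() is stable for ties.
        survivors = sorted(survivors, key=score)[:keep]
    return survivors


def crude_energy(x: Pose) -> float:
    # Toy coarse score: only resolves distance from 3.0 in whole units.
    return float(int(abs(x - 3.0)))


def medium_energy(x: Pose) -> float:
    # Toy intermediate score with a slightly shifted minimum.
    return abs(x - 3.2)


def fine_energy(x: Pose) -> float:
    # Toy "most accurate" score, applied only to the last few survivors.
    return (x - 3.14) ** 2


poses = [i * 0.1 for i in range(100)]  # 100 candidate configurations
levels = [(crude_energy, 20), (medium_energy, 5), (fine_energy, 1)]
best = hierarchical_filter(poses, levels)
print(best)
```

In this toy run the crude score is evaluated 100 times, the intermediate score 20 times, and the fine score only 5 times, which is the cost profile the hierarchical strategy aims for.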
