Virtual screening to enrich a compound collection with CDK2 inhibitors using docking, scoring, and composite scoring models

Docking programs can generate subsets of a compound collection with an increased percentage of actives against a target (enrichment) by predicting their binding mode (pose) and affinity (score), and retrieving those with the highest scores. Using the QXP and GOLD programs, we compared the ability of six single scoring functions (PLP, Ligscore, Ludi, Jain, ChemScore, PMF) and four composite scoring models (Mean Rank: MR, Rank‐by‐Vote: Vt, Bayesian Statistics: BS and PLS Discriminant Analysis: DA) to separate compounds that are active against CDK2 from inactives. We determined the enrichment for the entire set of actives (IC50 < 10 μM) and for three activity subsets. In all cases, the enrichment for each subset was lower than for the entire set of actives. QXP outperformed GOLD at pose prediction, but yielded only moderately better enrichments. Five to six scoring functions yielded good enrichments with GOLD poses, while typically only two worked well with QXP poses. For each program, two scoring functions generally performed better than the others (Ligscore2 and Ludi for GOLD; QXP and Jain for QXP). Composite scoring functions yielded better results than single scoring functions. The consensus approaches MR and Vt worked best when separating micromolar inhibitors from inactives. The statistical approaches BS and DA, which require training data, performed best when distinguishing between low and high nanomolar inhibitors. The key observation that all hit rate profiles for all four activity intervals for all scoring schemes for both programs are significantly better than random, is evidence that docking can be successfully applied to enrich compound collections. Proteins 2005. © 2005 Wiley‐Liss, Inc.

[1]  R. M. Burnett,et al.  DARWIN: A program for docking flexible molecules , 2000, Proteins.

[2]  R. Clark,et al.  Consensus scoring for ligand/protein interactions. , 2002, Journal of molecular graphics & modelling.

[3]  I. Enyedy,et al.  Discovery of small-molecule inhibitors of Bcl-2 through structure-based computer screening. , 2001, Journal of medicinal chemistry.

[4]  W Patrick Walters,et al.  A detailed comparison of current docking and scoring methods on systems of pharmaceutical relevance , 2004, Proteins.

[5]  M. Murcko,et al.  Consensus scoring: A method for obtaining improved hit rates from docking databases of three-dimensional structures into proteins. , 1999, Journal of medicinal chemistry.

[6]  Thomas Lengauer,et al.  FlexE: efficient molecular docking considering protein structure variations. , 2001, Journal of molecular biology.

[7]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[8]  Todd J. A. Ewing,et al.  Critical evaluation of search algorithms for automated molecular docking and database screening , 1997 .

[9]  Y. Martin,et al.  A general and fast scoring function for protein-ligand interactions: a simplified potential approach. , 1999, Journal of medicinal chemistry.

[10]  B. Shoichet,et al.  Molecular docking and high-throughput screening for novel inhibitors of protein tyrosine phosphatase-1B. , 2002, Journal of medicinal chemistry.

[11]  R Abagyan,et al.  Flexible protein–ligand docking by global energy optimization in internal coordinates , 1997, Proteins.

[12]  M Rarey,et al.  Detailed analysis of scoring functions for virtual screening. , 2001, Journal of medicinal chemistry.

[13]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[14]  D. Goodsell,et al.  Automated docking to multiple target structures: Incorporation of protein mobility and structural water heterogeneity in AutoDock , 2002, Proteins.

[15]  Gennady M Verkhivker,et al.  Molecular recognition of the inhibitor AG-1343 by HIV-1 protease: conformationally flexible docking by evolutionary programming. , 1995, Chemistry & biology.

[16]  Todd J. A. Ewing,et al.  Critical evaluation of search algorithms for automated molecular docking and database screening , 1997, J. Comput. Chem..

[17]  Gerhard Klebe,et al.  Successful virtual screening for novel inhibitors of human carbonic anhydrase: strategy and experimental confirmation. , 2002, Journal of medicinal chemistry.

[18]  Hege S. Beard,et al.  Glide: a new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening. , 2004, Journal of medicinal chemistry.

[19]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[20]  Anna Vulpetti,et al.  Assessment of Docking Poses: Interactions-Based Accuracy Classification (IBAC) versus Crystal Structure Deviations , 2004, J. Chem. Inf. Model..

[21]  Ajay N. Jain Scoring noncovalent protein-ligand interactions: A continuous differentiable function tuned to compute binding affinities , 1996, J. Comput. Aided Mol. Des..

[22]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[23]  B. Shoichet,et al.  Flexible ligand docking using conformational ensembles , 1998, Protein science : a publication of the Protein Society.

[24]  A. N. Jain,et al.  Molecular hashkeys: a novel method for molecular characterization and its application for predicting important pharmaceutical properties of molecules. , 1999, Journal of medicinal chemistry.

[25]  R. Glen,et al.  Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. , 1995, Journal of molecular biology.

[26]  D. Rognan,et al.  Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations. , 2000, Journal of medicinal chemistry.

[27]  Shaomeng Wang,et al.  How Does Consensus Scoring Work for Virtual Library Screening? An Idealized Computer Experiment , 2001, J. Chem. Inf. Comput. Sci..

[28]  Renxiao Wang,et al.  Comparative evaluation of 11 scoring functions for molecular docking. , 2003, Journal of medicinal chemistry.

[29]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[30]  Colin McMartin,et al.  QXP: Powerful, rapid computer algorithms for structure-based drug design , 1997, J. Comput. Aided Mol. Des..

[31]  Martin Stahl,et al.  Binding site characteristics in structure-based virtual screening: evaluation of current docking tools , 2003, Journal of molecular modeling.

[32]  Samir Roy,et al.  BHB: A Simple Knowledge-Based Scoring Function to Improve the Efficiency of Database Screening , 2003, J. Chem. Inf. Comput. Sci..

[33]  Pieter F. W. Stouten,et al.  A molecular mechanics/grid method for evaluation of ligand–receptor interactions , 1995, J. Comput. Chem..

[34]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[35]  A. N. Jain,et al.  Hammerhead: fast, fully automated docking of flexible ligands to protein binding sites. , 1996, Chemistry & biology.