Are predefined decoy sets of ligand poses able to quantify scoring function accuracy?

Due to the large number of different docking programs and scoring functions available, researchers are faced with the problem of selecting the most suitable one when starting a structure-based drug discovery project. To guide the decision process, several studies comparing different docking and scoring approaches have been published. In the context of comparing scoring function performance, it is common practice to use a predefined, computer-generated set of ligand poses (decoys) and to reevaluate their score using the set of scoring functions to be compared. But are predefined decoy sets able to unambiguously evaluate and rank different scoring functions with respect to pose prediction performance? This question arose when the pose prediction performance of our piecewise linear potential derived scoring functions (Korb et al. in J Chem Inf Model 49:84–96, 2009) was assessed on a standard decoy set (Cheng et al. in J Chem Inf Model 49:1079–1093, 2009). While they showed excellent pose identification performance when they were used for rescoring of the predefined decoy conformations, a pronounced degradation in performance could be observed when they were directly applied in docking calculations using the same test set. This implies that on a discrete set of ligand poses only the rescoring performance can be evaluated. For comparing the pose prediction performance in a more rigorous manner, the search space of each scoring function has to be sampled extensively as done in the docking calculations performed here. We were able to identify relative strengths and weaknesses of three scoring functions (ChemPLP, GoldScore, and Astex Statistical Potential) by analyzing the performance for subsets of the complexes grouped by different properties of the active site. However, reasons for the overall poor performance of all three functions on this test set compared to other test sets of similar size could not be identified.

[1]  Djamal Bouzida,et al.  Monte Carlo simulations of the peptide recognition at the consensus binding site of the constant fragment of human immunoglobulin G: The energy landscape analysis of a hot spot at the intermolecular interface , 2002, Proteins.

[2]  Jason C. Cole,et al.  Testing Assumptions and Hypotheses for Rescoring Success in Protein-Ligand Docking , 2009, J. Chem. Inf. Model..

[3]  Fabian Mörchen,et al.  Maximum Common Binding Modes (MCBM): Consensus Docking Scoring Using Multiple Ligand Information and Interaction Fingerprints , 2008, J. Chem. Inf. Model..

[4]  Thomas Stützle,et al.  An ant colony optimization approach to flexible protein–ligand docking , 2007, Swarm Intelligence.

[5]  Maria A Miteva,et al.  Fast structure-based virtual ligand screening combining FRED, DOCK, and Surflex. , 2005, Journal of medicinal chemistry.

[6]  Didier Rognan,et al.  Comparative evaluation of eight docking tools for docking and virtual screening accuracy , 2004, Proteins.

[7]  M. Murcko,et al.  Consensus scoring: A method for obtaining improved hit rates from docking databases of three-dimensional structures into proteins. , 1999, Journal of medicinal chemistry.

[8]  Li Xing,et al.  Evaluation and application of multiple scoring functions for a virtual screening experiment , 2004, J. Comput. Aided Mol. Des..

[9]  Yongbo Hu,et al.  Comparison of Several Molecular Docking Programs: Pose Prediction and Virtual Screening Accuracy , 2009, J. Chem. Inf. Model..

[10]  Reiji Teramoto,et al.  Structure-Based Virtual Screening with Supervised Consensus Scoring: Evaluation of Pose Prediction and Enrichment Factors , 2008, J. Chem. Inf. Model..

[11]  Ajay N. Jain,et al.  Customizing scoring functions for docking , 2008, J. Comput. Aided Mol. Des..

[12]  J. Irwin,et al.  Benchmarking sets for molecular docking. , 2006, Journal of medicinal chemistry.

[13]  Richard D. Taylor,et al.  Improved protein–ligand docking using GOLD , 2003, Proteins.

[14]  Robin Taylor,et al.  A new test set for validating predictions of protein–ligand interaction , 2002, Proteins.

[15]  Xiaoqin Zou,et al.  Scoring functions and their evaluation methods for protein-ligand docking: recent advances and future directions. , 2010, Physical chemistry chemical physics : PCCP.

[16]  Sunil K. Panigrahi Strong and weak hydrogen bonds in protein-ligand complexes of kinases: a comparative study , 2008, Amino Acids.

[17]  Djamal Bouzida,et al.  Computational detection of the binding‐site hot spot at the remodeled human growth hormone–receptor interface , 2003, Proteins.

[18]  W Patrick Walters,et al.  A detailed comparison of current docking and scoring methods on systems of pharmaceutical relevance , 2004, Proteins.

[19]  Thomas Sander,et al.  Comparison of Ligand- and Structure-Based Virtual Screening on the DUD Data Set , 2009, J. Chem. Inf. Model..

[20]  Zhihai Liu,et al.  Comparative Assessment of Scoring Functions on a Diverse Test Set , 2009, J. Chem. Inf. Model..

[21]  Xavier Morelli,et al.  GFscore: A General Nonlinear Consensus Scoring Function for High-Throughput Docking , 2006, J. Chem. Inf. Model..

[22]  Matthias Keil,et al.  Pattern recognition strategies for molecular surfaces: III. Binding site prediction with a neural network , 2004, J. Comput. Chem..

[23]  R. Glen,et al.  Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. , 1995, Journal of molecular biology.

[24]  Kazuhiro Yasuno,et al.  Evaluation of docking calculations on X-ray structures using CONSENSUS-DOCK. , 2010, Chemical & pharmaceutical bulletin.

[25]  Reiji Teramoto,et al.  Consensus Scoring with Feature Selection for Structure-Based Virtual Screening , 2008, J. Chem. Inf. Model..

[26]  Thierry Langer,et al.  Recent Advances in Docking and Scoring , 2005 .

[27]  Richard J. Marhöfer,et al.  Molecular Graphics - Trends and Perspectives , 2000 .

[28]  R. Cramer,et al.  Validation of the general purpose tripos 5.2 force field , 1989 .

[29]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.

[30]  Richard D. Taylor,et al.  Modeling water molecules in protein-ligand docking using GOLD. , 2005, Journal of medicinal chemistry.

[31]  Reiji Teramoto,et al.  Supervised Consensus Scoring for Docking and Virtual Screening , 2007, J. Chem. Inf. Model..

[32]  Thomas E. Exner,et al.  Influence of Protonation, Tautomeric, and Stereoisomeric States on Protein-Ligand Docking Results , 2009, J. Chem. Inf. Model..

[33]  Paul N. Mortenson,et al.  Diverse, high-quality test set for the validation of protein-ligand docking performance. , 2007, Journal of medicinal chemistry.

[34]  Thomas Stützle,et al.  PLANTS: Application of Ant Colony Optimization to Structure-Based Drug Design , 2006, ANTS Workshop.

[35]  Wei Zhao,et al.  Validation of Molecular Docking Programs for Virtual Screening against Dihydropteroate Synthase , 2009, J. Chem. Inf. Model..

[36]  Orr Ravitz,et al.  Improving molecular docking through eHiTS’ tunable scoring function , 2011, J. Comput. Aided Mol. Des..

[37]  Maria A Miteva,et al.  Post-docking virtual screening of diverse binding pockets: comparative study using DOCK, AMMOS, X-Score and FRED scoring functions. , 2010, European journal of medicinal chemistry.

[38]  Nicolas Moitessier,et al.  Docking Ligands into Flexible and Solvated Macromolecules. 4. Are Popular Scoring Functions Accurate for this Class of Proteins? , 2009, J. Chem. Inf. Model..

[39]  Markus H. J. Seifert,et al.  Robust optimization of scoring functions for a target class , 2009, J. Comput. Aided Mol. Des..

[40]  G. Desiraju,et al.  Strong and weak hydrogen bonds in the protein–ligand interface , 2007, Proteins.

[41]  Zhilong Xiu,et al.  Rescoring ligand docking poses. , 2010, Current opinion in drug discovery & development.

[42]  Zhihai Liu,et al.  Evaluation of the performance of four molecular docking programs on a diverse set of protein‐ligand complexes , 2010, J. Comput. Chem..

[43]  Martin Knoblauch,et al.  MOLCAD — Computer Aided Visualization and Manipulation of Models in Molecular Science , 1992 .

[44]  Thorsten Meinl,et al.  KNIME: The Konstanz Information Miner , 2007, GfKl.

[45]  Arthur J. Olson,et al.  Data visualization in molecular science : tools for insight and innovation , 1994 .

[46]  Shuichi Hirono,et al.  Comparison of Consensus Scoring Strategies for Evaluating Computational Models of Protein-Ligand Complexes , 2006, J. Chem. Inf. Model..

[47]  Wolfgang Heiden,et al.  Interactive Visualization of Molecular Scenarios with MOLCAD/SYBYL , 1995 .

[48]  C. E. Peishoff,et al.  A critical assessment of docking programs and scoring functions. , 2006, Journal of medicinal chemistry.

[50]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[51]  Jiro Shimada,et al.  Bootstrap-Based Consensus Scoring Method for Protein-Ligand Docking , 2008, J. Chem. Inf. Model..

[52]  Martin Hofmann-Apitius,et al.  Alternative to Consensus ScoringA New Approach Toward the Qualitative Combination of Docking Algorithms , 2007, J. Chem. Inf. Model..

[53]  Ricardo L. Mancera,et al.  Ligand-Protein Cross-Docking with Water Molecules , 2010, J. Chem. Inf. Model..

[54]  R. Clark,et al.  Consensus scoring for ligand/protein interactions. , 2002, Journal of molecular graphics & modelling.

[55]  Thomas E. Exner,et al.  pKa based protonation states and microspecies for protein–ligand docking , 2010, J. Comput. Aided Mol. Des..

[56]  Christopher R. Corbeil,et al.  Docking Ligands into Flexible and Solvated Macromolecules. 3. Impact of Input Ligand Conformation, Protein Flexibility, and Water Molecules on the Accuracy of Docking Programs , 2009, J. Chem. Inf. Model..

[57]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[58]  Gennady M Verkhivker,et al.  Molecular recognition of the inhibitor AG-1343 by HIV-1 protease: conformationally flexible docking by evolutionary programming. , 1995, Chemistry & biology.

[59]  Hanoch Senderowitz,et al.  SeleX-CS: A New Consensus Scoring Algorithm for Hit Discovery and Lead Optimization , 2009, J. Chem. Inf. Model..

[60]  Thomas Stützle,et al.  Empirical Scoring Functions for Advanced Protein-Ligand Docking with PLANTS , 2009, J. Chem. Inf. Model..

[61]  Gennady M Verkhivker Computational analysis of ligand binding dynamics at the intermolecular hot spots with the aid of simulated tempering and binding free energy calculations. , 2004, Journal of molecular graphics & modelling.

[62]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[63]  Marcel L Verdonk,et al.  General and targeted statistical potentials for protein–ligand interactions , 2005, Proteins.