Potential and Limitations of Ensemble Docking

A major problem in structure-based virtual screening is selecting the protein structure, or set of structures, to screen against. It is not known a priori which protein structure(s) will perform best in a virtual screening experiment. We investigated the performance of ensemble docking as a function of ensemble size for eight targets of pharmaceutical interest. Starting from docking results for single protein structures, up to 500,000 combinations of protein structures were generated for each ensemble size, and pose prediction and virtual screening results were derived for each ensemble. Comparing single-structure with multiple-structure results suggests improvements in both worst-case and average performance: the worst and the average over all ensembles of size two or greater are better than the worst and the average over all single protein structures, respectively. We identified several key factors affecting ensemble docking performance, including the sampling accuracy of the docking algorithm, the choice of scoring function, and the similarity of the database ligands to the cocrystallized ligands of the ligand-bound protein structures in an ensemble. Because of these factors, the prospective selection of optimal ensembles is a challenging task, as shown by a reassessment of published ensemble selection protocols.
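
The ensemble construction described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several details are assumptions made only for illustration: the function names `sample_ensembles` and `ensemble_scores`, the rule that each ligand keeps its best score over the ensemble members, the convention that lower scores are better, and the use of random sampling once the number of possible combinations exceeds the cap.

```python
import itertools
import math
import random


def sample_ensembles(structure_ids, size, max_combinations=500_000, seed=0):
    """Return up to max_combinations distinct ensembles of the given size.

    Assumption: all combinations are enumerated when they fit under the cap;
    otherwise distinct combinations are drawn at random.
    """
    total = math.comb(len(structure_ids), size)
    if total <= max_combinations:
        return list(itertools.combinations(structure_ids, size))
    rng = random.Random(seed)
    chosen = set()
    while len(chosen) < max_combinations:
        # Sort each draw so that the same set of structures is counted once.
        chosen.add(tuple(sorted(rng.sample(structure_ids, size))))
    return sorted(chosen)


def ensemble_scores(single_scores, ensemble):
    """Merge single-structure docking scores into one score per ligand.

    Assumption: the ensemble score of a ligand is its best (lowest) score
    over all protein structures in the ensemble.
    """
    ligands = single_scores[ensemble[0]].keys()
    return {lig: min(single_scores[s][lig] for s in ensemble) for lig in ligands}


# Hypothetical docking scores: structure -> ligand -> score (lower is better).
single_scores = {
    "A": {"lig1": -7.0, "lig2": -5.0},
    "B": {"lig1": -6.0, "lig2": -8.0},
    "C": {"lig1": -4.0, "lig2": -6.5},
}

pairs = sample_ensembles(list(single_scores), size=2)
merged = {ens: ensemble_scores(single_scores, ens) for ens in pairs}
```

Because the merged scores are computed from the stored single-structure results, no additional docking runs are needed per ensemble, which is what makes evaluating a large number of combinations feasible.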
