Leveraging Data Fusion Strategies in Multireceptor Lead Optimization MM/GBSA End-Point Methods.

Accurate and efficient affinity calculations are critical to enhancing the contribution of in silico modeling during the lead optimization phase of a drug discovery campaign. Here, we present a large-scale study of the efficacy of data fusion strategies that leverage results from end-point MM/GBSA calculations in multiple receptors to identify potent inhibitors among an ensemble of congeneric ligands. A retrospective analysis of 13 congeneric ligand series, curated from publicly available data across seven biological targets, demonstrates that, in 90% of the individual receptor structures, MM/GBSA scores successfully identify subsets of inhibitors that are more potent than a random selection, and that data fusion strategies combining MM/GBSA scores from each of the receptors significantly increase the robustness of the predictions. Among nine different data fusion metrics based on consensus scores or receptor rankings, the SumZScore (i.e., converting MM/GBSA scores into standardized Z-scores within a receptor and computing the sum of the Z-scores for a given ligand across the ensemble of receptors) is found to be a robust and physically meaningful metric for combining results across multiple receptors. Perhaps most surprisingly, even with relatively low to modest overall correlations between SumZScore and experimental binding affinities, SumZScore tends to reliably prioritize subsets of inhibitors that are at least as potent as those prioritized from a "best" single receptor identified from known compounds within the congeneric series.
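
The SumZScore definition given in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`sum_z_scores`, `rank_ligands`), the input layout, the use of the population standard deviation, and the example scores are all assumptions made for illustration. It also assumes that more negative MM/GBSA scores indicate more favorable binding, so ligands are ranked by ascending SumZScore.

```python
from statistics import mean, pstdev


def sum_z_scores(scores_by_receptor):
    """Combine per-receptor scores into one SumZScore per ligand.

    scores_by_receptor maps receptor name -> {ligand name: MM/GBSA score}.
    Assumption: every ligand has a score in every receptor.
    """
    totals = {}
    for ligand_scores in scores_by_receptor.values():
        values = list(ligand_scores.values())
        mu = mean(values)
        # Assumption: population standard deviation; a sample standard
        # deviation would rescale each receptor's Z-scores slightly.
        sigma = pstdev(values)
        for ligand, score in ligand_scores.items():
            # A receptor with identical scores for all ligands carries no
            # ranking information, so it contributes zero.
            z = (score - mu) / sigma if sigma > 0 else 0.0
            totals[ligand] = totals.get(ligand, 0.0) + z
    return totals


def rank_ligands(totals):
    """Return ligand names ordered from most to least favorable SumZScore."""
    return sorted(totals, key=totals.get)


# Hypothetical scores (kcal/mol) for three ligands in two receptor structures.
example_scores = {
    "receptor_A": {"lig1": -50.0, "lig2": -40.0, "lig3": -30.0},
    "receptor_B": {"lig1": -66.0, "lig2": -70.0, "lig3": -56.0},
}

if __name__ == "__main__":
    fused = sum_z_scores(example_scores)
    for name in rank_ligands(fused):
        print(f"{name}: {fused[name]:+.3f}")
```

Standardizing within each receptor before summing puts the receptors on a common scale, so a structure that produces systematically larger-magnitude MM/GBSA scores does not dominate the combined ranking.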
