Enrichment of Chemical Libraries Docked to Protein Conformational Ensembles and Application to Aldehyde Dehydrogenase 2

Molecular recognition is a complex process that involves a large ensemble of structures of the receptor and ligand. Yet, most structure-based virtual screening is carried out on a single structure typically from X-ray crystallography. Explicit-solvent molecular dynamics (MD) simulations offer an opportunity to sample multiple conformational states of a protein. Here we evaluate our recently developed scoring method SVMSP in its ability to enrich chemical libraries docked to MD structures of seven proteins from the Directory of Useful Decoys (DUD). SVMSP is a target-specific rescoring method that combines machine learning with statistical potentials. We find that enrichment power as measured by the area under the ROC curve (ROC-AUC) is not affected by increasing the number of MD structures. Among individual MD snapshots, many exhibited enrichment that was significantly better than the crystal structure, but no correlation between enrichment and structural deviation from crystal structure was found. We followed an innovative approach by training SVMSP scoring models using MD structures (SVMSPMD). The resulting models were applied to two difficult cases (p38 and CDK2) for which enrichment was not better than random. We found remarkable increase in enrichment power, particularly for p38, where the ROC-AUC increased by 0.30 to 0.85. Finally, we explored approaches for a priori identification of MD snapshots with high enrichment power from an MD simulation in the absence of active compounds. We found that the use of randomly selected compounds docked to the target of interest using SVMSP led to notable enrichment for EGFR and Src MD snapshots. SVMSP rescoring of protein–compound MD structures was applied for the search of small-molecule inhibitors of the mitochondrial enzyme aldehyde dehydrogenase 2 (ALDH2). Rank-ordering of a commercial library of 50 000 compounds docked to MD structures of ALDH2 led to five small-molecule inhibitors. Four compounds had IC50s below 5 μM. These compounds serve as leads for the design and synthesis of more potent and selective ALDH2 inhibitors.

[1]  Ingo Muegge Effect of ligand volume correction on PMF scoring , 2001, J. Comput. Chem..

[2]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[3]  I. Muegge A knowledge-based scoring function for protein-ligand interactions: Probing the reference state , 2000 .

[4]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[5]  Brian K Shoichet,et al.  Structure-based drug screening for G-protein-coupled receptors. , 2012, Trends in pharmacological sciences.

[6]  Avner Schlessinger,et al.  Molecular modeling and ligand docking for solute carrier (SLC) transporters. , 2013, Current topics in medicinal chemistry.

[7]  Jing Li,et al.  Targeting multiple conformations leads to small molecule inhibitors of the uPAR·uPA protein-protein interaction that block cancer cell invasion. , 2011, ACS chemical biology.

[8]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[9]  Xiaoqin Zou,et al.  An iterative knowledge‐based scoring function to predict protein–ligand interactions: II. Validation of the scoring function , 2006, J. Comput. Chem..

[10]  K. Merz,et al.  Large-scale validation of a quantum mechanics based scoring function: predicting the binding affinity and the binding mode of a diverse set of protein-ligand complexes. , 2005, Journal of medicinal chemistry.

[11]  James Andrew McCammon,et al.  Predictive Power of Molecular Dynamics Receptor Structures in Virtual Screening , 2011, J. Chem. Inf. Model..

[12]  Hans-Joachim Böhm,et al.  The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure , 1994, J. Comput. Aided Mol. Des..

[13]  Bo Wang,et al.  Support Vector Regression Scoring of Receptor-Ligand Complexes for Rank-Ordering and Virtual Screening of Chemical Libraries , 2011, J. Chem. Inf. Model..

[14]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[15]  E. Shakhnovich,et al.  SMall Molecule Growth 2001 (SMoG2001): an improved knowledge-based scoring function for protein-ligand interactions. , 2002, Journal of medicinal chemistry.

[16]  Gerhard Klebe,et al.  DSX: A Knowledge-Based Scoring Function for the Assessment of Protein-Ligand Complexes , 2011, J. Chem. Inf. Model..

[17]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[18]  Yong Duan,et al.  Distinguish protein decoys by Using a scoring function based on a new AMBER force field, short molecular dynamics simulations, and the generalized born solvent model , 2004, Proteins.

[19]  G. Klebe,et al.  Virtual screening for potential inhibitors of bacterial MurC and MurD ligases , 2012, Journal of Molecular Modeling.

[20]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[21]  I. Kuntz,et al.  Molecular docking to ensembles of protein structures. , 1997, Journal of molecular biology.

[22]  Per Källblad,et al.  Receptor flexibility in the in silico screening of reagents in the S1' pocket of human collagenase. , 2004, Journal of medicinal chemistry.

[23]  Niu Huang,et al.  Physics-Based Scoring of Protein-Ligand Complexes: Enrichment of Known Inhibitors in Large-Scale Virtual Screening , 2006, J. Chem. Inf. Model..

[24]  Chunmei Yang,et al.  Discovery of new potent inhibitors for carbonic anhydrase IX by structure-based virtual screening. , 2013, Bioorganic & medicinal chemistry letters.

[25]  J Andrew McCammon,et al.  Discovery of Novel Inhibitors of HIV‐1 Reverse Transcriptase Through Virtual Screening of Experimental and Theoretical Ensembles , 2014, Chemical biology & drug design.

[26]  Young-Ho Lee,et al.  Z-DNA binding proteins as targets for structure-based virtual screening. , 2010, Current drug targets.

[27]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[28]  J. Richardson,et al.  Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation. , 1999, Journal of molecular biology.

[29]  Maria A Miteva,et al.  Exploring NMR ensembles of calcium binding proteins: Perspectives to design inhibitors of protein-protein interactions , 2011, BMC Structural Biology.

[30]  Vasilis Vasiliou,et al.  Analysis and update of the human aldehyde dehydrogenase (ALDH) gene family , 2005, Human Genomics.

[31]  Mengang Xu,et al.  Utilizing Experimental Data for Reducing Ensemble Size in Flexible-Protein Docking , 2012, J. Chem. Inf. Model..

[32]  J. Mccammon,et al.  Exploring the role of receptor flexibility in structure-based drug discovery. , 2014, Biophysical chemistry.

[33]  Lin-Li Li,et al.  ID-Score: A New Empirical Scoring Function Based on a Comprehensive Set of Descriptors Related to Protein-Ligand Interactions , 2013, J. Chem. Inf. Model..

[34]  William L. Jorgensen,et al.  Journal of Chemical Information and Modeling , 2005, J. Chem. Inf. Model..

[35]  G. Klebe,et al.  Knowledge-based scoring function to predict protein-ligand interactions. , 2000, Journal of molecular biology.

[36]  G. Ciccotti,et al.  Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-Alkanes , 1977 .

[37]  B Tidor,et al.  Computation of electrostatic complements to proteins: A case of charge stabilized binding , 1998, Protein science : a publication of the Protein Society.

[38]  May Khanna,et al.  Discovery of novel regulators of aldehyde dehydrogenase isoenzymes. , 2011, Chemico-biological interactions.

[39]  Amedeo Caflisch,et al.  Flaviviral protease inhibitors identified by fragment-based library docking into a structure generated by molecular dynamics. , 2009, Journal of medicinal chemistry.

[40]  J. Irwin,et al.  Benchmarking sets for molecular docking. , 2006, Journal of medicinal chemistry.

[41]  K. Morgan,et al.  Targeting of a Novel Ca2+/Calmodulin-Dependent Protein Kinase II Is Essential for Extracellular Signal-Regulated Kinase–Mediated Signaling in Differentiated Smooth Muscle Cells , 2005, Circulation research.

[42]  György M. Keserü,et al.  The Impact of Molecular Dynamics Sampling on the Performance of Virtual Screening against GPCRs , 2013, J. Chem. Inf. Model..

[43]  A. Srivastava,et al.  CaMKII knockdown attenuates H2O2-induced phosphorylation of ERK1/2, PKB/Akt, and IGF-1R in vascular smooth muscle cells. , 2009, Free radical biology & medicine.

[44]  Gennady Verkhivker,et al.  Integrating Ligand-Based and Protein-Centric Virtual Screening of Kinase Inhibitors Using Ensembles of Multiple Protein Kinase Genes and Conformations , 2012, J. Chem. Inf. Model..

[45]  Huan Meng,et al.  A Novel Anti-Tumor Inhibitor Identified by Virtual Screen with PLK1 Structure and Zebrafish Assay , 2013, PloS one.

[46]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.

[47]  Alain Milon,et al.  Virtual and Biophysical Screening Targeting the γ-Tubulin Complex – A New Target for the Inhibition of Microtubule Nucleation , 2013, PloS one.

[48]  G. Klebe,et al.  DrugScore(CSD)-knowledge-based scoring function derived from small molecule crystal data with superior recognition rate of near-native ligand poses and better affinity prediction. , 2005, Journal of medicinal chemistry.

[49]  Alessandro Curioni,et al.  New Scoring Functions for Virtual Screening from Molecular Dynamics Simulations with a Quantum-Refined Force-Field (QRFF-MD). Application to Cyclin-Dependent Kinase 2 , 2006, J. Chem. Inf. Model..

[50]  J. A. Grant,et al.  Gaussian docking functions. , 2003, Biopolymers.

[51]  Thomas Stützle,et al.  Empirical Scoring Functions for Advanced Protein-Ligand Docking with PLANTS , 2009, J. Chem. Inf. Model..

[52]  J. Pin,et al.  Virtual screening workflow development guided by the "receiver operating characteristic" curve approach. Application to high-throughput docking on metabotropic glutamate receptor subtype 4. , 2005, Journal of medicinal chemistry.

[53]  Woody Sherman,et al.  Large-Scale Systematic Analysis of 2D Fingerprint Methods and Parameters to Improve Virtual Screening Enrichments , 2010, J. Chem. Inf. Model..

[54]  Janet M. Thornton,et al.  BLEEP - potential of mean force describing protein-ligand interactions: I. Generating potential , 1999, J. Comput. Chem..

[55]  Woody Sherman,et al.  Protein and ligand preparation: parameters, protocols, and influence on virtual screening enrichments , 2013, Journal of Computer-Aided Molecular Design.

[56]  Liwei Li,et al.  Target-Specific Support Vector Machine Scoring in Structure-Based Virtual Screening: Computational Validation, In Vitro Testing in Kinases, and Effects on Lung Cancer Cell Proliferation , 2011, J. Chem. Inf. Model..

[57]  Y. Martin,et al.  A general and fast scoring function for protein-ligand interactions: a simplified potential approach. , 1999, Journal of medicinal chemistry.

[58]  Alessandro Curioni,et al.  New Scoring Functions for Virtual Screening from Molecular Dynamics Simulations with a Quantum‐Refined Force‐Field (QRFF‐MD). Application to Cyclin‐Dependent Kinase 2. , 2006 .

[59]  Janet M. Thornton,et al.  BLEEP—potential of mean force describing protein–ligand interactions: I. Generating potential , 1999 .

[60]  Luhua Lai,et al.  SCORE: A New Empirical Method for Estimating the Binding Affinity of a Protein-Ligand Complex , 1998 .

[61]  Ping Zhang,et al.  Calcium/calmodulin‐dependent kinase activity is required for efficient induction of osteoclast differentiation and bone resorption by receptor activator of nuclear factor kappa B ligand (RANKL) , 2007, Journal of cellular physiology.

[62]  Christoph A. Sotriffer,et al.  SFCscoreRF: A Random Forest-Based Scoring Function for Improved Affinity Prediction of Protein-Ligand Complexes , 2013, J. Chem. Inf. Model..

[63]  Maryse Lowinski,et al.  The use of virtual screening and differential scanning fluorimetry for the rapid identification of fragments active against MEK1. , 2013, Bioorganic & medicinal chemistry letters.

[64]  Woody Sherman,et al.  Analysis and comparison of 2D fingerprints: insights into database screening performance using eight fingerprint methods , 2010, J. Cheminformatics.

[65]  Xiaoqin Zou,et al.  Efficient molecular docking of NMR structures: Application to HIV‐1 protease , 2006, Protein science : a publication of the Protein Society.

[66]  Ryan G. Coleman,et al.  ZINC: A Free Tool to Discover Chemistry for Biology , 2012, J. Chem. Inf. Model..

[67]  Nikolay V. Dokholyan,et al.  MedusaScore: An Accurate Force Field-Based Scoring Function for Virtual Drug Screening , 2008, J. Chem. Inf. Model..

[68]  Bo Wang,et al.  Molecular Recognition in a Diverse Set of Protein-Ligand Interactions Studied with Molecular Dynamics Simulations and End-Point Free Energy Calculations , 2013, J. Chem. Inf. Model..

[69]  Tania Pencheva,et al.  BMC Bioinformatics BioMed Central Methodology article AMMOS: Automated Molecular Mechanics Optimization tool for in silico Screening , 2022 .

[70]  A. Caflisch,et al.  Discovery of Tyrosine Kinase Inhibitors by Docking into an Inactive Kinase Conformation Generated by Molecular Dynamics , 2012, ChemMedChem.

[71]  Xiaoqin Zou,et al.  Inclusion of Solvation and Entropy in the Knowledge-Based Scoring Function for Protein-Ligand Interactions , 2010, J. Chem. Inf. Model..

[72]  Tingjun Hou,et al.  Feasibility of Using Molecular Docking-Based Virtual Screening for Searching Dual Target Kinase Inhibitors , 2013, J. Chem. Inf. Model..

[73]  R. Glen,et al.  Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. , 1995, Journal of molecular biology.

[74]  Zheng Zheng,et al.  Ligand Identification Scoring Algorithm (LISA) , 2011, J. Chem. Inf. Model..

[75]  Wei Zhang,et al.  A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculations , 2003, J. Comput. Chem..

[76]  Didier Rognan,et al.  sc-PDB: an Annotated Database of Druggable Binding Sites from the Protein Data Bank , 2006, J. Chem. Inf. Model..

[77]  Todd J. A. Ewing,et al.  DOCK 4.0: Search strategies for automated molecular docking of flexible molecule databases , 2001, J. Comput. Aided Mol. Des..

[78]  S. Collins,et al.  CaMKII regulates retinoic acid receptor transcriptional activity and the differentiation of myeloid leukemia cells. , 2007, The Journal of clinical investigation.

[79]  V. Bernardes-Génisson,et al.  Cross-docking study on InhA inhibitors: a combination of Autodock Vina and PM6-DH2 simulations to retrieve bio-active conformations. , 2012, Organic & biomolecular chemistry.