Computation of Binding Energies Including Their Enthalpy and Entropy Components for Protein-Ligand Complexes Using Support Vector Machines

Computing binding energies of protein-ligand complexes including their enthalpy and entropy terms by means of computational methods is an appealing approach for selecting initial hits and for further optimization in early stages of drug discovery. Despite the importance, computational predictions of thermodynamic components have evaded attention and reasonable solutions. In this study, support vector machines are used for developing scoring functions to compute binding energies and their enthalpy and entropy components of protein-ligand complexes. The binding energies computed from our newly derived scoring functions have better Pearson's correlation coefficients with experimental data than previously reported scoring functions in benchmarks for protein-ligand complexes from the PDBBind database. The protein-ligand complexes with binding energies dominated by enthalpy or entropy term could be qualitatively classified by the newly derived scoring functions with high accuracy. Furthermore, it is found that the inclusion of comprehensive descriptors based on ligand properties in the scoring functions improved the accuracy of classification as well as the prediction of binding energies including their thermodynamic components. The prediction of binding energies including the enthalpy and entropy components using the support vector machine based scoring functions should be of value in the drug discovery process.

[1]  Jaques Reifman,et al.  Support vector machines with selective kernel scaling for protein classification and identification of key amino acid positions , 2002, Bioinform..

[2]  Renxiao Wang,et al.  The PDBbind database: methodologies and updates. , 2005, Journal of medicinal chemistry.

[3]  Richard D. Taylor,et al.  Improved protein–ligand docking using GOLD , 2003, Proteins.

[4]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[5]  György G Ferenczy,et al.  Thermodynamics guided lead discovery and optimization. , 2010, Drug discovery today.

[6]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[7]  Robert L. Grossman What is analytic infrastructure and why should you care? , 2009, SKDD.

[8]  Peter J. Reilly,et al.  Specific empirical free energy function for automated docking of carbohydrates to proteins , 2003, J. Comput. Chem..

[9]  Joel R. Bock,et al.  A New Method to Estimate Ligand-Receptor Energetics* , 2002, Molecular & Cellular Proteomics.

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  Thorsten Meinl,et al.  KNIME: The Konstanz Information Miner , 2007, GfKl.

[12]  David S. Goodsell,et al.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility , 2009, J. Comput. Chem..

[13]  Jacob Kongsted,et al.  Accurate predictions of nonpolar solvation free energies require explicit consideration of binding-site hydration. , 2011, Journal of the American Chemical Society.

[14]  ANATOLY M. RUVINSKY Role of binding entropy in the refinement of protein–ligand docking predictions: Analysis based on the use of 11 scoring functions , 2007, J. Comput. Chem..

[15]  Scott Bowden,et al.  Drug resistance in antiviral therapy. , 2010, Clinics in liver disease.

[16]  E. Freire Do enthalpy and entropy distinguish first in class from best in class? , 2008, Drug discovery today.

[17]  Shaomeng Wang,et al.  An Extensive Test of 14 Scoring Functions Using the PDBbind Refined Set of 800 Protein-Ligand Complexes , 2004, J. Chem. Inf. Model..

[18]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[19]  Tjelvar S. G. Olsson,et al.  The thermodynamics of protein-ligand interaction and solvation: insights for ligand design. , 2008, Journal of molecular biology.

[20]  Philip E. Bourne,et al.  A Machine Learning-Based Method To Improve Docking Scoring Functions and Its Application to Drug Repurposing , 2011, J. Chem. Inf. Model..

[21]  Otto Exner,et al.  Entropy–enthalpy compensation and anticompensation: solvation and ligand binding , 2000 .

[22]  Anirudh Ranganathan Protein – Ligand Binding: Estimation of Binding Free Energies , 2012 .

[23]  Yu-dong Cai,et al.  Support vector machines for predicting rRNA-, RNA-, and DNA-binding proteins from amino acid sequence. , 2003, Biochimica et biophysica acta.

[24]  Luhua Lai,et al.  Further development and validation of empirical scoring functions for structure-based binding affinity prediction , 2002, J. Comput. Aided Mol. Des..

[25]  Celia A Schiffer,et al.  Extreme entropy-enthalpy compensation in a drug-resistant variant of HIV-1 protease. , 2012, ACS chemical biology.

[26]  Yu-Dong Cai,et al.  Support vector machines for prediction of protein domain structural class. , 2003, Journal of theoretical biology.

[27]  Magnolia Vanegas,et al.  Decreasing the configurational entropy and the hydrophobicity of EBV-derived peptide 11389 increased its antigenicity , immunogenicity and its ability of inducing IL-6 , 2012 .

[28]  Liwei Li,et al.  Target-Specific Support Vector Machine Scoring in Structure-Based Virtual Screening: Computational Validation, In Vitro Testing in Kinases, and Effects on Lung Cancer Cell Proliferation , 2011, J. Chem. Inf. Model..

[29]  Renxiao Wang,et al.  Comparative evaluation of 11 scoring functions for molecular docking. , 2003, Journal of medicinal chemistry.

[30]  W. Sherman,et al.  Thermodynamic analysis of water molecules at the surface of proteins and applications to binding site prediction and characterization , 2011, Proteins.

[31]  Garland R. Marshall,et al.  PHOENIX: A Scoring Function for Affinity Prediction Derived Using High-Resolution Crystal Structures and Calorimetry Measurements , 2011, J. Chem. Inf. Model..

[32]  Gastone Gilli,et al.  Enthalpy-entropy compensation in drug-receptor binding , 1994 .

[33]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[34]  Yoshifumi Fukunishi,et al.  Structure-based drug screening and ligand-based drug screening with machine learning. , 2009, Combinatorial chemistry & high throughput screening.

[35]  L. Amzel,et al.  Compensating Enthalpic and Entropic Changes Hinder Binding Affinity Optimization , 2007, Chemical biology & drug design.

[36]  B. Lee,et al.  Enthalpy-entropy compensation in the thermodynamics of hydrophobicity. , 1994, Biophysical chemistry.

[37]  J. Chaires,et al.  Thermodynamic studies for drug design and screening , 2012, Expert opinion on drug discovery.

[38]  Xiaoqin Zou,et al.  Inclusion of Solvation and Entropy in the Knowledge-Based Scoring Function for Protein-Ligand Interactions , 2010, J. Chem. Inf. Model..

[39]  Samy O Meroueh,et al.  PDBcal: A Comprehensive Dataset for Receptor–Ligand Interactions with Three‐dimensional Structures and Binding Thermodynamics from Isothermal Titration Calorimetry , 2008, Chemical biology & drug design.

[40]  Thomas Hofmann,et al.  Predicting CNS Permeability of Drug Molecules: Comparison of Neural Network and Support Vector Machine Algorithms , 2002, J. Comput. Biol..

[41]  Bo Wang,et al.  Support Vector Regression Scoring of Receptor-Ligand Complexes for Rank-Ordering and Virtual Screening of Chemical Libraries , 2011, J. Chem. Inf. Model..

[42]  Trilce Estrada,et al.  Evaluation of Several Two-Step Scoring Functions Based on Linear Interaction Energy, Effective Ligand Size, and Empirical Pair Potentials for Prediction of Protein-Ligand Binding Geometry and Free Energy , 2011, J. Chem. Inf. Model..

[43]  S. Sathiya Keerthi,et al.  Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[44]  David A. Gough,et al.  Predicting protein-protein interactions from primary structure , 2001, Bioinform..

[45]  Ronald M. Levy,et al.  Entropy−Enthalpy Compensation in Solvation and Ligand Binding Revisited , 1998 .

[46]  Jing Li,et al.  Knowledge-Based Scoring Functions in Drug Design: 3. A Two-Dimensional Knowledge-Based Hydrogen-Bonding Potential for the Prediction of Protein-Ligand Interactions , 2011, J. Chem. Inf. Model..

[47]  Gary B. Fogel,et al.  Computational Intelligence Methods for Docking Scores , 2009 .

[48]  H. Bosshard,et al.  Isothermal titration calorimetry and differential scanning calorimetry as complementary tools to investigate the energetics of biomolecular recognition , 1999, Journal of molecular recognition : JMR.

[49]  Garland R. Marshall,et al.  VALIDATE: A New Method for the Receptor-Based Prediction of Binding Affinities of Novel Ligands , 1996 .