Develop and Test a Solvent Accessible Surface Area-Based Model in Conformational Entropy Calculations

It is of great interest in modern drug design to accurately calculate the free energies of protein-ligand or nucleic acid-ligand binding. MM-PBSA (molecular mechanics Poisson-Boltzmann surface area) and MM-GBSA (molecular mechanics generalized Born surface area) have gained popularity in this field. For both methods, the conformational entropy, which is usually calculated through normal-mode analysis (NMA), is needed to calculate the absolute binding free energies. Unfortunately, NMA is computationally demanding and becomes a bottleneck of the MM-PB/GBSA-NMA methods. In this work, we have developed a fast approach to estimate the conformational entropy based upon solvent accessible surface area calculations. In our approach, the conformational entropy of a molecule, S, can be obtained by summing up the contributions of all atoms, no matter they are buried or exposed. Each atom has two types of surface areas, solvent accessible surface area (SAS) and buried SAS (BSAS). The two types of surface areas are weighted to estimate the contribution of an atom to S. Atoms having the same atom type share the same weight and a general parameter k is applied to balance the contributions of the two types of surface areas. This entropy model was parametrized using a large set of small molecules for which their conformational entropies were calculated at the B3LYP/6-31G* level taking the solvent effect into account. The weighted solvent accessible surface area (WSAS) model was extensively evaluated in three tests. For convenience, TS values, the product of temperature T and conformational entropy S, were calculated in those tests. T was always set to 298.15 K through the text. First of all, good correlations were achieved between WSAS TS and NMA TS for 44 protein or nucleic acid systems sampled with molecular dynamics simulations (10 snapshots were collected for postentropy calculations): the mean correlation coefficient squares (R²) was 0.56. As to the 20 complexes, the TS changes upon binding; TΔS values were also calculated, and the mean R² was 0.67 between NMA and WSAS. In the second test, TS values were calculated for 12 proteins decoy sets (each set has 31 conformations) generated by the Rosetta software package. Again, good correlations were achieved for all decoy sets: the mean, maximum, and minimum of R² were 0.73, 0.89, and 0.55, respectively. Finally, binding free energies were calculated for 6 protein systems (the numbers of inhibitors range from 4 to 18) using four scoring functions. Compared to the measured binding free energies, the mean R² of the six protein systems were 0.51, 0.47, 0.40, and 0.43 for MM-GBSA-WSAS, MM-GBSA-NMA, MM-PBSA-WSAS, and MM-PBSA-NMA, respectively. The mean rms errors of prediction were 1.19, 1.24, 1.41, 1.29 kcal/mol for the four scoring functions, correspondingly. Therefore, the two scoring functions employing WSAS achieved a comparable prediction performance to that of the scoring functions using NMA. It should be emphasized that no minimization was performed prior to the WSAS calculation in the last test. Although WSAS is not as rigorous as physical models such as quasi-harmonic analysis and thermodynamic integration (TI), it is computationally very efficient as only surface area calculation is involved and no structural minimization is required. Moreover, WSAS has achieved a comparable performance to normal-mode analysis. We expect that this model could find its applications in the fields like high throughput screening (HTS), molecular docking, and rational protein design. In those fields, efficiency is crucial since there are a large number of compounds, docking poses, or protein models to be evaluated. A list of acronyms and abbreviations used in this work is provided for quick reference.

[1]  T Darden,et al.  New tricks for modelers from the crystallography toolkit: the particle mesh Ewald algorithm and its use in nucleic acid simulations. , 1999, Structure.

[2]  H. Meirovitch,et al.  Calculation of the entropy and free energy of peptides by molecular dynamics simulations using the hypothetical scanning molecular dynamics method. , 2006, The Journal of chemical physics.

[3]  Richard H. Henchman,et al.  Revisiting free energy calculations: a theoretical connection to MM/PBSA and direct calculation of the association free energy. , 2004, Biophysical journal.

[4]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[5]  R. Abagyan,et al.  Biased probability Monte Carlo conformational searches and electrostatic calculations for peptides and proteins. , 1994, Journal of molecular biology.

[6]  K. Sharp,et al.  Calculation of configurational entropy with a Boltzmann-quasiharmonic model: the origin of high-affinity protein-ligand binding. , 2011, The journal of physical chemistry. B.

[7]  Philip E. Bourne,et al.  The Protein Data Bank (PDB) | NIST , 2002 .

[8]  M. Karplus,et al.  Method for estimating the configurational entropy of macromolecules , 1981 .

[9]  H. Meirovitch Methods for calculating the absolute entropy and free energy of biological systems based on ideas from polymer physics , 2009, Journal of molecular recognition : JMR.

[10]  P. Kollman,et al.  Continuum Solvent Studies of the Stability of DNA, RNA, and Phosphoramidate−DNA Helices , 1998 .

[11]  W. L. Jorgensen Free energy calculations: a breakthrough for modeling organic chemistry in solution , 1989 .

[12]  A. Nicholls,et al.  Ligand Entropy in Gas-Phase, Upon Solvation and Protein Complexation. Fast Estimation with Quasi-Newton Hessian. , 2010, Journal of chemical theory and computation.

[13]  R. C. Weast CRC Handbook of Chemistry and Physics , 1973 .

[14]  Rafael Brüschweiler,et al.  Evaluation of configurational entropy methods from peptide folding-unfolding simulation. , 2007, The journal of physical chemistry. B.

[15]  B. Kuhn,et al.  Validation and use of the MM-PBSA approach for drug discovery. , 2005, Journal of medicinal chemistry.

[16]  P. Kollman,et al.  Biomolecular simulations: recent developments in force fields, simulations of enzyme catalysis, protein-ligand, protein-protein, and protein-nucleic acid noncovalent interactions. , 2001, Annual review of biophysics and biomolecular structure.

[17]  M. Murcko,et al.  Crystal Structure of HIV-1 Protease in Complex with Vx-478, a Potent and Orally Bioavailable Inhibitor of the Enzyme , 1995 .

[18]  P. Kollman,et al.  Automatic atom type and bond type perception in molecular mechanical calculations. , 2006, Journal of molecular graphics & modelling.

[19]  Marian Anghel,et al.  Synchronization of trajectories in canonical molecular-dynamics simulations: observation, explanation, and exploitation. , 2004, The Journal of chemical physics.

[20]  M. Sanner,et al.  Reduced surface: an efficient way to compute molecular surfaces. , 1996, Biopolymers.

[21]  Tingjun Hou,et al.  Assessing the performance of the molecular mechanics/Poisson Boltzmann surface area and molecular mechanics/generalized Born surface area methods. II. The accuracy of ranking poses generated from docking , 2011, J. Comput. Chem..

[22]  D. Beveridge,et al.  Free energy via molecular simulation: applications to chemical and biomolecular systems. , 1989, Annual review of biophysics and biophysical chemistry.

[23]  M. Gilson,et al.  The statistical-thermodynamic basis for computation of binding affinities: a critical review. , 1997, Biophysical journal.

[24]  Tingjun Hou,et al.  Development of Reliable Aqueous Solubility Models and Their Application in Druglike Analysis , 2007, J. Chem. Inf. Model..

[25]  Xiaojie Xu,et al.  Recent Advances in Free Energy Calculations with a Combination of Molecular Mechanics and Continuum Models , 2006 .

[26]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[27]  P. Kollman,et al.  Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models , 1992 .

[28]  W. M. Haynes CRC Handbook of Chemistry and Physics , 1990 .

[29]  M. Gilson,et al.  Free energy, entropy, and induced fit in host-guest recognition: calculations with the second-generation mining minima algorithm. , 2004, Journal of the American Chemical Society.

[30]  P A Kollman,et al.  An analysis of the interactions between the Sem-5 SH3 domain and its ligands using molecular dynamics, free energy calculations, and sequence analysis. , 2001, Journal of the American Chemical Society.

[31]  Hans-Joachim Böhm,et al.  The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure , 1994, J. Comput. Aided Mol. Des..

[32]  Michael K Gilson,et al.  Concepts in receptor optimization: targeting the RGD peptide. , 2006, Journal of the American Chemical Society.

[33]  P. Kollman,et al.  An approach to computing electrostatic charges for molecules , 1984 .

[34]  K. Dill,et al.  Binding of small-molecule ligands to proteins: "what you see" is not always "what you get". , 2009, Structure.

[35]  P. Kollman,et al.  Use of MM-PBSA in reproducing the binding free energies to HIV-1 RT of TIBO derivatives and predicting the binding mode to HIV-1 RT of efavirenz by docking and MM-PBSA. , 2001, Journal of the American Chemical Society.

[36]  Piotr Cieplak,et al.  Molecular dynamics and free energy analyses of cathepsin D-inhibitor interactions: insight into structure-based ligand design. , 2002, Journal of medicinal chemistry.

[37]  I. Kuntz,et al.  Hierarchical database screenings for HIV-1 reverse transcriptase using a pharmacophore model, rigid docking, solvation docking, and MM-PB/SA. , 2005, Journal of medicinal chemistry.

[38]  Jacob Kongsted,et al.  An improved method to predict the entropy term with the MM/PBSA approach , 2009, J. Comput. Aided Mol. Des..

[39]  W. V. van Gunsteren,et al.  Estimating entropies from molecular dynamics simulations. , 2004, The Journal of chemical physics.

[40]  E. Goldsmith,et al.  Structural basis of inhibitor selectivity in MAP kinases. , 1998, Structure.

[41]  P. Kollman,et al.  How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? , 2000 .

[42]  M. Gilson,et al.  Ligand configurational entropy and protein binding , 2007, Proceedings of the National Academy of Sciences.

[43]  Jacob Kongsted,et al.  Accurate predictions of nonpolar solvation free energies require explicit consideration of binding-site hydration. , 2011, Journal of the American Chemical Society.

[44]  P. Kollman,et al.  Combined molecular mechanical and continuum solvent approach (MM-PBSA/GBSA) to predict ligand binding , 2000 .

[45]  Guang Song,et al.  How well can we understand large-scale protein motions using normal modes of elastic network models? , 2007, Biophysical journal.

[46]  V. Hornak,et al.  Comparison of multiple Amber force fields and development of improved protein backbone parameters , 2006, Proteins.

[47]  Gregory D. Hawkins,et al.  Parametrized Models of Aqueous Free Energies of Solvation Based on Pairwise Descreening of Solute Atomic Charges from a Dielectric Medium , 1996 .

[48]  Ray Luo,et al.  Virtual screening using molecular simulations , 2011, Proteins.

[49]  Celeste Sagui,et al.  Towards an accurate representation of electrostatics in classical force fields: efficient implementation of multipolar interactions in biomolecular simulations. , 2004, The Journal of chemical physics.

[50]  D. Baker,et al.  Molecular dynamics in the endgame of protein structure prediction. , 2001, Journal of molecular biology.

[51]  I. Bahar,et al.  Coarse-grained normal mode analysis in structural biology. , 2005, Current opinion in structural biology.

[52]  Andriy Kovalenko,et al.  An MM/3D-RISM approach for ligand binding affinities. , 2010, The journal of physical chemistry. B.

[53]  P. Kollman,et al.  A well-behaved electrostatic potential-based method using charge restraints for deriving atomic char , 1993 .

[54]  Anna Vulpetti,et al.  Novel Scoring Functions Comprising QXP, SASA, and Protein Side-Chain Entropy Terms , 2004, J. Chem. Inf. Model..

[55]  H. Grubmüller,et al.  Estimating Absolute Configurational Entropies of Macromolecules: The Minimally Coupled Subspace Approach , 2010, PloS one.

[56]  B. Brooks,et al.  Langevin dynamics of peptides: The frictional dependence of isomerization rates of N‐acetylalanyl‐N′‐methylamide , 1992, Biopolymers.

[57]  Stefan Boresch,et al.  Absolute Binding Free Energies: A Quantitative Approach for Their Calculation , 2003 .

[58]  M. Gilson,et al.  Calculation of Molecular Configuration Integrals , 2003 .

[59]  Luhua Lai,et al.  Further development and validation of empirical scoring functions for structure-based binding affinity prediction , 2002, J. Comput. Aided Mol. Des..

[60]  Wilfred F van Gunsteren,et al.  Principles of carbopeptoid folding: a molecular dynamics simulation study , 2005, Journal of peptide science : an official publication of the European Peptide Society.

[61]  Leo Radom,et al.  Harmonic Vibrational Frequencies: An Evaluation of Hartree−Fock, Møller−Plesset, Quadratic Configuration Interaction, Density Functional Theory, and Semiempirical Scale Factors , 1996 .

[62]  Junmei Wang,et al.  Development and testing of a general amber force field , 2004, J. Comput. Chem..

[63]  M. Gilson,et al.  Calculation of cyclodextrin binding affinities: energy, entropy, and implications for drug design. , 2004, Biophysical journal.

[64]  Michael K Gilson,et al.  Extraction of configurational entropy from molecular simulations via an expansion approximation. , 2007, The Journal of chemical physics.

[65]  P. Kollman,et al.  Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. , 2000, Accounts of chemical research.

[66]  William L. Jorgensen,et al.  Free energy calculations: a breakthrough for modeling organic chemistry in solution , 1989 .

[67]  Tingjun Hou,et al.  Assessing the Performance of the MM/PBSA and MM/GBSA Methods. 1. The Accuracy of Binding Free Energy Calculations Based on Molecular Dynamics Simulations , 2011, J. Chem. Inf. Model..

[68]  G. G. Wood,et al.  A flexible approach for understanding protein stability , 2004, FEBS letters.

[69]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[70]  R. Mannella,et al.  Langevin stabilization of molecular-dynamics simulations of polymers by means of quasisymplectic algorithms. , 2007, The Journal of chemical physics.

[71]  Michael K. Gilson,et al.  Tork: Conformational analysis method for molecules and complexes , 2003, J. Comput. Chem..

[72]  Helmut Grubmüller,et al.  Adaptive anisotropic kernels for nonparametric estimation of absolute configurational entropies in high-dimensional configuration spaces. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[73]  R. Skeel,et al.  Langevin stabilization of molecular dynamics , 2001 .

[74]  L Liu,et al.  A study on the enthalpy-entropy compensation in protein unfolding. , 2000, Biophysical chemistry.

[75]  Harshinder Singh,et al.  Nearest‐neighbor nonparametric method for estimating the configurational entropy of complex molecules , 2007, J. Comput. Chem..

[76]  H. Meirovitch Recent developments in methodologies for calculating the entropy and free energy of biological systems by computer simulation. , 2007, Current opinion in structural biology.

[77]  S. Jusuf,et al.  Configurational entropy and cooperativity between ligand binding and dimerization in glycopeptide antibiotics. , 2003, Journal of the American Chemical Society.

[78]  Jun Tan,et al.  Efficient calculation of configurational entropy from molecular simulations by combining the mutual‐information expansion and nearest‐neighbor methods , 2008, J. Comput. Chem..

[79]  Tingjun Hou,et al.  Aqueous Solubility Prediction Based on Weighted Atom Type Counts and Solvent Accessible Surface Areas , 2009, J. Chem. Inf. Model..

[80]  U. Ryde,et al.  Ligand affinities predicted with the MM/PBSA method: dependence on the simulation method and the force field. , 2006, Journal of Medicinal Chemistry.