Molecular Interactions and Forces that Make Proteins Stable: A Quantitative Inventory from Atomistic Molecular Dynamics Simulations

Protein design requires a deep control of protein folding energetics, which can be determined experimentally on a case-by-case basis but is not understood in sufficient detail. Calorimetry, protein engineering and biophysical modeling have outlined the fundamentals of protein stability, but these approaches face difficulties in elucidating the specific contributions of the intervening molecules and elementary interactions to the folding energy balance. Recently, we showed that, using Molecular Dynamics (MD) simulations of native proteins and their unfolded ensembles, one can calculate, within experimental error, the enthalpy and heat capacity changes of the folding reaction. Analyzing MD simulations of four model proteins (CI2, barnase, SNase and apoflavodoxin) whose folding enthalpy and heat capacity changes have been successfully calculated, we dissect here the energetic contributions to protein stability made by the different molecular players (polypeptide and solvent molecules) and elementary interactions (electrostatic, van der Waals and bonded) involved. Although the proteins analyzed differ in length (65-168 amino acid residues), isoelectric point (4.0-8.99) and overall fold, their folding energetics is governed by the same quantitative pattern. Relative to the unfolded ensemble, the native conformation is enthalpically stabilized by comparable contributions from protein-protein and solvent-solvent interactions, and it is nearly equally destabilized by interactions between protein and solvent molecules. From the perspective of elementary physical interactions, the native conformation is stabilized by van de Waals and coulombic interactions and is destabilized by bonded interactions. Also common to the four proteins, the sign of the heat capacity change is set by protein-solvent interactions or, from the alternative perspective, by coulombic interactions.

[1]  J. Sancho,et al.  Calculation of Protein Folding Thermodynamics Using Molecular Dynamics Simulations , 2023, bioRxiv.

[2]  J. Sancho,et al.  Protposer: The web server that readily proposes protein stabilizing mutations with high PPV , 2022, Computational and structural biotechnology journal.

[3]  A. Rehman,et al.  Extensive evaluation of environment-specific force field for ordered and disordered proteins. , 2021, Physical chemistry chemical physics : PCCP.

[4]  M. Michael Gromiha,et al.  ProThermDB: thermodynamic database for proteins and mutants revisited after 15 years , 2020, Nucleic Acids Res..

[5]  K. Liedl,et al.  Polarizable and non-polarizable force fields: Protein folding, unfolding, and misfolding. , 2020, The Journal of chemical physics.

[6]  Douglas E. V. Pires,et al.  ThermoMutDB: a thermodynamic database for missense mutations , 2020, Nucleic Acids Res..

[7]  Paul Robustelli,et al.  Development of a force field for the simulation of single-chain proteins and protein-protein complexes. , 2020, Journal of chemical theory and computation.

[8]  K. Lindorff-Larsen,et al.  Biophysical and Mechanistic Models for Disease-Causing Protein Variants. , 2019, Trends in biochemical sciences.

[9]  R. Raines,et al.  Secondary Forces in Protein Folding. , 2019, ACS chemical biology.

[10]  Weiliang Zhu,et al.  Selective enhanced sampling in dihedral energy facilitates overcoming the dihedral energy increase in protein folding and accelerates the searching for protein native structure. , 2019, Physical chemistry chemical physics : PCCP.

[11]  R. Best,et al.  Evolution of All-Atom Protein Force Fields to Improve Local and Global Properties. , 2019, The journal of physical chemistry letters.

[12]  Valerie Daggett,et al.  Visualizing Protein Folding and Unfolding. , 2019, Journal of molecular biology.

[13]  David Baker,et al.  What has de novo protein design taught us about protein folding and biophysics? , 2019, Protein science : a publication of the Protein Society.

[14]  Kresten Lindorff-Larsen,et al.  Biophysical experiments and biomolecular simulations: A perfect match? , 2018, Science.

[15]  Paul Robustelli,et al.  Developing a molecular dynamics force field for both folded and disordered protein states , 2018, Proceedings of the National Academy of Sciences.

[16]  Jason W. Chin,et al.  Expanding and reprogramming the genetic code , 2017, Nature.

[17]  S. Boyken,et al.  The coming of age of de novo protein design , 2016, Nature.

[18]  Hyungdon Yun,et al.  Unnatural amino acid mutagenesis-based enzyme engineering. , 2015, Trends in biotechnology.

[19]  Paul Robustelli,et al.  Water dispersion interactions strongly influence simulated structural properties of disordered protein states. , 2015, The journal of physical chemistry. B.

[20]  R. Best,et al.  Balanced Protein–Water Interactions Improve Properties of Disordered Proteins and Non-Specific Protein Association , 2014, Journal of chemical theory and computation.

[21]  Stefano Piana,et al.  Assessing the accuracy of physical models used in protein-folding simulations: quantitative evidence from long molecular dynamics simulations. , 2014, Current opinion in structural biology.

[22]  J. Sancho The stability of 2-state, 3-state and more-state proteins from simple spectroscopic techniques... plus the structure of the equilibrium intermediates at the same time. , 2013, Archives of biochemistry and biophysics.

[23]  K. Dill,et al.  The Protein-Folding Problem, 50 Years On , 2012, Science.

[24]  Kresten Lindorff-Larsen,et al.  Protein folding kinetics and thermodynamics from atomistic simulation , 2012, Proceedings of the National Academy of Sciences.

[25]  R. Best Atomistic molecular simulations of protein folding. , 2012, Current opinion in structural biology.

[26]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[27]  M. Rooman,et al.  PoPMuSiC 2.1: a web server for the estimation of protein stability changes upon mutation and sequence optimality , 2011, BMC Bioinformatics.

[28]  R. Dror,et al.  Improved side-chain torsion potentials for the Amber ff99SB protein force field , 2010, Proteins.

[29]  Javier Sancho,et al.  ProtSA: a web application for calculating sequence specific protein solvent accessibilities in the unfolded ensemble , 2009, BMC Bioinformatics.

[30]  Y. Duan,et al.  Folding free-energy landscape of villin headpiece subdomain from molecular dynamics simulations , 2007, Proceedings of the National Academy of Sciences.

[31]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[32]  K. Sharp,et al.  Heat capacity in proteins. , 2005, Annual review of physical chemistry.

[33]  J. Sancho,et al.  A double-deletion method to quantifying incremental binding energies in proteins from experiment: example of a destabilizing hydrogen bonding pair. , 2004, Biophysical journal.

[34]  Alexander D. MacKerell Empirical force fields for biological macromolecules: Overview and issues , 2004, J. Comput. Chem..

[35]  Alexander D. MacKerell,et al.  Extending the treatment of backbone energetics in protein force fields: Limitations of gas‐phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations , 2004, J. Comput. Chem..

[36]  Themis Lazaridis,et al.  Thermodynamics of protein folding: a microscopic view. , 2002, Biophysical chemistry.

[37]  K. Sharp,et al.  Heat Capacity Changes Accompanying Hydrophobic and Ionic Solvation: A Monte-Carlo and Random Network Model Study , 2001 .

[38]  R. Hartley,et al.  Refinement and structural analysis of barnase at 1.5 A resolution. , 1999, Acta crystallographica. Section D, Biological crystallography.

[39]  I. Kullik,et al.  Deletion of the Alternative Sigma Factor ςB in Staphylococcus aureus Reveals Its Function as a Global Regulator of Virulence Genes , 1998, Journal of bacteriology.

[40]  Andrew D. Robertson,et al.  Protein Structure and the Energetics of Protein Stability. , 1997, Chemical reviews.

[41]  A. Horovitz,et al.  Double-mutant cycles: a powerful tool for analyzing protein structure and function. , 1996, Folding & design.

[42]  V. Hilser,et al.  The enthalpy change in protein folding and binding: Refinement of parameters for structure‐based calculations , 1996, Proteins.

[43]  Javier Sancho,et al.  Closure of a tyrosine/tryptophan aromatic gate leads to a compact fold in apo flavodoxin , 1996, Nature Structural Biology.

[44]  P. Privalov,et al.  Contribution of hydration and non-covalent interactions to the heat capacity effect on protein unfolding. , 1992, Journal of molecular biology.

[45]  K. P. Murphy,et al.  Solid model compounds and the thermodynamics of protein unfolding. , 1991, Journal of molecular biology.

[46]  M. Fillat,et al.  Structural and chemical properties of a flavodoxin from Anabaena PCC 7119. , 1990, Biochimica et biophysica acta.

[47]  P. Privalov,et al.  Heat capacity of proteins. I. Partial molar heat capacity of individual amino acid residues in aqueous solution: hydration effect. , 1990, Journal of molecular biology.

[48]  K. P. Murphy,et al.  Common features of protein unfolding and dissolution of hydrophobic compounds. , 1990, Science.

[49]  M. James,et al.  Crystal and molecular structure of the serine proteinase inhibitor CI-2 from barley seeds. , 1988, Biochemistry.

[50]  W. J. Becktel,et al.  Protein stability curves , 1987, Biopolymers.

[51]  R. L. Baldwin,et al.  Temperature dependence of the hydrophobic interaction in protein folding. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[52]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[53]  I. Svendsen,et al.  Characteristics of Hiproly barley III. Amino acid sequences of two lysine-rich proteins , 1980 .

[54]  F. A. Cotton,et al.  Staphylococcal nuclease: proposed mechanism of action based on structure of enzyme-thymidine 3',5'-bisphosphate-calcium ion complex at 1.5-A resolution. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[55]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[56]  R. Hartley,et al.  Amino-acid sequence of extracellular ribonuclease (barnase) of Bacillus amyloliquefaciens. , 1972, Nature: New biology.

[57]  J. Sancho,et al.  Accurate Calculation of Barnase and SNase Folding Energetics Using Short Molecular Dynamics Simulations and an Atomistic Model of the Unfolded Ensemble: Evaluation of Force Fields and Water Models , 2019, J. Chem. Inf. Model..

[58]  Aruna Rajan Analysis of Molecular Dynamics Simulations of Protein Folding , 2009 .

[59]  P. Privalov,et al.  Stability of protein structure and hydrophobic interaction. , 1988, Advances in protein chemistry.