Energy, water, and protein folding: A molecular dynamics‐based quantitative inventory of molecular interactions and forces that make proteins stable

Abstract Protein folding energetics can be determined experimentally on a case‐by‐case basis but it is not understood in sufficient detail to provide deep control in protein design. The fundamentals of protein stability have been outlined by calorimetry, protein engineering, and biophysical modeling, but these approaches still face great difficulty in elucidating the specific contributions of the intervening molecules and physical interactions. Recently, we have shown that the enthalpy and heat capacity changes associated to the protein folding reaction can be calculated within experimental error using molecular dynamics simulations of native protein structures and their corresponding unfolded ensembles. Analyzing in depth molecular dynamics simulations of four model proteins (CI2, barnase, SNase, and apoflavodoxin), we dissect here the energy contributions to ΔH (a key component of protein stability) made by the molecular players (polypeptide and solvent molecules) and physical interactions (electrostatic, van der Waals, and bonded) involved. Although the proteins analyzed differ in length, isoelectric point and fold class, their folding energetics is governed by the same quantitative pattern. Relative to the unfolded ensemble, the native conformations are enthalpically stabilized by comparable contributions from protein–protein and solvent–solvent interactions, and almost equally destabilized by interactions between protein and solvent molecules. The native protein surface seems to interact better with water than the unfolded one, but this is outweighed by the unfolded surface being larger. From the perspective of physical interactions, the native conformations are stabilized by van de Waals and Coulomb interactions and destabilized by conformational strain arising from bonded interactions. Also common to the four proteins, the sign of the heat capacity change is set by interactions between protein and solvent molecules or, from the alternative perspective, by Coulomb interactions.

[1]  H. Kono,et al.  Benchmarking of force fields to characterize the intrinsically disordered R2-FUS-LC region , 2023, Scientific reports.

[2]  T. Cheatham,et al.  Benchmarking the Drude Polarizable Force Field Using the r(GACC) Tetranucleotide , 2023, J. Chem. Inf. Model..

[3]  J. Sancho,et al.  Calculation of Protein Folding Thermodynamics Using Molecular Dynamics Simulations , 2023, bioRxiv.

[4]  Julie C. Mitchell,et al.  Protein folds vs. protein folding: Differing questions, different challenges , 2022, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Sancho,et al.  Protposer: The web server that readily proposes protein stabilizing mutations with high PPV , 2022, Computational and structural biotechnology journal.

[6]  Oriol Vinyals,et al.  Highly accurate protein structure prediction with AlphaFold , 2021, Nature.

[7]  A. Rehman,et al.  Extensive evaluation of environment-specific force field for ordered and disordered proteins. , 2021, Physical chemistry chemical physics : PCCP.

[8]  M. Michael Gromiha,et al.  ProThermDB: thermodynamic database for proteins and mutants revisited after 15 years , 2020, Nucleic Acids Res..

[9]  K. Liedl,et al.  Polarizable and non-polarizable force fields: Protein folding, unfolding, and misfolding. , 2020, The Journal of chemical physics.

[10]  Douglas E. V. Pires,et al.  ThermoMutDB: a thermodynamic database for missense mutations , 2020, Nucleic Acids Res..

[11]  L. Žídek,et al.  Choice of Force Field for Proteins Containing Structured and Intrinsically Disordered Regions. , 2020, Biophysical journal.

[12]  Paul Robustelli,et al.  Development of a force field for the simulation of single-chain proteins and protein-protein complexes. , 2020, Journal of chemical theory and computation.

[13]  Juan José Galano-Frutos,et al.  Accurate Calculation of Barnase and SNase Folding Energetics using short MD simulations and an Atomistic Model of the Unfolded Ensemble. Evaluation of Force Fields and Water Models. , 2019, Journal of chemical information and modeling.

[14]  K. Lindorff-Larsen,et al.  Biophysical and Mechanistic Models for Disease-Causing Protein Variants. , 2019, Trends in biochemical sciences.

[15]  R. Raines,et al.  Secondary Forces in Protein Folding. , 2019, ACS chemical biology.

[16]  Weiliang Zhu,et al.  Selective enhanced sampling in dihedral energy facilitates overcoming the dihedral energy increase in protein folding and accelerates the searching for protein native structure. , 2019, Physical chemistry chemical physics : PCCP.

[17]  R. Best,et al.  Evolution of All-Atom Protein Force Fields to Improve Local and Global Properties. , 2019, The journal of physical chemistry letters.

[18]  Valerie Daggett,et al.  Visualizing Protein Folding and Unfolding. , 2019, Journal of molecular biology.

[19]  David Baker,et al.  What has de novo protein design taught us about protein folding and biophysics? , 2019, Protein science : a publication of the Protein Society.

[20]  C. Royer,et al.  Lessons from pressure denaturation of proteins , 2018, Journal of The Royal Society Interface.

[21]  Kresten Lindorff-Larsen,et al.  Biophysical experiments and biomolecular simulations: A perfect match? , 2018, Science.

[22]  Paul Robustelli,et al.  Developing a molecular dynamics force field for both folded and disordered protein states , 2018, Proceedings of the National Academy of Sciences.

[23]  Jason W. Chin,et al.  Expanding and reprogramming the genetic code , 2017, Nature.

[24]  B. L. de Groot,et al.  CHARMM36m: an improved force field for folded and intrinsically disordered proteins , 2016, Nature Methods.

[25]  S. Boyken,et al.  The coming of age of de novo protein design , 2016, Nature.

[26]  Alexander D. MacKerell,et al.  An Empirical Polarizable Force Field Based on the Classical Drude Oscillator Model: Development History and Recent Applications , 2016, Chemical reviews.

[27]  L. Regan,et al.  Random close packing in protein cores. , 2015, Physical review. E.

[28]  Hyungdon Yun,et al.  Unnatural amino acid mutagenesis-based enzyme engineering. , 2015, Trends in biotechnology.

[29]  Paul Robustelli,et al.  Water dispersion interactions strongly influence simulated structural properties of disordered protein states. , 2015, The journal of physical chemistry. B.

[30]  R. Best,et al.  Balanced Protein–Water Interactions Improve Properties of Disordered Proteins and Non-Specific Protein Association , 2014, Journal of chemical theory and computation.

[31]  Stefano Piana,et al.  Assessing the accuracy of physical models used in protein-folding simulations: quantitative evidence from long molecular dynamics simulations. , 2014, Current opinion in structural biology.

[32]  J. Sancho The stability of 2-state, 3-state and more-state proteins from simple spectroscopic techniques... plus the structure of the equilibrium intermediates at the same time. , 2013, Archives of biochemistry and biophysics.

[33]  K. Dill,et al.  The Protein-Folding Problem, 50 Years On , 2012, Science.

[34]  Kresten Lindorff-Larsen,et al.  Protein folding kinetics and thermodynamics from atomistic simulation , 2012, Proceedings of the National Academy of Sciences.

[35]  R. Best Atomistic molecular simulations of protein folding. , 2012, Current opinion in structural biology.

[36]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[37]  M. Rooman,et al.  PoPMuSiC 2.1: a web server for the estimation of protein stability changes upon mutation and sequence optimality , 2011, BMC Bioinformatics.

[38]  Klaus Schulten,et al.  High-performance scalable molecular dynamics simulations of a polarizable force field based on classical Drude oscillators in NAMD. , 2011, The journal of physical chemistry letters.

[39]  Javier Sancho,et al.  ProtSA: a web application for calculating sequence specific protein solvent accessibilities in the unfolded ensemble , 2009, BMC Bioinformatics.

[40]  Y. Duan,et al.  Folding free-energy landscape of villin headpiece subdomain from molecular dynamics simulations , 2007, Proceedings of the National Academy of Sciences.

[41]  Yi Liu,et al.  RosettaDesign server for protein design , 2006, Nucleic Acids Res..

[42]  Alexander D. MacKerell,et al.  A polarizable model of water for molecular dynamics simulations of biomolecules , 2006 .

[43]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[44]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..

[45]  K. Sharp,et al.  Heat capacity in proteins. , 2005, Annual review of physical chemistry.

[46]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[47]  J. Sancho,et al.  A double-deletion method to quantifying incremental binding energies in proteins from experiment: example of a destabilizing hydrogen bonding pair. , 2004, Biophysical journal.

[48]  Alexander D. MacKerell Empirical force fields for biological macromolecules: Overview and issues , 2004, J. Comput. Chem..

[49]  Alexander D. MacKerell,et al.  Extending the treatment of backbone energetics in protein force fields: Limitations of gas‐phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations , 2004, J. Comput. Chem..

[50]  Themis Lazaridis,et al.  Thermodynamics of protein folding: a microscopic view. , 2002, Biophysical chemistry.

[51]  K. Sharp,et al.  Heat Capacity Changes Accompanying Hydrophobic and Ionic Solvation: A Monte-Carlo and Random Network Model Study , 2001 .

[52]  R. Hartley,et al.  Refinement and structural analysis of barnase at 1.5 A resolution. , 1999, Acta crystallographica. Section D, Biological crystallography.

[53]  Andrew D. Robertson,et al.  Protein Structure and the Energetics of Protein Stability. , 1997, Chemical reviews.

[54]  A. Horovitz,et al.  Double-mutant cycles: a powerful tool for analyzing protein structure and function. , 1996, Folding & design.

[55]  V. Hilser,et al.  The enthalpy change in protein folding and binding: Refinement of parameters for structure‐based calculations , 1996, Proteins.

[56]  Javier Sancho,et al.  Closure of a tyrosine/tryptophan aromatic gate leads to a compact fold in apo flavodoxin , 1996, Nature Structural Biology.

[57]  V. Hilser,et al.  The heat capacity of proteins , 1995, Proteins.

[58]  M Gerstein,et al.  Volume changes on protein folding. , 1994, Structure.

[59]  P. Privalov,et al.  Contribution of hydration to protein folding thermodynamics. II. The entropy and Gibbs energy of hydration. , 1993, Journal of molecular biology.

[60]  P. Privalov,et al.  Contribution of hydration and non-covalent interactions to the heat capacity effect on protein unfolding. , 1992, Journal of molecular biology.

[61]  K. P. Murphy,et al.  Solid model compounds and the thermodynamics of protein unfolding. , 1991, Journal of molecular biology.

[62]  M. Fillat,et al.  Structural and chemical properties of a flavodoxin from Anabaena PCC 7119. , 1990, Biochimica et biophysica acta.

[63]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[64]  P. Privalov,et al.  Heat capacity of proteins. I. Partial molar heat capacity of individual amino acid residues in aqueous solution: hydration effect. , 1990, Journal of molecular biology.

[65]  K. P. Murphy,et al.  Common features of protein unfolding and dissolution of hydrophobic compounds. , 1990, Science.

[66]  M. James,et al.  Crystal and molecular structure of the serine proteinase inhibitor CI-2 from barley seeds. , 1988, Biochemistry.

[67]  W. J. Becktel,et al.  Protein stability curves , 1987, Biopolymers.

[68]  R. L. Baldwin,et al.  Temperature dependence of the hydrophobic interaction in protein folding. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[69]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[70]  D. Shortle A genetic system for analysis of staphylococcal nuclease. , 1983, Gene.

[71]  I. Svendsen,et al.  Characteristics of Hiproly barley III. Amino acid sequences of two lysine-rich proteins , 1980 .

[72]  F. A. Cotton,et al.  Staphylococcal nuclease: proposed mechanism of action based on structure of enzyme-thymidine 3',5'-bisphosphate-calcium ion complex at 1.5-A resolution. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[73]  F. Richards The interpretation of protein structures: total volume, group volume distributions and packing density. , 1974, Journal of molecular biology.

[74]  W. Kauzmann,et al.  Pressure denaturation of metmyoglobin. , 1973, Biochemistry.

[75]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[76]  R. Hartley,et al.  Amino-acid sequence of extracellular ribonuclease (barnase) of Bacillus amyloliquefaciens. , 1972, Nature: New biology.

[77]  S. Hawley,et al.  Reversible pressure--temperature denaturation of chymotrypsinogen. , 1971, Biochemistry.

[78]  J. Brandts,et al.  Thermodynamics of protein denaturation. Effect of pressu on the denaturation of ribonuclease A. , 1970, Biochemistry.

[79]  R. Best Analysis of Molecular Dynamics Simulations of Protein Folding. , 2022, Methods in molecular biology.

[80]  P. Privalov,et al.  Energetics of protein structure. , 1995, Advances in protein chemistry.

[81]  M Karplus,et al.  Enthalpic contribution to protein stability: insights from atom-based calculations and statistical mechanics. , 1995, Advances in protein chemistry.

[82]  S. N. Timasheff,et al.  The control of protein stability and association by weak interactions with water: how do solvents affect these processes? , 1993, Annual review of biophysics and biomolecular structure.

[83]  P. Privalov,et al.  Stability of protein structure and hydrophobic interaction. , 1988, Advances in protein chemistry.