Integrated prediction of protein folding and unfolding rates from only size and structural class.

Protein stability, folding and unfolding rates are all determined by the multidimensional folding free energy surface, which in turn is dictated by factors such as size, structure, and amino-acid sequence. Work over the last 15 years has highlighted the role of size and 3D structure in determining folding rates, resulting in many procedures for their prediction. In contrast, unfolding rates are thought to depend on sequence specifics and be much more difficult to predict. Here we introduce a minimalist physics-based model that computes one-dimensional folding free energy surfaces using the number of aminoacids (N) and the structural class (α-helical, all-β, or α-β) as only protein-specific input. In this model N sets the overall cost in conformational entropy and the net stabilization energy, whereas the structural class defines the partitioning of the stabilization energy between local and non-local interactions. To test its predictive power, we calibrated the model empirically and implemented it into an algorithm for the PREdiction of Folding and Unfolding Rates (PREFUR). We found that PREFUR predicts the absolute folding and unfolding rates of an experimental database of 52 proteins with accuracies of ±0.7 and ±1.4 orders of magnitude, respectively (relative to experimental spans of 6 and 8 orders of magnitude). Such prediction uncertainty for proteins vastly varying in size and structure is only two-fold larger than the differences in folding (±0.34) and unfolding rates (±0.7) caused by single-point mutations. Moreover, PREFUR predicts protein stability with an accuracy of ±6.3 kJ mol(-1), relative to the 5 kJ mol(-1) average perturbation induced by single-point mutations. The remarkable performance of our simplistic model demonstrates that size and structural class are the major determinants of the folding landscapes of natural proteins, whereas sequence variability only provides the final 10-20% tuning. PREFUR is thus a powerful bioinformatic tool for the prediction of folding properties and analysis of experimental data.

[1]  A. Fersht,et al.  Phi-analysis of the folding of the B domain of protein A using multiple optical probes. , 2006, Journal of molecular biology.

[2]  Andrew D. Robertson,et al.  Protein Structure and the Energetics of Protein Stability. , 1997, Chemical reviews.

[3]  L. Gregoret,et al.  Role of a solvent‐exposed aromatic cluster in the folding of Escherichia coli CspA , 2000, Protein science : a publication of the Protein Society.

[4]  D. Baker,et al.  Contact order, transition state placement and the refolding rates of single domain proteins. , 1998, Journal of molecular biology.

[5]  Gerhard Hummer,et al.  Coordinate-dependent diffusion in protein folding , 2009, Proceedings of the National Academy of Sciences.

[6]  Hue Sun Chan,et al.  Contact order dependent protein folding rates: Kinetic consequences of a cooperative interplay between favorable nonlocal interactions and local conformational preferences , 2003, Proteins.

[7]  N. Go,et al.  Studies on protein folding, unfolding and fluctuations by computer simulation. I. The effect of specific amino acid sequence represented by specific inter-unit interactions. , 2009 .

[8]  Luis Serrano,et al.  Elucidating the folding problem of helical peptides using empirical parameters , 1994, Nature Structural Biology.

[9]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[10]  E. Shakhnovich,et al.  Chain Length Scaling of Protein Folding Time. , 1996, Physical review letters.

[11]  V. Muñoz,et al.  Experimental Identification of Downhill Protein Folding , 2002, Science.

[12]  D Baker,et al.  Folding dynamics of the src SH3 domain. , 1997, Biochemistry.

[13]  D. Baker,et al.  Critical role of β-hairpin formation in protein G folding , 2000, Nature Structural Biology.

[14]  C. Dobson,et al.  Thermodynamics and kinetics of folding of common-type acylphosphatase: comparison to the highly homologous muscle isoenzyme. , 1999, Biochemistry.

[15]  T. Kiefhaber,et al.  Folding of the disulfide-bonded beta-sheet protein tendamistat: rapid two-state folding without hydrophobic collapse. , 1997, Journal of molecular biology.

[16]  A. Fersht,et al.  Ultra-fast barrier-limited folding in the peripheral subunit-binding domain family. , 2005, Journal of molecular biology.

[17]  Mohamed A. Marahiel,et al.  Conservation of rapid two-state folding in mesophilic, thermophilic and hyperthermophilic cold shock proteins , 1998, Nature Structural Biology.

[18]  K. Simons,et al.  Local interactions and the optimization of protein folding , 1997, Proteins.

[19]  H. Chan,et al.  Theoretical and experimental demonstration of the importance of specific nonnative interactions in protein folding , 2008, Proceedings of the National Academy of Sciences.

[20]  Athi N. Naganathan,et al.  Protein folding kinetics: barrier effects in chemical and thermal denaturation experiments. , 2007, Journal of the American Chemical Society.

[21]  Kevin W Plaxco,et al.  The topomer search model: A simple, quantitative theory of two‐state protein folding kinetics , 2003, Protein science : a publication of the Protein Society.

[22]  J. Onuchic,et al.  DIFFUSIVE DYNAMICS OF THE REACTION COORDINATE FOR PROTEIN FOLDING FUNNELS , 1996, cond-mat/9601091.

[23]  I D Campbell,et al.  Folding kinetics of the SH3 domain of PI3 kinase by real-time NMR combined with optical spectroscopy. , 1998, Journal of molecular biology.

[24]  D. Raleigh,et al.  Analysis of the pH-dependent folding and stability of histidine point mutants allows characterization of the denatured state and transition state for protein folding. , 2005, Journal of molecular biology.

[25]  Kevin W Plaxco,et al.  Contact order revisited: Influence of protein size on the folding rate , 2003, Protein science : a publication of the Protein Society.

[26]  M. Marahiel,et al.  Extremely rapid protein folding in the absence of intermediates , 1995, Nature Structural Biology.

[27]  E. Cota,et al.  Folding studies of immunoglobulin-like beta-sandwich proteins suggest that they share a common folding pathway. , 1999, Structure.

[28]  Athi N. Naganathan,et al.  Direct observation of downhill folding of lambda-repressor in a microfluidic mixer. , 2009, Biophysical journal.

[29]  C M Dobson,et al.  Slow folding of muscle acylphosphatase in the absence of intermediates. , 1998, Journal of molecular biology.

[30]  Athi N. Naganathan,et al.  Navigating the downhill protein folding regime via structural homologues. , 2010, Journal of the American Chemical Society.

[31]  L Serrano,et al.  The SH3-fold family: experimental evidence and prediction of variations in the folding pathways. , 2000, Journal of molecular biology.

[32]  Victor Muñoz,et al.  Protein folding rates and stability: how much is there beyond size? , 2009, Journal of the American Chemical Society.

[33]  D. Thirumalai,et al.  Thermal denaturation and folding rates of single domain proteins: size matters , 2003, q-bio/0310020.

[34]  Cecilia Clementi,et al.  Coarse-grained models of protein folding: toy models or predictive tools? , 2008, Current opinion in structural biology.

[35]  Athi N. Naganathan,et al.  Scaling of folding times with protein size. , 2005, Journal of the American Chemical Society.

[36]  Victor Muñoz,et al.  Atom-by-atom analysis of global downhill protein folding , 2006, Nature.

[37]  D Baker,et al.  Topology, stability, sequence, and length: defining the determinants of two-state protein folding kinetics. , 2000, Biochemistry.

[38]  Charles L. Brooks,et al.  Insights from Coarse-Grained Gō Models for Protein Folding and Dynamics , 2009, International journal of molecular sciences.

[39]  L. Mirny,et al.  Protein folding theory: from lattice to all-atom models. , 2001, Annual review of biophysics and biomolecular structure.

[40]  J. Clarke,et al.  The effect of boundary selection on the stability and folding of the third fibronectin type III domain from human tenascin. , 1998, Biochemistry.

[41]  T. Kiefhaber,et al.  Hammond behavior versus ground state effects in protein folding: evidence for narrow free energy barriers and residual structure in unfolded states. , 2003, Journal of molecular biology.

[42]  J. Onuchic,et al.  Funnels, pathways, and the energy landscape of protein folding: A synthesis , 1994, Proteins.

[43]  John Karanicolas,et al.  Improved Gō-like models demonstrate the robustness of protein folding mechanisms towards non-native interactions. , 2003, Journal of molecular biology.

[44]  A. Finkelstein,et al.  A theoretical search for folding/unfolding nuclei in three-dimensional protein structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Kevin W Plaxco,et al.  Protein folding: Defining a “standard” set of experimental conditions and a preliminary kinetic data set of two‐state proteins , 2005, Protein science : a publication of the Protein Society.

[46]  D Baker,et al.  A breakdown of symmetry in the folding transition state of protein L. , 2000, Journal of molecular biology.

[47]  Valentina Tozzini,et al.  Coarse-grained models for proteins. , 2005, Current opinion in structural biology.

[48]  L Serrano,et al.  Structure of the transition state in the folding process of human procarboxypeptidase A2 activation domain. , 1998, Journal of molecular biology.

[49]  Valerie Daggett,et al.  Unifying features in protein-folding mechanisms , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[50]  V Muñoz,et al.  Local versus nonlocal interactions in protein folding and stability--an experimentalist's point of view. , 1996, Folding & design.

[51]  P. Wolynes,et al.  Folding funnels and energy landscapes of larger proteins within the capillarity approximation. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[52]  D. Thirumalai,et al.  From Minimal Models to Real Proteins: Time Scales for Protein Folding Kinetics , 1995 .

[53]  Hongyi Zhou,et al.  Folding rate prediction using total contact distance. , 2002, Biophysical journal.

[54]  M. Oliveberg,et al.  High-energy channeling in protein folding. , 1997, Biochemistry.

[55]  L Serrano,et al.  Thermodynamic and kinetic analysis of the SH3 domain of spectrin shows a two-state folding transition. , 1994, Biochemistry.

[56]  Athi N. Naganathan,et al.  Direct measurement of barrier heights in protein folding. , 2005, Journal of the American Chemical Society.

[57]  A. Fersht,et al.  Folding of chymotrypsin inhibitor 2. 1. Evidence for a two-state transition. , 1991, Biochemistry.

[58]  I D Campbell,et al.  The folding kinetics and thermodynamics of the Fyn-SH3 domain. , 1998, Biochemistry.

[59]  M. Gromiha,et al.  Importance of long-range interactions in protein folding. , 1999, Biophysical chemistry.

[60]  Athi N. Naganathan,et al.  Insights into protein folding mechanisms from large scale analysis of mutational effects , 2010, Proceedings of the National Academy of Sciences.

[61]  V. Muñoz,et al.  Expanding the realm of ultrafast protein folding: gpW, a midsize natural single-domain with alpha+beta topology that folds downhill. , 2008, Journal of the American Chemical Society.

[62]  C M Dobson,et al.  Slow cooperative folding of a small globular protein HPr. , 1998, Biochemistry.

[63]  Athi N. Naganathan,et al.  Dynamics of one-state downhill protein folding , 2009, Proceedings of the National Academy of Sciences.

[64]  K. Dill,et al.  Protein folding in the landscape perspective: Chevron plots and non‐arrhenius kinetics , 1998, Proteins.

[65]  Dennis R Livesay,et al.  On the role of structural class of a protein with two‐state folding kinetics in determining correlations between its size, topology, and folding rate , 2007, Protein science : a publication of the Protein Society.

[66]  M. Michael Gromiha,et al.  A Statistical Method for Predicting Protein Unfolding Rates from Amino Acid Sequence , 2006, J. Chem. Inf. Model..

[67]  J. Portman,et al.  Cooperativity and protein folding rates. , 2010, Current opinion in structural biology.

[68]  Hue Sun Chan,et al.  Probing possible downhill folding: native contact topology likely places a significant constraint on the folding cooperativity of proteins with approximately 40 residues. , 2008, Journal of molecular biology.

[69]  Dmitry N Ivankov,et al.  Chain length is the main determinant of the folding rate for proteins with three‐state folding kinetics , 2003, Proteins.

[70]  V. Muñoz,et al.  A simple model for calculating the kinetics of protein folding from three-dimensional structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[71]  A. Finkelstein,et al.  Prediction of protein folding rates from the amino acid sequence-predicted secondary structure , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[72]  M. Proctor,et al.  Structural changes in the transition state of protein folding: alternative interpretations of curved chevron plots. , 1999, Biochemistry.

[73]  M. Gromiha,et al.  Comparison between long-range interactions and contact order in determining the folding rate of two-state proteins: application of long-range order to folding rate prediction. , 2001, Journal of molecular biology.

[74]  Scattered Hammond plots reveal second level of site-specific information in protein folding: phi' (beta++). , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[75]  A. Fersht,et al.  The denatured state of Engrailed Homeodomain under denaturing and native conditions. , 2003, Journal of molecular biology.

[76]  Martin Gruebele,et al.  Tuning λ6- 85 towards downhill folding at its melting temperature , 2007 .

[77]  I D Campbell,et al.  A comparison of the folding kinetics and thermodynamics of two homologous fibronectin type III modules. , 1997, Journal of molecular biology.

[78]  E. Alm,et al.  Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[79]  Athi N. Naganathan,et al.  Dynamics, energetics, and structure in protein folding. , 2006, Biochemistry.

[80]  A V Finkelstein,et al.  Rate of protein folding near the point of thermodynamic equilibrium between the coil and the most stable chain fold. , 1997, Folding & design.

[81]  S. Jackson,et al.  Folding pathway of FKBP12 and characterisation of the transition state. , 1999, Journal of molecular biology.

[82]  Yawen Bai,et al.  Relationship between the native-state hydrogen exchange and folding pathways of a four-helix bundle protein. , 2002, Biochemistry.

[83]  K Henrick,et al.  Electronic Reprint Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions , 2022 .

[84]  A. Fersht,et al.  Ultrafast folding of WW domains without structured aromatic clusters in the denatured state , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[85]  E I Shakhnovich,et al.  Impact of local and non-local interactions on thermodynamics and kinetics of protein folding. , 1995, Journal of molecular biology.

[86]  J. Onuchic,et al.  Protein folding funnels: the nature of the transition state ensemble. , 1996, Folding & design.