Early Folding Events, Local Interactions, and Conservation of Protein Backbone Rigidity.

Protein folding is in its early stages largely determined by the protein sequence and complex local interactions between amino acids, resulting in lower energy conformations that provide the context for further folding into the native state. We compiled a comprehensive data set of early folding residues based on pulsed labeling hydrogen deuterium exchange experiments. These early folding residues have corresponding higher backbone rigidity as predicted by DynaMine from sequence, an effect also present when accounting for the secondary structures in the folded protein. We then show that the amino acids involved in early folding events are not more conserved than others, but rather, early folding fragments and the secondary structure elements they are part of show a clear trend toward conserving a rigid backbone. We therefore propose that backbone rigidity is a fundamental physical feature conserved by proteins that can provide important insights into their folding mechanisms and stability.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  Yawen Bai,et al.  The folding pathway of T4 lysozyme: an on-pathway hidden folding intermediate. , 2007, Journal of molecular biology.

[3]  U. Arnold,et al.  The folding pathway of onconase is directed by a conserved intermediate. , 2009, Biochemistry.

[4]  S. Wodak,et al.  Relations between protein sequence and structure and their significance. , 1990, Journal of molecular biology.

[5]  Valerie Daggett,et al.  Dynameomics: Large‐scale assessment of native protein flexibility , 2008, Protein science : a publication of the Protein Society.

[6]  Aleksey A. Porollo,et al.  POLYVIEW: a flexible visualization tool for structural and functional annotations of proteins , 2004, Bioinform..

[7]  K Wüthrich,et al.  Protein folding kinetics by combined use of rapid mixing techniques and NMR observation of individual amide protons , 1986, Proteins.

[8]  B. Erman,et al.  Information‐theoretical entropy as a measure of sequence variability , 1991, Proteins.

[9]  B. Jones,et al.  Early intermediates in the folding of dihydrofolate reductase from escherichia coli detected by hydrogen exchange and NMR , 1995, Protein science : a publication of the Protein Society.

[10]  K. Dill,et al.  The Protein-Folding Problem, 50 Years On , 2012, Science.

[11]  R. L. Baldwin,et al.  Early folding intermediate of ribonuclease A. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[12]  M. Oliveberg,et al.  Malleability of protein folding pathways: a simple reason for complex behaviour. , 2007, Current opinion in structural biology.

[13]  Joseph A. Bank,et al.  Supporting Online Material Materials and Methods Figs. S1 to S10 Table S1 References Movies S1 to S3 Atomic-level Characterization of the Structural Dynamics of Proteins , 2022 .

[14]  B. Rost Review: protein secondary structure prediction continues to rise. , 2001, Journal of structural biology.

[15]  P Fariselli,et al.  An entropy criterion to detect minimally frustrated intermediates in native proteins. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[16]  H. Bull,et al.  Surface tension of amino acid solutions: a hydrophobicity scale of the amino acid residues. , 1974, Archives of biochemistry and biophysics.

[17]  Peter Tompa,et al.  Start2Fold: a database of hydrogen/deuterium exchange data on protein folding and stability , 2015, Nucleic Acids Res..

[18]  Ming Xu,et al.  Ultrafast hydrogen exchange reveals specific structural events during the initial stages of folding of cytochrome c. , 2014, Journal of the American Chemical Society.

[19]  H. Dyson,et al.  Absence of a stable intermediate on the folding pathway of protein A , 1997, Protein science : a publication of the Protein Society.

[20]  Beat Kleiner,et al.  Graphical Methods for Data Analysis , 1983 .

[21]  Silvio C. E. Tosatto,et al.  Comprehensive large-scale assessment of intrinsic protein disorder , 2015, Bioinform..

[22]  Jane Clarke,et al.  Take home lessons from studies of related proteins , 2013, Current opinion in structural biology.

[23]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[24]  Christopher M. Dobson,et al.  Demonstration by NMR of folding domains in lysozyme , 1991, Nature.

[25]  Tom Lenaerts,et al.  From protein sequence to dynamics and disorder with DynaMine , 2013, Nature Communications.

[26]  Michele Vendruscolo,et al.  Structural interpretation of hydrogen exchange protection factors in proteins: characterization of the native state fluctuations of CI2. , 2006, Structure.

[27]  M. J. Parker,et al.  Acquisition of native beta-strand topology during the rapid collapse phase of protein folding. , 1997, Biochemistry.

[28]  David S. Wishart,et al.  Application of the random coil index to studying protein flexibility , 2008, Journal of biomolecular NMR.

[29]  Chin Yu,et al.  Refolding of a small all beta-sheet protein proceeds with accumulation of kinetic intermediates. , 2006, Archives of biochemistry and biophysics.

[30]  L. Mayne,et al.  The nature of protein folding pathways , 2014, Proceedings of the National Academy of Sciences.

[31]  C. Pace,et al.  Investigation of ribonuclease T1 folding intermediates by hydrogen-deuterium amide exchange-two-dimensional NMR spectroscopy. , 1993, Biochemistry.

[32]  E. Shakhnovich,et al.  Conserved residues and the mechanism of protein folding , 1996, Nature.

[33]  Piero Fariselli,et al.  The s2D method: simultaneous sequence-based prediction of the statistical populations of ordered and disordered regions in proteins. , 2015, Journal of molecular biology.

[34]  T. Petersen,et al.  A generic method for assignment of reliability scores applied to solvent accessibility predictions , 2009, BMC Structural Biology.

[35]  J. Hofrichter,et al.  The protein folding 'speed limit'. , 2004, Current opinion in structural biology.

[36]  H. Roder,et al.  Early hydrogen-bonding events in the folding reaction of ubiquitin. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Jayant B Udgaonkar,et al.  Polypeptide chain collapse and protein folding. , 2013, Archives of biochemistry and biophysics.

[38]  Ken A Dill,et al.  General mechanism of two-state protein folding kinetics. , 2014, Journal of the American Chemical Society.

[39]  Silvio C. E. Tosatto,et al.  ESpritz: accurate and fast prediction of protein disorder , 2012, Bioinform..

[40]  John L Markley,et al.  Early formation of a beta hairpin during folding of staphylococcal nuclease H124L as detected by pulsed hydrogen exchange , 2002, Protein science : a publication of the Protein Society.

[41]  A. D. Robertson,et al.  Microsecond protein folding kinetics from native-state hydrogen exchange. , 1997, Biochemistry.

[42]  Christoph H. Borchers,et al.  Characterizing short-lived protein folding intermediates by top-down hydrogen exchange mass spectrometry. , 2010, Analytical chemistry.

[43]  H. B. Mann,et al.  On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .

[44]  Nicolas Doucet,et al.  Conservation of Flexible Residue Clusters among Structural and Functional Enzyme Homologues* , 2012, The Journal of Biological Chemistry.

[45]  Shoshana J. Wodak,et al.  Identification of predictive sequence motifs limited by protein structure data base size , 1988, Nature.

[46]  M J Rooman,et al.  Extracting information on folding from the amino acid sequence: consensus regions with preferred conformation in homologous proteins. , 1992, Biochemistry.

[47]  B. Schuler,et al.  Single-molecule spectroscopy of protein folding dynamics--expanding scope and timescales. , 2013, Current opinion in structural biology.

[48]  Victor Muñoz,et al.  Protein folding rates and stability: how much is there beyond size? , 2009, Journal of the American Chemical Society.

[49]  Andrew L. Lee,et al.  Conservation of side-chain dynamics within a protein family. , 2009, Journal of the American Chemical Society.

[50]  Zoran Obradovic,et al.  Improving Sequence Alignments For Intrinsically Disordered Proteins , 2001, Pacific Symposium on Biocomputing.

[51]  E I Shakhnovich,et al.  Folding kinetics of villin 14T, a protein domain with a central beta-sheet and two hydrophobic cores. , 1998, Biochemistry.

[52]  Tom Lenaerts,et al.  The DynaMine webserver: predicting protein dynamics from sequence , 2014, Nucleic Acids Res..

[53]  Chin Yu,et al.  Events in the Kinetic Folding Pathway of a Small, All β-Sheet Protein* , 1998, The Journal of Biological Chemistry.

[54]  Hui Chen,et al.  Secondary structure length as a determinant of folding rate of proteins with two‐ and three‐state kinetics , 2007, Proteins.

[55]  E. Meiering,et al.  Conserved and nonconserved features of the folding pathway of hisactophilin, a β‐trefoil protein , 2002, Protein science : a publication of the Protein Society.

[56]  Tobin R Sosnick,et al.  The folding of single domain proteins--have we reached a consensus? , 2011, Current opinion in structural biology.

[57]  Arthur G. Palmer,et al.  Thermal Adaptation of Conformational Dynamics in Ribonuclease H , 2013, PLoS Comput. Biol..

[58]  Lesley H. Greene,et al.  Folding of an all-helical Greek-key protein monitored by quenched-flow hydrogen–deuterium exchange and NMR spectroscopy , 2011, European Biophysics Journal.

[59]  L. Kay,et al.  Observing biological dynamics at atomic resolution using NMR. , 2009, Trends in biochemical sciences.

[60]  C. Dobson,et al.  The folding of hen lysozyme involves partially structured intermediates and multiple pathways , 1992, Nature.

[61]  J. Udgaonkar,et al.  Initial hydrophobic collapse in the folding of barstar , 1995, Nature.

[62]  A M Gronenborn,et al.  Kinetics of folding of the all-beta sheet protein interleukin-1 beta. , 1993, Science.

[63]  C. Yu,et al.  Structural Events during the Refolding of an All β-Sheet Protein* , 2001, The Journal of Biological Chemistry.

[64]  Terrence G. Oas,et al.  Preorganized secondary structure as an important determinant of fast protein folding , 2001, Nature Structural Biology.

[65]  Damian C Crowther,et al.  Protein misfolding and disease: from the test tube to the organism. , 2008, Current opinion in chemical biology.

[66]  Jie Liang,et al.  Are residues in a protein folding nucleus evolutionarily conserved? , 2003, Journal of molecular biology.

[67]  Athi N. Naganathan,et al.  Insights into protein folding mechanisms from large scale analysis of mutational effects , 2010, Proceedings of the National Academy of Sciences.

[68]  C. Batt,et al.  Is folding of β-lactoglobulin non-hierarchic? intermediate with native-like β-sheet and non-native α-helix , 2000 .

[69]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[70]  R D Appel,et al.  Protein identification and analysis tools in the ExPASy server. , 1999, Methods in molecular biology.

[71]  Gunther Kern,et al.  Characterization of a folding intermediate from HIV‐1 ribonuclease H , 1998, Protein science : a publication of the Protein Society.

[72]  S. Marqusee,et al.  Stepwise protein folding at near amino acid resolution by hydrogen exchange and mass spectrometry , 2013, Proceedings of the National Academy of Sciences.

[73]  H. Dyson,et al.  Characterization of a folding intermediate of apoplastocyanin trapped by proline isomerization. , 1993, Biochemistry.

[74]  L. Mirny,et al.  Evolutionary conservation of the folding nucleus. , 2000, Journal of molecular biology.

[75]  F M Poulsen,et al.  Formation of hydrogen bonds precedes the rate-limiting formation of persistent structure in the folding of ACBP. , 2000, Journal of molecular biology.

[76]  Michelle L. Scalley,et al.  Characterization of the free energy spectrum of peptostreptococcal protein L. , 1997, Folding & design.

[77]  C M Dobson,et al.  Independent nucleation and heterogeneous assembly of structure during folding of equine lysozyme. , 1999, Journal of molecular biology.

[78]  T. Blundell,et al.  Evolution and the tertiary structure of proteins. , 1984, Annual review of biophysics and bioengineering.

[79]  A. Biegert,et al.  HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment , 2011, Nature Methods.

[80]  Kevin W Plaxco,et al.  Residues participating in the protein folding nucleus do not exhibit preferential evolutionary conservation. , 2002, Journal of molecular biology.

[81]  Rafael Brüschweiler,et al.  Contact model for the prediction of NMR N-H order parameters in globular proteins. , 2002, Journal of the American Chemical Society.

[82]  C. Dobson,et al.  The refolding of human lysozyme: a comparison with the structurally homologous hen lysozyme. , 1994, Biochemistry.

[83]  E. De Pauw,et al.  Rapid collapse into a molten globule is followed by simple two-state kinetics in the folding of lysozyme from bacteriophage λ. , 2010, Biochemistry.

[84]  Giovanni Colonna,et al.  Amino acid propensities for secondary structures are influenced by the protein structural class. , 2006, Biochemical and biophysical research communications.

[85]  David S Wishart,et al.  A simple method to predict protein flexibility using secondary chemical shifts. , 2005, Journal of the American Chemical Society.

[86]  Robert D. Finn,et al.  HMMER web server: interactive sequence similarity searching , 2011, Nucleic Acids Res..

[87]  Christopher J. Oldfield,et al.  Evolutionary Rate Heterogeneity in Proteins with Long Disordered Regions , 2002, Journal of Molecular Evolution.

[88]  H. Roder,et al.  Identification of the predominant non-native histidine ligand in unfolded cytochrome c. , 1997, Biochemistry.

[89]  M. Oliveberg,et al.  Transient aggregates in protein folding are easily mistaken for folding intermediates. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[90]  David S. Wishart,et al.  The RCI server: rapid and accurate calculation of protein flexibility using chemical shifts , 2007, Nucleic Acids Res..

[91]  A. Gronenborn,et al.  Fast folding of a prototypic polypeptide: The immunoglobulin binding domain of streptococcal protein G , 1994, Protein science : a publication of the Protein Society.

[92]  D. Baker,et al.  Contact order, transition state placement and the refolding rates of single domain proteins. , 1998, Journal of molecular biology.

[93]  R. Li,et al.  The hydrogen exchange core and protein folding , 1999, Protein science : a publication of the Protein Society.

[94]  A. Fersht,et al.  Is there a unifying mechanism for protein folding? , 2003, Trends in biochemical sciences.

[95]  L A Mirny,et al.  How evolution makes proteins fold quickly. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[96]  Shuji Akiyama,et al.  Hierarchical folding mechanism of apomyoglobin revealed by ultra-fast H/D exchange coupled with 2D NMR , 2008, Proceedings of the National Academy of Sciences.