Extreme stability in de novo-designed repeat arrays is determined by unusually stable short-range interactions

Significance We apply a statistical thermodynamic formalism to quantify the cooperativity of folding of de novo-designed helical repeat proteins (DHRs). This analysis provides a fundamental thermodynamic description of folding for de novo-designed proteins and permits comparison with naturally occurring repeat protein thermodynamics. We find that individual DHR units are intrinsically stable, unlike those of naturally occurring proteins. This observation reveals local (intrarepeat) interactions as a source of high stability in Rosetta-designed proteins and suggests that different types of DHR repeats may be combined in a single polypeptide chain, expanding the repertoire of folded DHRs for applications such as molecular recognition. Favorable intrinsic stability imparts a downhill shape to the energy landscape, suggesting that DHRs fold fast and through parallel pathways. Designed helical repeats (DHRs) are modular helix–loop–helix–loop protein structures that are tandemly repeated to form a superhelical array. Structures combining tandem DHRs demonstrate a wide range of molecular geometries, many of which are not observed in nature. Understanding cooperativity of DHR proteins provides insight into the molecular origins of Rosetta-based protein design hyperstability and facilitates comparison of energy distributions in artificial and naturally occurring protein folds. Here, we use a nearest-neighbor Ising model to quantify the intrinsic and interfacial free energies of four different DHRs. We measure the folding free energies of constructs with varying numbers of internal and terminal capping repeats for four different DHR folds, using guanidine-HCl and glycerol as destabilizing and solubilizing cosolvents. One-dimensional Ising analysis of these series reveals that, although interrepeat coupling energies are within the range seen for naturally occurring repeat proteins, the individual repeats of DHR proteins are intrinsically stable. This favorable intrinsic stability, which has not been observed for naturally occurring repeat proteins, adds to stabilizing interfaces, resulting in extraordinarily high stability. Stable repeats also impart a downhill shape to the energy landscape for DHR folding. These intrinsic stability differences suggest that part of the success of Rosetta-based design results from capturing favorable local interactions.

[1]  P. Wolynes,et al.  Spin glasses and the statistical mechanics of protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Doug Barrick,et al.  Analysis of repeat-protein folding using nearest-neighbor statistical mechanical models. , 2009, Methods in enzymology.

[3]  A. Plückthun,et al.  Efficient selection of DARPins with sub-nanomolar affinities using SRP phage display. , 2008, Journal of molecular biology.

[4]  Andreas Plückthun,et al.  Rigidly connected multispecific artificial binders with adjustable geometries , 2017, Scientific Reports.

[5]  D. Barrick,et al.  Broken TALEs: Transcription Activator-like Effectors Populate Partly Folded States. , 2016, Biophysical journal.

[6]  D. Barrick,et al.  Direct observation of parallel folding pathways revealed using a symmetric repeat protein system. , 2014, Biophysical journal.

[7]  Doug Barrick,et al.  An experimentally determined protein folding energy landscape. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[8]  H. Scheraga,et al.  Theory of helix-coil transitions in biopolymers : statistical mechanical theory of order-disorder transitions in biological macromolecules , 1970 .

[9]  J. Onuchic,et al.  Funnels, pathways, and the energy landscape of protein folding: A synthesis , 1994, Proteins.

[10]  G. Olsen,et al.  Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Doug Barrick,et al.  Repeat-protein folding: new insights into origins of cooperativity, stability, and topology. , 2008, Archives of biochemistry and biophysics.

[12]  M. Newville,et al.  Lmfit: Non-Linear Least-Square Minimization and Curve-Fitting for Python , 2014 .

[13]  P. Wolynes,et al.  The experimental survey of protein-folding energy landscapes , 2005, Quarterly Reviews of Biophysics.

[14]  Eugene I. Shakhnovich,et al.  Protein stability imposes limits on organism complexity and speed of molecular evolution , 2007, Proceedings of the National Academy of Sciences.

[15]  L. W. Seidman THE CONTRIBUTION OF , 2004 .

[16]  A. Plückthun,et al.  Design and applications of a clamp for Green Fluorescent Protein with picomolar affinity , 2017, Scientific Reports.

[17]  J. Onuchic,et al.  Theory of protein folding: the energy landscape perspective. , 1997, Annual review of physical chemistry.

[18]  Tommi Kajander,et al.  Protein design to understand peptide ligand recognition by tetratricopeptide repeat proteins. , 2004, Protein engineering, design & selection : PEDS.

[19]  D. Barrick,et al.  Creating a Homeodomain with High Stability and DNA Binding Affinity by Sequence Averaging. , 2017, Journal of the American Chemical Society.

[20]  C. Pace,et al.  Denaturant m values and heat capacity changes: Relation to changes in accessible surface areas of protein unfolding , 1995, Protein science : a publication of the Protein Society.

[21]  Erin L. Doyle,et al.  Targeting DNA Double-Strand Breaks with TAL Effector Nucleases , 2010, Genetics.

[22]  J. Onuchic,et al.  Theory of Protein Folding This Review Comes from a Themed Issue on Folding and Binding Edited Basic Concepts Perfect Funnel Landscapes and Common Features of Folding Mechanisms , 2022 .

[23]  Doug Barrick,et al.  The contribution of entropy, enthalpy, and hydrophobic desolvation to cooperativity in repeat-protein folding. , 2011, Structure.

[24]  Sheng Huang,et al.  TAL nucleases (TALNs): hybrid proteins composed of TAL effectors and FokI DNA-cleavage domain , 2010, Nucleic Acids Res..

[25]  David Baker,et al.  Exploring the repeat protein universe through computational protein design , 2015, Nature.

[26]  Peter G Wolynes,et al.  The energy landscape of modular repeat proteins: topology determines folding mechanism in the ankyrin family. , 2005, Journal of molecular biology.

[27]  A. Plückthun,et al.  A novel strategy to design binding molecules harnessing the modular nature of repeat proteins , 2003, FEBS letters.

[28]  Andreas Plückthun,et al.  Designed ankyrin repeat proteins (DARPins): binding proteins for research, diagnostics, and therapy. , 2015, Annual review of pharmacology and toxicology.

[29]  Andreas Plückthun,et al.  Folding and unfolding mechanism of highly stable full-consensus ankyrin repeat proteins. , 2008, Journal of molecular biology.

[30]  Tommi Kajander,et al.  A new folding paradigm for repeat proteins. , 2005, Journal of the American Chemical Society.

[31]  D. M. Taverna,et al.  Why are proteins marginally stable? , 2002, Proteins.

[32]  D. Barrick,et al.  Synergistic enhancement of cellulase pairs linked by consensus ankyrin repeats: Determination of the roles of spacing, orientation, and enzyme identity , 2016, Proteins.

[33]  Doug Barrick,et al.  A Naturally Occurring Repeat Protein with High Internal Sequence Identity Defines a New Class of TPR-like Proteins. , 2015, Structure.