PyFolding: An open-source software package for graphing, analysis and simulation of thermodynamic and kinetic models of protein folding

Our understanding of how proteins find and adopt their functional three-dimensional structure has largely arisen through experimental studies of the denaturant-and primary sequence-dependence of protein stability and the kinetics of folding. For many years, curve fitting software packages have been heavily utilized to fit simple models to these data. Although such software packages are easy to use for simple functions, they are often expensive and provide substantial impediments to applying more complex models or for the analysis of large datasets. Moreover, over the past decade, increasingly sophisticated analytical models have been generated, but without simple tools to enable routine analysis. Consequently, users have needed to generate their own tools or otherwise find willing collaborators. Here we present PyFolding, a free, open source, and extensible Python framework for the analysis and modeling of experimental protein folding and thermodynamic data. To demonstrate the utility of PyFolding, we provide examples of complex analysis: (i) multi-phase kinetic folding data fitted to linked equations and (ii) thermodynamic equilibrium data from consensus designed repeat proteins to both homo-and heteropolymer variants of the Ising model. Example scripts to perform these and other operations are supplied with the software. Further, we show that PyFolding can be used in conjunction with Jupyter notebooks as an easy way to share methods and analysis for publication and amongst research teams.

[1]  Andreas Plückthun,et al.  Engineered proteins with desired specificity: DARPins, other alternative scaffolds and bispecific IgGs. , 2014, Current opinion in structural biology.

[2]  M. Kamal,et al.  DARPins Bioengineering and its Theranostic Approaches: Emerging Trends in Protein Engineering. , 2017, Current pharmaceutical design.

[3]  Peter G Wolynes,et al.  The capillarity picture and the kinetics of one-dimensional protein folding , 2008, Proceedings of the National Academy of Sciences.

[4]  Randy J. Read,et al.  Overview of the CCP4 suite and current developments , 2011, Acta crystallographica. Section D, Biological crystallography.

[5]  Elizabeth A Komives,et al.  Folding landscapes of ankyrin repeat proteins: experiments meet theory. , 2008, Current opinion in structural biology.

[6]  Sophie E Jackson,et al.  A comparison of the folding of two knotted proteins: YbeA and YibK. , 2007, Journal of molecular biology.

[7]  V Muñoz,et al.  Folding dynamics and mechanism of beta-hairpin formation. , 1997, Nature.

[8]  J. Kubelka,et al.  Site-specific thermodynamic stability and unfolding of a de novo designed protein structural motif mapped by 13C isotopically edited IR spectroscopy. , 2014, Journal of the American Chemical Society.

[9]  H. Lilie,et al.  Structural insights into an equilibrium folding intermediate of an archaeal ankyrin repeat protein , 2008, Proceedings of the National Academy of Sciences.

[10]  Andreas Plückthun,et al.  Folding and unfolding mechanism of highly stable full-consensus ankyrin repeat proteins. , 2008, Journal of molecular biology.

[11]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[12]  D. Barrick,et al.  Direct observation of parallel folding pathways revealed using a symmetric repeat protein system. , 2014, Biophysical journal.

[13]  V. Muñoz,et al.  Folding dynamics and mechanism of β-hairpin formation , 1997, Nature.

[14]  L Serrano,et al.  The folding of an enzyme. III. Structure of the transition state for unfolding of barnase analysed by a protein engineering procedure. , 1992, Journal of molecular biology.

[15]  Athi N. Naganathan,et al.  Thermodynamics of downhill folding: multi-probe analysis of PDD, a protein that folds over a marginal free energy barrier. , 2014, The journal of physical chemistry. B.

[16]  Tommi Kajander,et al.  A new folding paradigm for repeat proteins. , 2005, Journal of the American Chemical Society.

[17]  P. Wolynes,et al.  Complex energy landscape of a giant repeat protein. , 2013, Structure.

[18]  Martin Niss,et al.  History of the Lenz-Ising Model 1920–1950: From Ferromagnetic to Cooperative Phenomena , 2005 .

[19]  Andreas Plückthun,et al.  Advances in the design and engineering of peptide-binding repeat proteins , 2017, Biological chemistry.

[20]  E. Main,et al.  Fibrous nanostructures from the self-assembly of designed repeat protein modules. , 2012, Angewandte Chemie.

[21]  A. Fersht,et al.  The folding of an enzyme. I. Theory of protein engineering analysis of stability and pathway of protein folding. , 1992, Journal of molecular biology.

[22]  E. Main,et al.  Repeat protein engineering: creating functional nanostructures/biomaterials from modular building blocks. , 2013, Biochemical Society transactions.

[23]  V. Muñoz,et al.  A simple model for calculating the kinetics of protein folding from three-dimensional structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[24]  A. Fersht,et al.  Folding of chymotrypsin inhibitor 2. 1. Evidence for a two-state transition. , 1991, Biochemistry.

[25]  S. Brush History of the Lenz-Ising Model , 1967 .

[26]  V. Muñoz,et al.  Experimental Identification of Downhill Protein Folding , 2002, Science.

[27]  Kevin W Plaxco,et al.  Protein folding: Defining a “standard” set of experimental conditions and a preliminary kinetic data set of two‐state proteins , 2005, Protein science : a publication of the Protein Society.

[28]  J. Viala,et al.  Correction for Javadi and Main, Exploring the folding energy landscape of a series of designed consensus tetratricopeptide repeat proteins , 2009, Proceedings of the National Academy of Sciences.

[29]  S. Jackson,et al.  Folding pathway of FKBP12 and characterisation of the transition state. , 1999, Journal of molecular biology.

[30]  E. Main,et al.  Ising Model Reprogramming of a Repeat Protein's Equilibrium Unfolding Pathway , 2016, Journal of molecular biology.

[31]  B. Zimm,et al.  Theory of the Phase Transition between Helix and Random Coil in Polypeptide Chains , 1959 .

[32]  L. Regan,et al.  NextGen protein design. , 2013, Biochemical Society transactions.

[33]  J. Kubelka,et al.  Sequence, structure, and cooperativity in folding of elementary protein structural motifs , 2015, Proceedings of the National Academy of Sciences.

[34]  Elizabeth A Komives,et al.  Folding kinetics of the cooperatively folded subdomain of the IκBα ankyrin repeat domain. , 2011, Journal of molecular biology.

[35]  Jane Clarke,et al.  The folding of spectrin domains II: phi-value analysis of R16. , 2004, Journal of molecular biology.

[36]  Doug Barrick,et al.  The contribution of entropy, enthalpy, and hydrophobic desolvation to cooperativity in repeat-protein folding. , 2011, Structure.

[37]  Jane Clarke,et al.  Experimental evidence for a frustrated energy landscape in a 3-helix bundle protein family , 2009, Nature.

[38]  Alan R. Lowe,et al.  Mapping the Topography of a Protein Energy Landscape. , 2015, Journal of the American Chemical Society.

[39]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[40]  L. Regan,et al.  New materials from proteins and peptides. , 2012, Current opinion in structural biology.

[41]  Alan R. Lowe,et al.  Biophysical characterisation of the small ankyrin repeat protein myotrophin. , 2007, Journal of molecular biology.

[42]  Tijana Z Grove,et al.  Nanostructured functional films from engineered repeat proteins , 2013, Journal of The Royal Society Interface.

[43]  L. Regan,et al.  Designed Tpr Modules as Novel Anticancer Agents , 2022 .

[44]  E. Henry,et al.  Chemical, physical, and theoretical kinetics of an ultrafast folding protein , 2008, Proceedings of the National Academy of Sciences.

[45]  L. Regan,et al.  A modular approach to the design of protein‐based smart gels , 2012, Biopolymers.

[46]  Doug Barrick,et al.  Analysis of repeat-protein folding using nearest-neighbor statistical mechanical models. , 2009, Methods in enzymology.

[47]  T. Kiefhaber,et al.  Shape of the free energy barriers for protein folding probed by multiple perturbation analysis. , 2006, Journal of molecular biology.

[48]  H. Roder,et al.  Microsecond folding dynamics of apomyoglobin at acidic pH. , 2012, The journal of physical chemistry. B.