Structure and dynamics of de novo proteins from a designed superfamily of 4‐helix bundles

Libraries of de novo proteins provide an opportunity to explore the structural and functional potential of biological molecules that have not been biased by billions of years of evolutionary selection. Given the enormity of sequence space, a rational approach to library design is likely to yield a higher fraction of folded and functional proteins than a stochastic sampling of random sequences. We previously investigated the potential of library design by binary patterning of hydrophobic and hydrophilic amino acids. The structure of the most stable protein from a binary patterned library of de novo 4‐helix bundles was solved previously and shown to be consistent with the design. One structure, however, cannot fully assess the potential of the design strategy, nor can it account for differences in the stabilities of individual proteins. To more fully probe the quality of the library, we now report the NMR structure of a second protein, S‐836. Protein S‐836 proved to be a 4‐helix bundle, consistent with design. The similarity between the two solved structures reinforces previous evidence that binary patterning can encode stable, 4‐helix bundles. Despite their global similarities, the two proteins have cores that are packed at different degrees of tightness. The relationship between packing and dynamics was probed using the Modelfree approach, which showed that regions containing a high frequency of chemical exchange coincide with less well‐packed side chains. These studies show (1) that binary patterning can drive folding into a particular topology without the explicit design of residue‐by‐residue packing, and (2) that within a superfamily of binary patterned proteins, the structures and dynamics of individual proteins are modulated by the identity and packing of residues in the hydrophobic core.

[1]  F. Crick,et al.  The packing of α‐helices: simple coiled‐coils , 1953 .

[2]  Martin Karplus,et al.  Vicinal Proton Coupling in Nuclear Magnetic Resonance , 1963 .

[3]  H. Scheraga Theoretical and experimental studies of conformations of polypeptides. , 1971, Chemical reviews.

[4]  C B Anfinsen,et al.  The formation and stabilization of protein structure. , 1972, The Biochemical journal.

[5]  R. Jensen Enzyme recruitment in evolution of new function. , 1976, Annual review of microbiology.

[6]  C. Chothia,et al.  Structure of proteins: packing of alpha-helices and pleated sheets. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[7]  P M Cullis,et al.  Affinities of amino acid side chains for solvent water. , 1981, Biochemistry.

[8]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[9]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[10]  Scott R. Presnell,et al.  Topological distribution of four-alpha-helix bundles. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[11]  W. Lim,et al.  Alternative packing arrangements in the hydrophobic core of λrepresser , 1989, Nature.

[12]  W Mandecki,et al.  A method for construction of long randomized open reading frames and polypeptides. , 1990, Protein engineering.

[13]  P. Wright,et al.  Intramolecular motions of a zinc finger DNA-binding domain from Xfin characterized by proton-detected natural abundance carbon-13 heteronuclear NMR spectroscopy , 1991 .

[14]  R A Smith,et al.  Loop mobility in a four-helix-bundle protein: 15N NMR relaxation measurements on human interleukin-4. , 1992, Biochemistry.

[15]  B. Matthews,et al.  Response of a protein structure to cavity-creating mutations and its relation to the hydrophobic effect. , 1992, Science.

[16]  Ad Bax,et al.  Quantitative J correlation: a new approach for measuring homonuclear three-bond J(HNH.alpha.) coupling constants in 15N-enriched proteins , 1993 .

[17]  M. Hecht,et al.  Recombinant Proteins Can Be Isolated from E. coli Cells by Repeated Cycles of Freezing and Thawing , 1994, Bio/Technology.

[18]  N L Harris,et al.  Four helix bundle diversity in globular proteins. , 1994, Journal of molecular biology.

[19]  A. Gronenborn,et al.  Investigation of the backbone dynamics of the igg‐binding domain of streptococcal protein g by heteronuclear two‐dimensional 1H‐15N nuclear magnetic resonance spectroscopy , 1994, Protein science : a publication of the Protein Society.

[20]  M. Saraste,et al.  Invariant tryptophan at a shielded site promotes folding of the conformational unit of spectrin. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[21]  A. Palmer,et al.  Backbone dynamics of Escherichia coli ribonuclease HI: correlations with structure and function in an active enzyme. , 1995, Journal of molecular biology.

[22]  A. Bax,et al.  Rotational diffusion anisotropy of human ubiquitin from 15N NMR relaxation , 1995 .

[23]  C. Dobson,et al.  Structural determinants of protein dynamics: analysis of 15N NMR relaxation measurements for main-chain and side-chain nuclei of hen egg white lysozyme. , 1995, Biochemistry.

[24]  S. Grzesiek,et al.  NMRPipe: A multidimensional spectral processing system based on UNIX pipes , 1995, Journal of biomolecular NMR.

[25]  Eric Oldfield,et al.  1H, 13C and 15N chemical shift referencing in biomolecular NMR , 1995, Journal of biomolecular NMR.

[26]  J. Thornton,et al.  AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR , 1996, Journal of biomolecular NMR.

[27]  B. Matthews,et al.  A test of the "jigsaw puzzle" model for protein folding by multiple methionine substitutions within the core of T4 lysozyme. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[28]  J M Sturtevant,et al.  What makes a protein a protein? Hydrophobic core designs that specify stability and structural properties , 1996, Protein science : a publication of the Protein Society.

[29]  M. Billeter,et al.  MOLMOL: a program for display and analysis of macromolecular structures. , 1996, Journal of molecular graphics.

[30]  R. Farid,et al.  De novo heme proteins from designed combinatorial libraries , 1997, Protein science : a publication of the Protein Society.

[31]  S Roy,et al.  Detecting native-like properties in combinatorial libraries of de novo proteins. , 1997, Folding & design.

[32]  S. L. Mayo,et al.  Probing the role of packing specificity in protein design. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[33]  A. Wand,et al.  Global topology & stability and local structure & dynamics in a synthetic spin-labeled four-helix bundle protein. , 1997, Biochemistry.

[34]  M. Hecht,et al.  De novo amyloid proteins from designed combinatorial libraries. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[35]  W. DeGrado,et al.  Solution structure and dynamics of a de novo designed three-helix bundle protein. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Michael H. Hecht,et al.  Peroxidase Activity in Heme Proteins Derived from a Designed Combinatorial Library , 2000 .

[37]  J. García de la Torre,et al.  HYDRONMR: prediction of NMR relaxation of globular proteins from atomic-level structures and hydrodynamic calculations. , 2000, Journal of magnetic resonance.

[38]  M. Hecht,et al.  Cooperative thermal denaturation of proteins designed by binary patterning of polar and nonpolar amino acids. , 2000, Biochemistry.

[39]  L. Regan,et al.  Dramatic structural and thermodynamic consequences of repacking a protein's hydrophobic core. , 2000, Structure.

[40]  C. Chothia,et al.  Structure of proteins: Packing of a-helices and pleated sheets , 2000 .

[41]  Guofeng Xu,et al.  Self-assembled monolayers from a designed combinatorial library of de novo β-sheet proteins , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[42]  C. Wallace,et al.  Conserved tryptophan in cytochrome c: importance of the unique side-chain features of the indole moiety. , 2001, Biochemical Journal.

[43]  Anthony D. Keefe,et al.  Functional proteins from a random-sequence library , 2001, Nature.

[44]  Clay Bracken,et al.  A method for efficient isotopic labeling of recombinant proteins , 2001, Journal of biomolecular NMR.

[45]  W. DeGrado,et al.  Hydrophobic core malleability of a de novo designed three-helix bundle protein. , 2001, Journal of molecular biology.

[46]  R. D. Williams,et al.  Carbon monoxide binding by de novo heme proteins derived from designed combinatorial libraries. , 2001, Journal of the American Chemical Society.

[47]  C. Orengo,et al.  One fold with many functions: the evolutionary relationships between TIM barrel families based on their sequences, structures and functions. , 2002, Journal of molecular biology.

[48]  M. Hecht,et al.  Rationally designed mutations convert de novo amyloid-like fibrils into monomeric β-sheet proteins , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Lorna J. Smith,et al.  Long-Range Interactions Within a Nonnative Protein , 2002, Science.

[50]  Yinan Wei,et al.  Stably folded de novo proteins from a designed combinatorial library , 2003, Protein science : a publication of the Protein Society.

[51]  Michael H Hecht,et al.  Midpoint reduction potentials and heme binding stoichiometries of de novo proteins from designed combinatorial libraries. , 2003, Biophysical chemistry.

[52]  Yinan Wei,et al.  Solution structure of a de novo protein from a designed combinatorial library , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Charles D Schwieters,et al.  The Xplor-NIH NMR molecular structure determination package. , 2003, Journal of magnetic resonance.

[54]  David Baker,et al.  Searching for folded proteins in vitro and in silico. , 2004, European journal of biochemistry.

[55]  An on/off resonance rotating frame relaxation experiment to monitor millisecond to microsecond timescale dynamics , 2004, Journal of biomolecular NMR.

[56]  L. H. Bradley,et al.  De novo proteins from designed combinatorial libraries , 2004, Protein science : a publication of the Protein Society.

[57]  Yinan Wei,et al.  Enzyme-like proteins from an unselected library of designed amino acid sequences. , 2004, Protein engineering, design & selection : PEDS.

[58]  F. Studier,et al.  Protein production by auto-induction in high density shaking cultures. , 2005, Protein expression and purification.

[59]  C. Orengo,et al.  Protein families and their evolution-a structural perspective. , 2005, Annual review of biochemistry.

[60]  Pier Luigi Luisi,et al.  Investigation of de novo Totally Random Biosequences, Part I , 2006, Chemistry & biodiversity.

[61]  Michael H Hecht,et al.  De novo proteins from binary-patterned combinatorial libraries. , 2006, Methods in molecular biology.

[62]  M. Hecht,et al.  Electrochemical and ligand binding studies of a de novo heme protein. , 2006, Biophysical chemistry.

[63]  P. Babbitt,et al.  Evolution of enzyme superfamilies. , 2006, Current opinion in chemical biology.

[64]  Fabio Polticelli,et al.  Investigation of de novo Totally Random Biosequences, Part II , 2006, Chemistry & biodiversity.

[65]  Jie Liang,et al.  CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues , 2006, Nucleic Acids Res..

[66]  Avijit Chakrabartty,et al.  Getting specificity from simplicity in putative proteins from the prebiotic Earth , 2007, Proceedings of the National Academy of Sciences.

[67]  M. Hecht,et al.  NMR assignment of S836: a de novo protein from a designed superfamily , 2007, Biomolecular NMR assignments.

[68]  L. H. Bradley,et al.  Protein design by binary patterning of polar and nonpolar amino acids. , 1993, Methods in molecular biology.

[69]  M. Hecht,et al.  Peroxidase activity of de novo heme proteins immobilized on electrodes. , 2007, Journal of inorganic biochemistry.

[70]  Jack Snoeyink,et al.  MolProbity: all-atom contacts and structure validation for proteins and nucleic acids , 2007, Nucleic Acids Res..