Principles for designing ideal protein structures

Unlike random heteropolymers, natural proteins fold into unique ordered structures. Understanding how these are encoded in amino-acid sequences is complicated by energetically unfavourable non-ideal features—for example kinked α-helices, bulged β-strands, strained loops and buried polar groups—that arise in proteins from evolutionary selection for biological function or from neutral drift. Here we describe an approach to designing ideal protein structures stabilized by completely consistent local and non-local interactions. The approach is based on a set of rules relating secondary structure patterns to protein tertiary motifs, which make possible the design of funnel-shaped protein folding energy landscapes leading into the target folded state. Guided by these rules, we designed sequences predicted to fold into ideal protein structures consisting of α-helices, β-strands and minimal loops. Designs for five different topologies were found to be monomeric and very stable and to adopt structures in solution nearly identical to the computational models. These results illuminate how the folding funnels of natural proteins arise and provide the foundation for engineering a new generation of functional proteins free from natural evolution.

[1]  N. Go Theoretical studies of protein folding. , 1983, Annual review of biophysics and bioengineering.

[2]  D. W. Bolen,et al.  Unfolding free energy changes determined by the linear extrapolation method. 1. Unfolding of phenylmethanesulfonyl alpha-chymotrypsin using different denaturants. , 1988, Biochemistry.

[3]  K. Wüthrich,et al.  Stereospecific nuclear magnetic resonance assignments of the methyl groups of valine and leucine in the DNA-binding domain of the 434 repressor by biosynthetically directed fractional 13C labeling. , 1989, Biochemistry.

[4]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[5]  J. Richardson,et al.  De novo design, expression, and characterization of Felix: a four-helix bundle protein of native-like sequence. , 1990, Science.

[6]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[7]  J. Onuchic,et al.  Protein folding funnels: a kinetic approach to the sequence-structure relationship. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[8]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[9]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[10]  K Wüthrich,et al.  The program XEASY for computer-supported NMR spectral analysis of biological macromolecules , 1995, Journal of biomolecular NMR.

[11]  C. Pace,et al.  How to measure and predict the molar absorption coefficient of a protein , 1995, Protein science : a publication of the Protein Society.

[12]  J. Onuchic,et al.  Toward an outline of the topography of a realistic protein-folding funnel. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[13]  S. Grzesiek,et al.  NMRPipe: A multidimensional spectral processing system based on UNIX pipes , 1995, Journal of biomolecular NMR.

[14]  Ad Bax,et al.  Magnetic Field Dependence of Nitrogen−Proton J Splittings in 15N-Enriched Human Ubiquitin Resulting from Relaxation Interference and Residual Dipolar Coupling , 1996 .

[15]  G. Montelione,et al.  High-level production of uniformly ¹⁵N- and ¹³C-enriched fusion proteins in Escherichia coli. , 1996, Journal of biomolecular NMR.

[16]  G. Montelione,et al.  High-level production of uniformly 15N-and 13C-enriched fusion proteins in Escherichia coli , 1996 .

[17]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[18]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[19]  S. L. Mayo,et al.  De novo protein design: fully automated sequence selection. , 1997, Science.

[20]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[21]  P. S. Kim,et al.  High-resolution protein design with backbone freedom. , 1998, Science.

[22]  D. Baker,et al.  Prediction of local structure in proteins using a library of sequence-structure motifs. , 1998, Journal of molecular biology.

[23]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[24]  D. Raleigh,et al.  De novo design of helical bundles as models for understanding protein folding and function. , 2000, Accounts of chemical research.

[25]  D. Richardson,et al.  Exploring steric constraints on protein mutations using MAGE/PROBE , 2000, Protein science : a publication of the Protein Society.

[26]  S. L. Mayo,et al.  Enzyme-like proteins by computational design , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Torsten Herrmann,et al.  Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. , 2002, Journal of molecular biology.

[28]  Richard Bonneau,et al.  Contact order and ab initio protein structure prediction , 2002, Protein science : a publication of the Protein Society.

[29]  C. M. Summa,et al.  Computational de novo design, and characterization of an A(2)B(2) diiron protein. , 2002, Journal of molecular biology.

[30]  J. Richardson,et al.  Natural β-sheet proteins use negative design to avoid edge-to-edge aggregation , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[31]  W. Jin,et al.  De novo design of foldable proteins with smooth folding funnel: automated negative design and experimental verification. , 2003, Structure.

[32]  D. Baker,et al.  A large scale test of computational protein design: folding and stability of nine completely redesigned globular proteins. , 2003, Journal of molecular biology.

[33]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[34]  P. Harbury,et al.  Automated design of specificity in molecular recognition , 2003, Nature Structural Biology.

[35]  Shankar Subramaniam,et al.  Protein local structure prediction from sequence , 2003, Proteins.

[36]  Hidetoshi Kono,et al.  Computational design and characterization of a monomeric helical dinuclear metalloprotein. , 2003, Journal of molecular biology.

[37]  M. Nilges,et al.  Refinement of protein structures in explicit solvent , 2003, Proteins.

[38]  W. DeGrado,et al.  De novo design of catalytic proteins. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[39]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[40]  D. Baker,et al.  Computational redesign of protein-protein interaction specificity , 2004, Nature Structural &Molecular Biology.

[41]  Gaohua Liu,et al.  NMR data collection and analysis protocol for high-throughput protein structure determination. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Robert Powers,et al.  An integrated platform for automated analysis of protein NMR structures. , 2005, Methods in enzymology.

[43]  Design of lambda Cro fold: solution structure of a monomeric variant of the de novo protein. , 2005, Journal of molecular biology.

[44]  F. Studier,et al.  Protein production by auto-induction in high density shaking cultures. , 2005, Protein expression and purification.

[45]  Thomas Szyperski,et al.  G-matrix Fourier transform NOESY-based protocol for high-quality protein structure determination. , 2005, Journal of the American Chemical Society.

[46]  M. Ota,et al.  Design of λ Cro Fold: Solution Structure of a Monomeric Variant of the De Novo Protein , 2005 .

[47]  Robert Powers,et al.  Protein NMR recall, precision, and F-measure scores (RPF scores): structure quality assessment measures based on information retrieval statistics. , 2005, Journal of the American Chemical Society.

[48]  C. Etchebest,et al.  A structural alphabet for local protein structures: Improved prediction methods , 2005, Proteins.

[49]  Brian Kuhlman,et al.  Computer-based design of novel protein structures. , 2006, Annual review of biophysics and biomolecular structure.

[50]  G. Rose,et al.  Secondary structure determines protein topology , 2006, Protein science : a publication of the Protein Society.

[51]  S. Takada,et al.  Shaping up the protein folding funnel by local interaction: lesson from a structure prediction study. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Robert Powers,et al.  A topology‐constrained distance network algorithm for protein structure determination from NOESY data , 2005, Proteins.

[53]  Lauren L. Perskie,et al.  Physical‐chemical determinants of turn conformations in globular proteins , 2007, Protein science : a publication of the Protein Society.

[54]  Geoffrey K. Hom,et al.  Full-sequence computational design and solution structure of a thermostable protein variant. , 2007, Journal of molecular biology.

[55]  Gaetano T Montelione,et al.  Evaluating protein structures determined by structural genomics consortia , 2006, Proteins.

[56]  Xiaozhen Hu,et al.  Computer-based redesign of a beta sandwich protein suggests that extensive negative design is not required for de novo beta sheet design. , 2008, Structure.

[57]  Eric A. Althoff,et al.  De Novo Computational Design of Retro-Aldol Enzymes , 2008, Science.

[58]  Eric A. Althoff,et al.  Kemp elimination catalysts by computational enzyme design , 2008, Nature.

[59]  D. Baker,et al.  RosettaHoles: Rapid assessment of protein core packing for structure prediction, refinement, design, and validation , 2008, Protein science : a publication of the Protein Society.

[60]  Ken A. Dill,et al.  Predicting Peptide Structures in Native Proteins from Physical Simulations of Fragments , 2009, PLoS Comput. Biol..

[61]  A. Bax,et al.  TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts , 2009, Journal of biomolecular NMR.

[62]  Jasmine L. Gallaher,et al.  Computational Design of an Enzyme Catalyst for a Stereoselective Bimolecular Diels-Alder Reaction , 2010, Science.

[63]  Adrien Treuille,et al.  Predicting protein structures with a multiplayer online game , 2010, Nature.

[64]  L. Stamatatos,et al.  Computational design of epitope-scaffolds allows induction of antibodies specific for a poorly immunogenic HIV vaccine epitope. , 2010, Structure.

[65]  D. Baker,et al.  Alternate states of proteins revealed by detailed energy landscape mapping. , 2011, Journal of molecular biology.

[66]  Christopher M. MacDermaid,et al.  Theoretical and computational protein design. , 2011, Annual review of physical chemistry.

[67]  Timothy A. Whitehead,et al.  Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin , 2011, Science.

[68]  Gozde Bozdagi Akar,et al.  Improved prediction methods for scalable predictive animated mesh compression , 2011, J. Vis. Commun. Image Represent..

[69]  Gaohua Liu,et al.  Preparation of protein samples for NMR structure, function, and small-molecule screening studies. , 2011, Methods in enzymology.

[70]  D. Baker,et al.  Computation-Guided Backbone Grafting of a Discontinuous Motif onto a Protein Scaffold , 2011, Science.

[71]  D. Baker,et al.  RosettaRemodel: A Generalized Framework for Flexible Backbone Protein Design , 2011, PloS one.

[72]  Ryo Takeuchi,et al.  Computational redesign of a mononuclear zinc metalloenzyme for organophosphate hydrolysis. , 2012, Nature chemical biology.

[73]  D. Baker,et al.  Computational Design of Self-Assembling Protein Nanomaterials with Atomic Level Accuracy , 2012, Science.

[74]  Jens Meiler,et al.  Potential of fragment recombination for rational design of proteins. , 2012, Journal of the American Chemical Society.