Design of structurally distinct proteins using strategies inspired by evolution

Building new proteins from the old Proteins are the workhorses of biology. Designing new, stable proteins with functions desirable in biotechnology or biomedicine remains challenging. Jacobs et al. developed a computational method called SEWING that designs proteins using pieces of existing structures (see the Perspective by Netzer and Fleishman). The new proteins can contain structural features such as pockets or grooves that are required for function. The solved structures of two designed proteins agreed well with the design models. The method allows rapid design of a diverse set of structures that will facilitate functional design. Science, this issue p. 687; see also p. 657 A design strategy builds new protein structures by recombining pieces of existing protein domains. Natural recombination combines pieces of preexisting proteins to create new tertiary structures and functions. We describe a computational protocol, called SEWING, which is inspired by this process and builds new proteins from connected or disconnected pieces of existing structures. Helical proteins designed with SEWING contain structural features absent from other de novo designed proteins and, in some cases, remain folded at more than 100°C. High-resolution structures of the designed proteins CA01 and DA05R1 were solved by x-ray crystallography (2.2 angstrom resolution) and nuclear magnetic resonance, respectively, and there was excellent agreement with the design models. This method provides a new strategy to rapidly create large numbers of diverse and designable protein scaffolds.

[1]  Yong Zhou,et al.  Roll: a new algorithm for the detection of protein pockets and cavities with a rolling probe sphere , 2010, Bioinform..

[2]  N. Grishin Fold change in evolution of protein structures. , 2001, Journal of structural biology.

[3]  Liisa Holm,et al.  Dali server: conservation mapping in 3D , 2010, Nucleic Acids Res..

[4]  P. Emsley,et al.  Features and development of Coot , 2010, Acta crystallographica. Section D, Biological crystallography.

[5]  P. Zwart,et al.  Towards automated crystallographic structure refinement with phenix.refine , 2012, Acta crystallographica. Section D, Biological crystallography.

[6]  D. Torchia,et al.  Tautomeric states of the active‐site histidines of phosphorylated and unphosphorylated IIIGlc, a signal‐transducing protein from escherichia coli, using two‐dimensional heteronuclear NMR techniques , 1993, Protein science : a publication of the Protein Society.

[7]  David Baker,et al.  Rational design of alpha-helical tandem repeat proteins with closed architectures , 2015, Nature.

[8]  Gabrielle A. Reeves,et al.  Structural diversity of domain superfamilies in the CATH database. , 2006, Journal of molecular biology.

[9]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[10]  Kurt Wüthrich,et al.  Processing of multi-dimensional NMR data with the new software PROSA , 1992 .

[11]  K. Wüthrich,et al.  Stereospecific nuclear magnetic resonance assignments of the methyl groups of valine and leucine in the DNA-binding domain of the 434 repressor by biosynthetically directed fractional 13C labeling. , 1989, Biochemistry.

[12]  M. Nilges,et al.  Refinement of protein structures in explicit solvent , 2003, Proteins.

[13]  Torsten Herrmann,et al.  Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. , 2002, Journal of molecular biology.

[14]  H. Wolfson,et al.  Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[15]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[16]  D. Baker,et al.  High thermodynamic stability of parametrically designed helical bundles , 2014, Science.

[17]  T. Szyperski,et al.  GFT NMR, a new approach to rapidly obtain precise high-dimensional NMR spectral information. , 2003, Journal of the American Chemical Society.

[18]  H N Moseley,et al.  Automatic determination of protein backbone resonance assignments from triple resonance nuclear magnetic resonance data. , 2001, Methods in enzymology.

[19]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[20]  Jens Meiler,et al.  Potential of fragment recombination for rational design of proteins. , 2012, Journal of the American Chemical Society.

[21]  W. Hendrickson,et al.  Quantification of tertiary structural conservation despite primary sequence drift in the globin fold , 1994, Protein science : a publication of the Protein Society.

[22]  A. Hughes,et al.  Gene duplication and the origin of novel proteins. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[23]  C. Blake,et al.  Do genes-in-pieces imply proteins-in-pieces? , 1978, Nature.

[24]  András Fiser,et al.  Structural Characteristics of Novel Protein Folds , 2010, PLoS Comput. Biol..

[25]  Tanmay A M Bharat,et al.  A βα-barrel built by the combination of fragments from different folds , 2008, Proceedings of the National Academy of Sciences.

[26]  S. Koide Generation of new protein functions by nonhomologous combinations and rearrangements of domains and modules. , 2009, Current opinion in biotechnology.

[27]  L. Spyracopoulos,et al.  Context-dependent remodeling of structure in two large protein fragments. , 2010, Journal of molecular biology.

[28]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[29]  B. Kuhlman,et al.  Global analysis of the thermal and chemical denaturation of the N‐terminal domain of the ribosomal protein L9 in H2O and D2O. Determination of the thermodynamic parameters, ΔH°, ΔS°, and ΔC°p, and evaluation of solvent isotope effects , 1998 .

[30]  C. Chothia,et al.  Structure, function and evolution of multidomain proteins. , 2004, Current opinion in structural biology.

[31]  Lutz Riechmann,et al.  A segment of cold shock protein directs the folding of a combinatorial protein. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Daniel W. Kulp,et al.  Generalized Fragment Picking in Rosetta: Design, Protocols and Applications , 2011, PloS one.

[33]  Jens Meiler,et al.  RosettaScripts: A Scripting Language Interface to the Rosetta Macromolecular Modeling Suite , 2011, PloS one.

[34]  Gevorg Grigoryan,et al.  De novo design of a transmembrane Zn2+-transporting four-helix bundle , 2014, Science.

[35]  P Prabakaran,et al.  Thermodynamic databases for proteins and protein-nucleic acid interactions. , 2001, Biopolymers.

[36]  David C Richardson,et al.  Studying and polishing the PDB's macromolecules. , 2013, Biopolymers.

[37]  Thomas Szyperski,et al.  G-matrix Fourier transform NMR spectroscopy for complete protein resonance assignment. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[38]  P. Güntert Automated NMR structure calculation with CYANA. , 2004, Methods in molecular biology.

[39]  D. Baker,et al.  RosettaHoles2: A volumetric packing measure for protein structure refinement and validation , 2010, Protein science : a publication of the Protein Society.

[40]  David Baker,et al.  Efficient sampling of protein conformational space using fast loop building and batch minimization on highly parallel computers , 2012, J. Comput. Chem..

[41]  J. Söding,et al.  More than the sum of their parts: On the evolution of proteins from peptides , 2003, BioEssays : news and reviews in molecular, cellular and developmental biology.

[42]  A. Bax,et al.  Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks , 2013, Journal of Biomolecular NMR.

[43]  Randy J. Read,et al.  Phaser crystallographic software , 2007, Journal of applied crystallography.

[44]  T. Szyperski,et al.  NMR solution structure of Thermotoga maritima protein TM1509 reveals a Zn-metalloprotease-like tertiary structure , 2005, Journal of Structural and Functional Genomics.

[45]  Thomas Szyperski,et al.  G-matrix Fourier transform NOESY-based protocol for high-quality protein structure determination. , 2005, Journal of the American Chemical Society.

[46]  D. Baker,et al.  Principles for designing ideal protein structures , 2012, Nature.

[47]  C. Chothia,et al.  The generation of new protein functions by the combination of domains. , 2007, Structure.

[48]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[49]  Gaetano T Montelione,et al.  Evaluating protein structures determined by structural genomics consortia , 2006, Proteins.

[50]  G. Montelione,et al.  Automated analysis of protein NMR assignments using methods from artificial intelligence. , 1997, Journal of molecular biology.

[51]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[52]  David R. Westhead,et al.  Calculation of Helix Packing Angles in Protein Structures , 2003, Bioinform..

[53]  David Baker,et al.  Exploring the repeat protein universe through computational protein design , 2015, Nature.