Design of novel protein building modules and modular architectures.

Nature uses only a limited number of protein topologies and while several folds have evolved independently over time, there are clearly many possible topologies that have not been explored by evolution. With recent advances of protein design concepts, computational modeling tools, high resolution and high-throughput experimental methods it is now possible to design new protein architectures. The collection of building blocks and design principles widened both in size and complexity, offering an expanded toolset for building new modular folds and functional protein structures. Here we review and discuss recent achievements of protein design, focusing in particular on the use and prospects of modular approaches for assembling new protein folds.

[1]  David Baker,et al.  Exploring the repeat protein universe through computational protein design , 2015, Nature.

[2]  R. Kolodny,et al.  Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths , 2017, Proceedings of the National Academy of Sciences.

[3]  D. Baker,et al.  Global analysis of protein folding using massively parallel design, synthesis, and testing , 2017, Science.

[4]  Andrew C Doxey,et al.  Modular evolution and the origins of symmetry: reconstruction of a three-fold symmetric globular protein. , 2012, Structure.

[5]  Z. X. Wang,et al.  A re-estimation for the total numbers of protein folds and superfamilies. , 1998, Protein engineering.

[6]  Robert A. Langan,et al.  De novo design of protein homo-oligomers with modular hydrogen-bond network–mediated specificity , 2016, Science.

[7]  W. Gilbert,et al.  The exon theory of genes. , 1987, Cold Spring Harbor symposia on quantitative biology.

[8]  Timothy W. Craven,et al.  A Miniature Protein Stabilized by a Cation-π Interaction Network. , 2016, Journal of the American Chemical Society.

[9]  D. Baker,et al.  Design of ordered two-dimensional arrays mediated by noncovalent protein-protein interfaces , 2015, Science.

[10]  B. Höcker,et al.  Mimicking enzyme evolution by generating new (betaalpha)8-barrels from (betaalpha)4-half-barrels. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Kevin W Plaxco,et al.  Contact order revisited: Influence of protein size on the folding rate , 2003, Protein science : a publication of the Protein Society.

[12]  D. Baker,et al.  The coming of age of de novo protein design , 2016, Nature.

[13]  Ben M. Webb,et al.  Design of a single-chain polypeptide tetrahedron assembled from coiled-coil segments , 2013, Nature chemical biology.

[14]  Richard B. Sessions,et al.  Computational design of water-soluble α-helical barrels , 2014, Science.

[15]  David A. Lee,et al.  CATH: an expanded resource to predict protein function through structure and sequence , 2016, Nucleic Acids Res..

[16]  M. Go Correlation of DNA exonic regions with protein structural units in haemoglobin , 1981, Nature.

[17]  D. Baker,et al.  Accurate computational design of multipass transmembrane proteins , 2018, Science.

[18]  Robert A. Langan,et al.  Programmable design of orthogonal protein heterodimers , 2019 .

[19]  D. Baker,et al.  Contact order, transition state placement and the refolding rates of single domain proteins. , 1998, Journal of molecular biology.

[20]  F. Arnold,et al.  Innovation by Evolution: Bringing New Chemistry to Life (Nobel Lecture). , 2019, Angewandte Chemie.

[21]  David Baker,et al.  What has de novo protein design taught us about protein folding and biophysics? , 2019, Protein science : a publication of the Protein Society.

[22]  Kenneth Evan Thompson,et al.  SYNZIP Protein Interaction Toolbox: in Vitro and in Vivo Specifications of Heterospecific Coiled-Coil Interaction Domains , 2012, ACS synthetic biology.

[23]  S J de Souza,et al.  Origin of genes. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Noah Linden,et al.  A de novo peptide hexamer with a mutable channel , 2011, Nature chemical biology.

[25]  W. Chiu,et al.  Designer nanoscale DNA assemblies programmed from the top down , 2016, Science.

[26]  D. Baker,et al.  De novo design of a four-fold symmetric TIM-barrel protein with atomic-level accuracy , 2015, Nature chemical biology.

[27]  David Baker,et al.  Accurate de novo design of hyperstable constrained peptides , 2016, Nature.

[28]  Hiroki Noguchi,et al.  Computational design of a self-assembling symmetrical β-propeller protein , 2014, Proceedings of the National Academy of Sciences.

[29]  Fabio Lapenta,et al.  Coiled coil protein origami: from modular design principles towards biotechnological applications. , 2018, Chemical Society reviews.

[30]  R. Pappu,et al.  Covalently-assembled single-chain protein nanostructures with ultra-high stability , 2019, Nature Communications.

[31]  D. Baker,et al.  Principles for designing ideal protein structures , 2012, Nature.

[32]  Derek N. Woolfson,et al.  Installing hydrolytic activity into a completely de novo protein framework. , 2016, Nature chemistry.

[33]  J. Richardson,et al.  De novo design, expression, and characterization of Felix: a four-helix bundle protein of native-like sequence. , 1990, Science.

[34]  Derek N Woolfson,et al.  Engineering protein stability with atomic precision in a monomeric miniprotein. , 2017, Nature chemical biology.

[35]  C. Ponting,et al.  On the evolution of protein folds: are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world? , 2001, Journal of structural biology.

[36]  D. Baker,et al.  Prediction of local structure in proteins using a library of sequence-structure motifs. , 1998, Journal of molecular biology.

[37]  A. Urvoas,et al.  Alpha repeat proteins (αRep) as expression and crystallization helpers. , 2017, Journal of structural biology.

[38]  B. Kuhlman,et al.  Design of structurally distinct proteins using strategies inspired by evolution , 2016, Science.

[39]  David Baker,et al.  Accurate design of co-assembling multi-component protein nanomaterials , 2014, Nature.

[40]  Helena Gradišar,et al.  Design principles for rapid folding of knotted DNA nanostructures , 2016, Nature Communications.

[41]  Vikram Khipple Mulligan,et al.  De Novo Design of Bioactive Protein Switches , 2019, Nature.

[42]  D. Woolfson The design of coiled-coil structures and assemblies. , 2005, Advances in protein chemistry.

[43]  Alessandro Laio,et al.  Exploring the Universe of Protein Structures beyond the Protein Data Bank , 2010, PLoS Comput. Biol..

[44]  Gevorg Grigoryan,et al.  Tertiary alphabet for the observable protein structural universe , 2016, Proceedings of the National Academy of Sciences.

[45]  D. Baker,et al.  Control over overall shape and size in de novo designed proteins , 2015, Proceedings of the National Academy of Sciences.

[46]  C. Chothia One thousand families for the molecular biologist , 1992, Nature.

[47]  Tanmay A M Bharat,et al.  A βα-barrel built by the combination of fragments from different folds , 2008, Proceedings of the National Academy of Sciences.

[48]  Ajasja Ljubetič,et al.  Design of coiled-coil protein-origami cages that self-assemble in vitro and in vivo , 2017, Nature Biotechnology.

[49]  D. Baker,et al.  Computational Design of Self-Assembling Protein Nanomaterials with Atomic Level Accuracy , 2012, Science.

[50]  S. Balaji,et al.  Internal symmetry in protein structures: prevalence, functional relevance and evolution. , 2015, Current opinion in structural biology.

[51]  A. Keating,et al.  Modular assembly of a protein nanotriangle using orthogonally interacting coiled coils , 2017, Scientific Reports.

[52]  Aimee L Boyle,et al.  Construction of a Chassis for a Tripartite Protein-Based Molecular Motor , 2017, ACS synthetic biology.

[53]  Gaohua Liu,et al.  Principles for designing proteins with cavities formed by curved β sheets , 2017, Science.

[54]  William R Taylor,et al.  Probing the "dark matter" of protein fold space. , 2009, Structure.

[55]  Steven E. Brenner,et al.  SCOPe: Structural Classification of Proteins—extended, integrating SCOP and ASTRAL data and classification of new structures , 2013, Nucleic Acids Res..

[56]  D. Baker,et al.  De novo design of a non-local beta-sheet protein with high stability and accuracy , 2018 .

[57]  Sarah A. Teichmann,et al.  An insight into domain combinations , 2001, ISMB.

[58]  Baldomero Oliva,et al.  A supersecondary structure library and search algorithm for modeling loops in protein structures , 2006, Nucleic acids research.

[59]  William Sheffler,et al.  De novo design of a fluorescence-activating β-barrel , 2018, Nature.

[60]  L. Regan,et al.  Characterization of a helical protein designed from first principles. , 1988, Science.

[61]  A. Urvoas,et al.  Alpha-helicoidal HEAT-like Repeat Proteins (αRep) Selected as Interactors of HIV-1 Nucleocapsid Negatively Interfere with Viral Genome Packaging and Virus Maturation , 2017, Scientific Reports.

[62]  J. Söding,et al.  A vocabulary of ancient peptides at the origin of folded proteins , 2015, eLife.

[63]  M. A. García,et al.  Assembly of designed protein scaffolds into monolayers for nanoparticle patterning. , 2016, Colloids and surfaces. B, Biointerfaces.

[64]  David Baker,et al.  Accurate design of megadalton-scale two-component icosahedral protein complexes , 2016, Science.

[65]  C DeLisi,et al.  Estimating the number of protein folds. , 1998, Journal of molecular biology.

[66]  T. Yeates,et al.  Principles for designing ordered protein assemblies. , 2012, Trends in cell biology.

[67]  David Baker,et al.  Design of a hyperstable 60-subunit protein icosahedron , 2016, Nature.