Expanding the space of protein geometries by computational design of de novo fold families

Naturally occurring proteins use a limited set of fold topologies, but vary the precise geometries of structural elements to create distinct shapes optimal for function. Here we present a computational design method termed LUCS that mimics nature’s ability to create families of proteins with the same overall fold but precisely tunable geometries. Through near-exhaustive sampling of loop-helix-loop elements, LUCS generates highly diverse geometries encompassing those found in nature but also surpassing known structure space. Biophysical characterization shows that 17 (38%) out of 45 tested LUCS designs were well folded, including 16 with designed non-native geometries. Four experimentally solved structures closely match the designs. LUCS greatly expands the designable structure space and provides a new paradigm for designing proteins with tunable geometries customizable for novel functions. One Sentence Summary A computational method to systematically sample loop-helix-loop geometries expands the structure space of designer proteins.

[1]  Adam M Damry,et al.  Rational design of proteins that exchange on functional timescales. , 2017, Nature chemical biology.

[2]  B. Kuhlman,et al.  Design of structurally distinct proteins using strategies inspired by evolution , 2016, Science.

[3]  Daniel W. Kulp,et al.  Generalized Fragment Picking in Rosetta: Design, Protocols and Applications , 2011, PloS one.

[4]  Tanja Kortemme,et al.  Computational design of structured loops for new protein functions , 2019, Biological chemistry.

[5]  Gevorg Grigoryan,et al.  Rapid search for tertiary fragments reveals protein sequence–structure relationships , 2015, Protein science : a publication of the Protein Society.

[6]  David Baker,et al.  An exciting but challenging road ahead for computational enzyme design , 2010, Protein science : a publication of the Protein Society.

[7]  Gaohua Liu,et al.  Principles for designing proteins with cavities formed by curved β sheets , 2017, Science.

[8]  Tanja Kortemme,et al.  Flexible backbone sampling methods to model and design protein alternative conformations. , 2013, Methods in enzymology.

[9]  David Baker,et al.  Exploring the repeat protein universe through computational protein design , 2015, Nature.

[10]  F. Crick,et al.  The packing of α‐helices: simple coiled‐coils , 1953 .

[11]  David Baker,et al.  What has de novo protein design taught us about protein folding and biophysics? , 2019, Protein science : a publication of the Protein Society.

[12]  Sung-Hou Kim,et al.  Global mapping of the protein structure space and application in structure-based inference of protein function. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[13]  D. Baker,et al.  Global analysis of protein folding using massively parallel design, synthesis, and testing , 2017, Science.

[14]  David E. Kim,et al.  Simultaneous Optimization of Biomolecular Energy Functions on Features from Small Molecules and Macromolecules. , 2016, Journal of chemical theory and computation.

[15]  D. Baker,et al.  De novo design of a four-fold symmetric TIM-barrel protein with atomic-level accuracy , 2015, Nature chemical biology.

[16]  D. Baker,et al.  Control over overall shape and size in de novo designed proteins , 2015, Proceedings of the National Academy of Sciences.

[17]  D. Raleigh,et al.  De novo design of helical bundles as models for understanding protein folding and function. , 2000, Accounts of chemical research.

[18]  David A. Lee,et al.  CATH: an expanded resource to predict protein function through structure and sequence , 2016, Nucleic Acids Res..

[19]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[20]  Steven E. Brenner,et al.  SCOPe: Structural Classification of Proteins—extended, integrating SCOP and ASTRAL data and classification of new structures , 2013, Nucleic Acids Res..

[21]  D. Baker,et al.  De novo design of a non-local beta-sheet protein with high stability and accuracy , 2018 .

[22]  Richard B. Sessions,et al.  Computational design of water-soluble α-helical barrels , 2014, Science.

[23]  D. Baker,et al.  Principles for designing ideal protein structures , 2012, Nature.

[24]  P. S. Kim,et al.  High-resolution protein design with backbone freedom. , 1998, Science.

[25]  Archana G. Chavan,et al.  An amino acid packing code for α-helical structure and protein design. , 2012, Journal of molecular biology.

[26]  William Sheffler,et al.  De novo design of a fluorescence-activating β-barrel , 2018, Nature.

[27]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[28]  Roberto A Chica,et al.  Multistate Computational Protein Design with Backbone Ensembles. , 2017, Methods in molecular biology.

[29]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[30]  D. Baker,et al.  High thermodynamic stability of parametrically designed helical bundles , 2014, Science.