An adaptive geometric search algorithm for macromolecular scaffold selection

Abstract A wide variety of protein and peptidomimetic design tasks require matching functional 3D motifs to potential oligomeric scaffolds. For example, during enzyme design, one aims to graft active-site patterns—typically consisting of 3–15 residues—onto new protein surfaces. Identifying protein scaffolds suitable for such active-site engraftment requires costly searches for protein folds that provide the correct side chain positioning to host the desired active site. Other examples of biodesign tasks that require similar fast exact geometric searches of potential side chain positioning include mimicking binding hotspots, design of metal binding clusters and the design of modular hydrogen binding networks for specificity. In these applications, the speed and scaling of geometric searches limits the scope of downstream design to small patterns. Here, we present an adaptive algorithm capable of searching for side chain take-off angles, which is compatible with an arbitrarily specified functional pattern and which enjoys substantive performance improvements over previous methods. We demonstrate this method in both genetically encoded (protein) and synthetic (peptidomimetic) design scenarios. Examples of using this method with the Rosetta framework for protein design are provided. Our implementation is compatible with multiple protein design frameworks and is freely available as a set of python scripts (https://github.com/JiangTian/adaptive-geometric-search-for-protein-design).

[1]  Eric A. Althoff,et al.  Kemp elimination catalysts by computational enzyme design , 2008, Nature.

[2]  Andrew Watkins,et al.  Adding Diverse Noncanonical Backbones to Rosetta: Enabling Peptidomimetic Design , 2013, PloS one.

[3]  Kent Kirshenbaum,et al.  Cyclic peptoids. , 2007, Journal of the American Chemical Society.

[4]  D. Baker,et al.  A large scale test of computational protein design: folding and stability of nine completely redesigned globular proteins. , 2003, Journal of molecular biology.

[5]  Ruth Nussinov,et al.  3-D Substructure Matching in Protein Molecules , 1992, CPM.

[6]  Richard Bonneau,et al.  De novo prediction of three-dimensional structures for major protein families. , 2002, Journal of molecular biology.

[7]  Kun-Yi Hsin,et al.  MESPEUS: a database of the geometry of metal sites in proteins , 2008 .

[8]  Liisa Holm,et al.  Dali server update , 2016, Nucleic Acids Res..

[9]  Jonathan N. Jaworski,et al.  De novo structure prediction and experimental characterization of folded peptoid oligomers , 2012, Proceedings of the National Academy of Sciences.

[10]  D. Baker,et al.  Computational redesign of endonuclease DNA binding and cleavage specificity , 2006, Nature.

[11]  Richard Bonneau,et al.  Rational Design of Topographical Helix Mimics as Potent Inhibitors of Protein–Protein Interactions , 2014, Journal of the American Chemical Society.

[12]  David Baker,et al.  Remodeling a β-peptide bundle , 2013 .

[13]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[14]  E. Coutsias,et al.  Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling , 2009, Nature Methods.

[15]  Kent Kirshenbaum,et al.  Peptoid macrocycles: making the rounds with peptidomimetic oligomers. , 2010, Chemistry.

[16]  Robert A. Langan,et al.  De novo design of protein homo-oligomers with modular hydrogen-bond network–mediated specificity , 2016, Science.

[17]  Amelie Stein,et al.  Improvements to Robotics-Inspired Conformational Sampling in Rosetta , 2013, PloS one.

[18]  David Baker,et al.  Accurate de novo design of hyperstable constrained peptides , 2016, Nature.

[19]  Timothy A. Whitehead,et al.  Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin , 2011, Science.

[20]  H. Wolfson,et al.  Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Stephen B. H. Kent,et al.  Efficient method for the preparation of peptoids [oligo(N-substituted glycines)] by submonomer solid-phase synthesis , 1992 .

[22]  Eric A. Althoff,et al.  De Novo Computational Design of Retro-Aldol Enzymes , 2008, Science.

[23]  Roland L. Dunbrack,et al.  A new clustering of antibody CDR loop conformations. , 2011, Journal of molecular biology.

[24]  D. Baker De novo Design of Protein Homo‐Oligomers with Modular Hydrogen‐Bond Network‐Mediated Specificity. , 2016 .

[25]  Da Chen Emily Koo,et al.  Using the RosettaSurface algorithm to predict protein structure at mineral surfaces. , 2013, Methods in enzymology.

[26]  J. Thornton,et al.  Tess: A geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites , 1997, Protein science : a publication of the Protein Society.

[27]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[28]  Paramjit S. Arora,et al.  A highly stable short α-helix constrained by a main-chain hydrogen-bond surrogate , 2004 .

[29]  Andrew M Watkins,et al.  Structure-based inhibition of protein-protein interactions. , 2015, European journal of medicinal chemistry.

[30]  Ivan Huc,et al.  Synthetic foldamers. , 2011, Chemical communications.

[31]  Paramjit S. Arora,et al.  Oligooxopiperazines as nonpeptidic alpha-helix mimetics. , 2010, Organic letters.