Computational de novo design of a four‐helix bundle protein—DND_4HB

The de novo design of proteins is a rigorous test of our understanding of the key determinants of protein structure. The helix bundle is an interesting de novo design model system due to the diverse topologies that can be generated from a few simple α‐helices. Previously, noncomputational studies demonstrated that connecting amphipathic helices together with short loops can sometimes generate helix bundle proteins, regardless of the bundle's exact sequence. However, using such methods, the precise positions of helices and side chains cannot be predetermined. Since protein function depends on exact positioning of residues, we examined if sequence design tools in the program Rosetta could be used to design a four‐helix bundle with a predetermined structure. Helix position was specified using a folding procedure that constrained the design model to a defined topology, and iterative rounds of rotamer‐based sequence design and backbone refinement were used to identify a low energy sequence for characterization. The designed protein, DND_4HB, unfolds cooperatively (Tm >90°C) and a NMR solution structure shows that it adopts the target helical bundle topology. Helices 2, 3, and 4 agree very closely with the design model (backbone RMSD = 1.11 Å) and >90% of the core side chain χ1 and χ2 angles are correctly predicted. Helix 1 lies in the target groove against the other helices, but is displaced 3 Å along the bundle axis. This result highlights the potential of computational design to create bundles with atomic‐level precision, but also points at remaining challenges for achieving specific positioning between amphipathic helices.

[1]  H. Sugeta,et al.  General method for calculating helical parameters of polymer chains from bond lengths, bond angles, and internal‐rotation angles , 1967 .

[2]  D Eisenberg,et al.  The design, synthesis, and crystallization of an alpha‐helical peptide , 1986, Proteins.

[3]  L. Regan,et al.  Characterization of a helical protein designed from first principles. , 1988, Science.

[4]  Stereospecific nuclear magnetic resonance assignments of the methyl groups of valine and leucine in the DNA-binding domain of the 434 repressor by biosynthetically directed fractional 13C labeling. , 1995, Biochemistry.

[5]  J. Richardson,et al.  De novo design, expression, and characterization of Felix: a four-helix bundle protein of native-like sequence. , 1990, Science.

[6]  Kurt Wüthrich,et al.  Processing of multi-dimensional NMR data with the new software PROSA , 1992 .

[7]  Raleigh,et al.  Crystallization of a designed peptide from a molten globule ensemble. , 1995, Folding & design.

[8]  H. Kalbitzer,et al.  Protein NMR Spectroscopy. Principles and Practice , 1997 .

[9]  G. Montelione,et al.  Automated analysis of protein NMR assignments using methods from artificial intelligence. , 1997, Journal of molecular biology.

[10]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[11]  P. S. Kim,et al.  High-resolution protein design with backbone freedom. , 1998, Science.

[12]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[13]  A. Bax,et al.  Protein backbone angle restraints from searching a database for chemical shift and sequence homology , 1999, Journal of biomolecular NMR.

[14]  J. García de la Torre,et al.  HYDRONMR: prediction of NMR relaxation of globular proteins from atomic-level structures and hydrodynamic calculations. , 2000, Journal of magnetic resonance.

[15]  D. Baker,et al.  Native protein sequences are close to optimal for their structures. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[16]  H N Moseley,et al.  Automatic determination of protein backbone resonance assignments from triple resonance nuclear magnetic resonance data. , 2001, Methods in enzymology.

[17]  Torsten Herrmann,et al.  Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. , 2002, Journal of molecular biology.

[18]  T. Szyperski,et al.  GFT NMR, a new approach to rapidly obtain precise high-dimensional NMR spectral information. , 2003, Journal of the American Chemical Society.

[19]  Yinan Wei,et al.  Solution structure of a de novo protein from a designed combinatorial library , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[20]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[21]  M. Nilges,et al.  Refinement of protein structures in explicit solvent , 2003, Proteins.

[22]  Thomas Szyperski,et al.  G-matrix Fourier transform NMR spectroscopy for complete protein resonance assignment. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[23]  R. Keller,et al.  The Computer Aided Resonance Assignment Tutorial , 2004 .

[24]  P. Güntert Automated NMR structure calculation with CYANA. , 2004, Methods in molecular biology.

[25]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[26]  Thomas Szyperski,et al.  G-matrix Fourier transform NOESY-based protocol for high-quality protein structure determination. , 2005, Journal of the American Chemical Society.

[27]  W. DeGrado,et al.  Computational de novo design and characterization of a four-helix bundle protein that selectively binds a nonbiological cofactor. , 2005, Journal of the American Chemical Society.

[28]  T. Szyperski,et al.  NMR solution structure of Thermotoga maritima protein TM1509 reveals a Zn-metalloprotease-like tertiary structure , 2005, Journal of Structural and Functional Genomics.

[29]  Ian W. Davis,et al.  The backrub motion: how protein backbone shrugs when a sidechain dances. , 2006, Structure.

[30]  B. Kuhlman,et al.  Computational design of a single amino acid sequence that can switch between two distinct protein folds. , 2006, Journal of the American Chemical Society.

[31]  L. H. Bradley,et al.  Protein design by binary patterning of polar and nonpolar amino acids. , 1993, Methods in molecular biology.

[32]  Hong Cheng,et al.  De novo design of a single-chain diphenylporphyrin metalloprotein. , 2007, Journal of the American Chemical Society.

[33]  Gaetano T Montelione,et al.  Evaluating protein structures determined by structural genomics consortia , 2006, Proteins.

[34]  Shohei Koide,et al.  A Dominant Conformational Role for Amino Acid Diversity in Minimalist Protein-protein Interfaces Nih Public Access Introduction , 2022 .

[35]  M. Hecht,et al.  Structure and dynamics of de novo proteins from a designed superfamily of 4‐helix bundles , 2008, Protein science : a publication of the Protein Society.

[36]  Christian Cole,et al.  The Jpred 3 secondary structure prediction server , 2008, Nucleic Acids Res..

[37]  D. Baker,et al.  RosettaHoles: Rapid assessment of protein core packing for structure prediction, refinement, design, and validation , 2008, Protein science : a publication of the Protein Society.

[38]  A. Bax,et al.  TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts , 2009, Journal of biomolecular NMR.

[39]  D. Baker,et al.  Alternate states of proteins revealed by detailed energy landscape mapping. , 2011, Journal of molecular biology.

[40]  Timothy A. Whitehead,et al.  Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin , 2011, Science.

[41]  Brian Kuhlman,et al.  Computational design of a symmetric homodimer using β-strand assembly , 2011, Proceedings of the National Academy of Sciences.

[42]  Bryan S. Der,et al.  Metal-mediated affinity and orientation specificity in a computationally designed protein homodimer. , 2012, Journal of the American Chemical Society.

[43]  D. Baker,et al.  Principles for designing ideal protein structures , 2012, Nature.

[44]  B. Kuhlman,et al.  Increasing sequence diversity with flexible backbone protein design: the complete redesign of a protein hydrophobic core. , 2012, Structure.

[45]  D. Baker,et al.  Computational Design of Self-Assembling Protein Nanomaterials with Atomic Level Accuracy , 2012, Science.

[46]  Ryoichi Arai,et al.  Domain-swapped dimeric structure of a stable and functional de novo four-helix bundle protein, WA20. , 2012, The journal of physical chemistry. B.

[47]  Jack Snoeyink,et al.  Scientific benchmarks for guiding macromolecular energy function improvement. , 2013, Methods in enzymology.