Motif‐directed flexible backbone design of functional interactions

Computational protein design relies on a number of approximations to efficiently search the huge sequence space available to proteins. The fixed backbone and rotamer approximations in particular are important for formulating protein design as a discrete combinatorial optimization problem. However, the resulting coarse‐grained sampling of possible side‐chain terminal positions is problematic for the design of protein function, which depends on precise positioning of side‐chain atoms. Although backbone flexibility can greatly increase the conformation freedom of side‐chain functional groups, it is not obvious which backbone movements will generate the critical constellation of atoms responsible for protein function. Here, we report an automated method for identifying protein backbone movements that can give rise to any specified set of desired side‐chain atomic placements and interactions, using protein–DNA interfaces as a model system. We use a library of previously observed protein–DNA interactions (motifs) and a rotamer‐based description of side‐chain conformation freedom to identify placements for the protein backbone that can give rise to a favorable side‐chain interaction with DNA. We describe a tree‐search algorithm for identifying those combinations of interactions from the library that can be realized with minimal perturbation of the protein backbone. We compare the efficiency of this method with the alternative approach of building and screening alternate backbone conformations.

[1]  C. Pabo,et al.  Geometric analysis and comparison of protein-DNA interfaces: why is there no simple code for recognition? , 2000, Journal of molecular biology.

[2]  Adrian A Canutescu,et al.  Cyclic coordinate descent: A robotics algorithm for protein loop closure , 2003, Protein science : a publication of the Protein Society.

[3]  Ian W. Davis,et al.  The backrub motion: how protein backbone shrugs when a sidechain dances. , 2006, Structure.

[4]  De Yonker,et al.  A New Approach to Protein Design: Grafting of a Buried Transition Metal Binding Site into Escherichia coli Thioredoxin , 1992 .

[5]  P. Bradley,et al.  High-resolution structure prediction and the crystallographic phase problem , 2007, Nature.

[6]  B. Stoddard,et al.  Coevolution of a homing endonuclease and its host target sequence. , 2007, Journal of molecular biology.

[7]  Eric A. Althoff,et al.  De Novo Computational Design of Retro-Aldol Enzymes , 2008, Science.

[8]  A. Fischer Severe combined immunodeficiencies (SCID) , 2000, Clinical and experimental immunology.

[9]  Lars Malmström,et al.  Structure prediction for CASP7 targets using extensive all‐atom refinement with Rosetta@home , 2007, Proteins.

[10]  Colin A. Smith,et al.  Backrub-like backbone simulation recapitulates natural protein conformational variability and improves mutant side-chain prediction. , 2008, Journal of molecular biology.

[11]  P. Duchateau,et al.  Crystal structure of I-DmoI in complex with its target DNA provides new insights into meganuclease engineering , 2008, Proceedings of the National Academy of Sciences.

[12]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[13]  Barry L. Stoddard,et al.  The homing endonuclease I-CreI uses three metals, one of which is shared between the two active sites , 2001, Nature Structural Biology.

[14]  Geoffrey K. Hom,et al.  Full-sequence computational design and solution structure of a thermostable protein variant. , 2007, Journal of molecular biology.

[15]  B. Stoddard,et al.  The structure of I-CeuI homing endonuclease: Evolving asymmetric DNA recognition from a symmetric protein scaffold. , 2006, Structure.

[16]  D. E. Benson,et al.  Converting a maltose receptor into a nascent binuclear copper oxygenase by computational design. , 2002, Biochemistry.

[17]  David Baker,et al.  PROTEINS: Structure, Function, and Bioinformatics 58:893–904 (2005) A “Solvated Rotamer ” Approach to Modeling Water- Mediated Hydrogen Bonds at Protein–Protein Interfaces , 2022 .

[18]  Brian W. Matthews,et al.  No code for recognition , 1988, Nature.

[19]  B. Stoddard,et al.  Homing endonucleases: structural and functional insight into the catalysts of intron/intein mobility. , 2001, Nucleic acids research.

[20]  C. Pabo Molecular technology: Designing proteins and peptides , 1983, Nature.

[21]  Nicholas M. Luscombe,et al.  Amino acid?base interactions: a three-dimensional analysis of protein?DNA interactions at an atomic level , 2001, Nucleic Acids Res..

[22]  Stephen L. Mayo,et al.  Design, structure and stability of a hyperthermophilic protein variant , 1998, Nature Structural Biology.

[23]  R. Sauer,et al.  An engineered intersubunit disulfide enhances the stability and DNA binding of the N-terminal domain of lambda repressor. , 1986, Biochemistry.

[24]  H W Hellinga,et al.  Construction of a family of Cys2His2 zinc binding sites in the hydrophobic core of thioredoxin by structure-based design. , 1998, Biochemistry.

[25]  Jens Meiler,et al.  New algorithms and an in silico benchmark for computational enzyme design , 2006, Protein science : a publication of the Protein Society.

[26]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[27]  Monique Turmel,et al.  Flexible DNA target site recognition by divergent homing endonuclease isoschizomers I-CreI and I-MsoI. , 2003, Journal of molecular biology.

[28]  S. L. Mayo,et al.  De novo protein design: fully automated sequence selection. , 1997, Science.

[29]  Eric A. Althoff,et al.  Kemp elimination catalysts by computational enzyme design , 2008, Nature.

[30]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[31]  D. Baker,et al.  A simple physical model for the prediction and design of protein-DNA interactions. , 2004, Journal of molecular biology.

[32]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[33]  D. Baker,et al.  Protein–DNA binding specificity predictions with structural models , 2005, Nucleic acids research.

[34]  F M Richards,et al.  Construction of new ligand binding sites in proteins of known structure. II. Grafting of a buried transition metal binding site into Escherichia coli thioredoxin. , 1991, Journal of molecular biology.

[35]  H W Hellinga,et al.  The rational design and construction of a cuboidal iron-sulfur protein. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[36]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[37]  B W Matthews,et al.  Protein-DNA interaction. No code for recognition. , 1988, Nature.

[38]  H W Hellinga,et al.  Construction of a novel redox protein by rational design: conversion of a disulfide bridge into a mononuclear iron-sulfur center. , 1998, Biochemistry.

[39]  D. Baker,et al.  A large scale test of computational protein design: folding and stability of nine completely redesigned globular proteins. , 2003, Journal of molecular biology.

[40]  C. Pabo,et al.  Computer-aided model-building strategies for protein design. , 1986, Biochemistry.

[41]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[42]  P. S. Kim,et al.  High-resolution protein design with backbone freedom. , 1998, Science.

[43]  M. Whitlow,et al.  Crystal structure of a protein‐toxin α1‐purothionin at 2.5Å and a comparison with predicted models , 1990, Proteins.

[44]  N. Seeman,et al.  Sequence-specific Recognition of Double Helical Nucleic Acids by Proteins (base Pairs/hydrogen Bonding/recognition Fidelity/ion Binding) , 2022 .

[45]  H W Hellinga,et al.  Construction of a catalytically active iron superoxide dismutase by rational protein design. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Michael M Hoffman,et al.  AANT: the Amino Acid-Nucleotide Interaction Database. , 2004, Nucleic acids research.