A Novel Minimized Dead-End Elimination Criterion and Its Application to Protein Redesign in a Hybrid Scoring and Search Algorithm for Computing Partition Functions over Molecular Ensembles

Novel molecular function can be achieved by redesigning an enzyme's active site so that it will perform its chemical reaction on a novel substrate. One of the main challenges for protein redesign is the efficient evaluation of a combinatorial number of candidate structures. The modeling of protein flexibility, typically by using a rotamer library of commonly-observed low-energy side-chain conformations, further increases the complexity of the redesign problem. A dominant algorithm for protein redesign is Dead-End Elimination (DEE), which prunes the majority of candidate conformations by eliminating rigid rotamers that provably are not part of the Global Minimum Energy Conformation (GMEC). The identified GMEC consists of rigid rotamers that have not been energy-minimized and is referred to as the rigid-GMEC. As a post-processing step, the conformations that survive DEE may be energy-minimized. When energy minimization is performed after pruning with DEE, the combined protein design process becomes heuristic, and is no longer provably accurate: That is, the rigid-GMEC and the conformation with the lowest energy among all energy-minimized conformations (the minimized-GMEC, or minGMEC) are likely to be different. While the traditional DEE algorithm succeeds in not pruning rotamers that are part of the rigid-GMEC, it makes no guarantees regarding the identification of the minGMEC. In this paper we derive a novel, provable, and efficient DEE-like algorithm, called minimized-DEE (MinDEE), that guarantees that rotamers belonging to the minGMEC will not be pruned, while still pruning a combinatorial number of conformations. We show that MinDEE is useful not only in identifying the minGMEC, but also as a filter in an ensemble-based scoring and search algorithm for protein redesign that exploits energy-minimized conformations. We compare our results both to our previous computational predictions of protein designs and to biological activity assays of predicted protein mutants. Our provable and efficient minimized-DEE algorithm is applicable in protein redesign, protein-ligand binding prediction, and computer-aided drug design.

[1]  R. Goldstein Efficient rotamer elimination applied to protein side-chains and related spin glasses. , 1994, Biophysical journal.

[2]  J. Richardson,et al.  The penultimate rotamer library , 2000, Proteins.

[3]  A R Leach,et al.  Exploring the conformational space of protein side chains using dead‐end elimination and the A* algorithm , 1998, Proteins.

[4]  Stephen L. Mayo,et al.  Conformational splitting: A more powerful criterion for dead-end elimination , 2000, J. Comput. Chem..

[5]  Lisa Tucker-Kellogg,et al.  Systematic conformational search with constraint satisfaction , 2002 .

[6]  M. Marahiel,et al.  Dipeptide formation on engineered hybrid peptide synthetases. , 2000, Chemistry & biology.

[7]  P. S. Kim,et al.  Side-chain repacking calculations for predicting structures and stabilities of heterodimeric coiled coils , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  U. Singh,et al.  A NEW FORCE FIELD FOR MOLECULAR MECHANICAL SIMULATION OF NUCLEIC ACIDS AND PROTEINS , 1984 .

[9]  T. Stachelhaus,et al.  Exploitation of the selectivity-conferring code of nonribosomal peptide synthetases for the rational design of novel peptide antibiotics. , 2002, Biochemistry.

[10]  J. Marvin,et al.  Conversion of a maltose receptor into a zinc biosensor by computational design , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Julia M. Shifman,et al.  Modulating calmodulin binding specificity through computational protein design. , 2002, Journal of molecular biology.

[12]  I. Lasters,et al.  The fuzzy-end elimination theorem: correctly implementing the side chain placement algorithm based on the dead-end elimination theorem. , 1993, Protein engineering.

[13]  Bruce Randall Donald,et al.  Improved Pruning algorithms and Divide-and-Conquer strategies for Dead-End Elimination, with application to protein design , 2006, ISMB.

[14]  S. L. Mayo,et al.  Enzyme-like proteins by computational design , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M. Marahiel,et al.  Portability of epimerization domain and role of peptidyl carrier protein on epimerization activity in nonribosomal peptide synthetases. , 2001, Biochemistry.

[16]  D. Benjamin Gordon,et al.  Radical performance enhancements for combinatorial optimization algorithms based on the dead-end elimination theorem , 1998, Journal of Computational Chemistry.

[17]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[18]  P. Brick,et al.  Structural basis for the activation of phenylalanine in the non‐ribosomal biosynthesis of gramicidin S , 1997, The EMBO journal.

[19]  C. Walsh,et al.  Harnessing the biosynthetic code: combinations, permutations, and mutations. , 1998, Science.

[20]  W. Jin,et al.  De novo design of foldable proteins with smooth folding funnel: automated negative design and experimental verification. , 2003, Structure.

[21]  L. Looger,et al.  Computational design of receptor and sensor proteins with novel functions , 2003, Nature.

[22]  T. Stachelhaus,et al.  The specificity-conferring code of adenylation domains in nonribosomal peptide synthetases. , 1999, Chemistry & biology.

[23]  S. L. Mayo,et al.  De novo backbone and sequence design of an idealized α/β-barrel protein: evidence of stable tertiary structure , 2003 .

[24]  Niles A Pierce,et al.  Protein design is NP-hard. , 2002, Protein engineering.

[25]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[26]  M. Marahiel,et al.  Nonribosomal peptides: from genes to products. , 2003, Natural product reports.

[27]  T. Stachelhaus,et al.  Rational design of peptide antibiotics by targeted replacement of bacterial and fungal domains. , 1995, Science.

[28]  S. Wodak,et al.  Automatic procedures for protein design. , 2001, Combinatorial chemistry & high throughput screening.

[29]  Bruce Randall Donald,et al.  A novel ensemble-based scoring and search algorithm for protein redesign, and its application to modify the substrate specificity of the gramicidin synthetase A phenylalanine adenylation enzyme , 2004, RECOMB.

[30]  T. Stachelhaus,et al.  Targeted alteration of the substrate specificity of peptide synthetases by rational module swapping , 1998, Molecular and General Genetics MGG.

[31]  Rafael Najmanovich,et al.  Side‐chain flexibility in proteins upon ligand binding , 2000, Proteins.

[32]  F. Richards,et al.  Construction of new ligand binding sites in proteins of known structure. I. Computer-aided modeling of sites with pre-defined geometry. , 1991, Journal of molecular biology.

[33]  Tomas Lozano-Perez,et al.  De novo determination of peptide structure with solid-state magic-angle spinning NMR spectroscopy , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[34]  M. Marahiel,et al.  Construction of hybrid peptide synthetases by module and domain fusions. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[35]  G. Challis,et al.  Predictive, structure-based model of amino acid recognition by nonribosomal peptide synthetase adenylation domains. , 2000, Chemistry & biology.

[36]  Johan Desmet,et al.  The dead-end elimination theorem and its use in protein side-chain positioning , 1992, Nature.