Compact Representation of Continuous Energy Surfaces for More Efficient Protein Design.

In macromolecular design, conformational energies are sensitive to small changes in atom coordinates, so modeling the small, continuous motions of atoms around low-energy wells substantially improves structural accuracy. However, modeling these motions requires a very large number of energy function calls, which form the bottleneck in design calculations. In this work, we remove this bottleneck by consolidating all conformational energy evaluations into the pre-computation of a local polynomial expansion of the energy about the "ideal" conformation for each low-energy, "rotameric" state of each residue pair. We call this expansion "energy as polynomials in internal coordinates" (EPIC). The internal coordinates can be side-chain dihedrals, backrub angles, and/or any other continuous degrees of freedom of a macromolecule, and any energy function can be used without adding asymptotic complexity to the design. We demonstrate that EPIC efficiently represents the energy surface for both molecular-mechanics and quantum-mechanical energy functions, and we apply it to protein design, modeling both side-chain and backbone degrees of freedom.
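
The idea of replacing repeated energy calls with a precomputed local polynomial can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the fitting procedure, so plain least squares over random samples is assumed here, and `toy_energy`, `fit_local_polynomial`, the polynomial degree, and the sampling half-width are all hypothetical choices made for illustration.

```python
import itertools
import numpy as np

def monomial_exponents(n_dof, max_degree):
    """All exponent tuples over n_dof coordinates with total degree <= max_degree."""
    return [e for e in itertools.product(range(max_degree + 1), repeat=n_dof)
            if sum(e) <= max_degree]

def design_matrix(offsets, exponents):
    """Evaluate every monomial at every sample; offsets has shape (n_samples, n_dof)."""
    return np.stack([np.prod(offsets ** np.array(e), axis=1) for e in exponents],
                    axis=1)

def fit_local_polynomial(energy_fn, center, half_width,
                         max_degree=2, n_samples=200, seed=0):
    """Fit a polynomial in the offsets from `center` by least squares (assumed method).

    All calls to energy_fn happen here, once; the returned function is cheap.
    """
    rng = np.random.default_rng(seed)
    center = np.asarray(center, dtype=float)
    offsets = rng.uniform(-half_width, half_width, size=(n_samples, center.size))
    energies = np.array([energy_fn(center + d) for d in offsets])
    exponents = monomial_exponents(center.size, max_degree)
    coeffs, *_ = np.linalg.lstsq(design_matrix(offsets, exponents), energies,
                                 rcond=None)

    def polynomial(x):
        d = np.atleast_2d(np.asarray(x, dtype=float) - center)
        return float((design_matrix(d, exponents) @ coeffs)[0])

    return polynomial

def toy_energy(x):
    """Hypothetical smooth energy well in two dihedral-like angles (radians)."""
    return (1.0 - np.cos(x[0])) + 0.5 * (1.0 - np.cos(x[0] - x[1])) + 0.3

# Precompute once around the "ideal" conformation, then reuse the polynomial.
ideal = np.array([0.0, 0.0])
epic_like = fit_local_polynomial(toy_energy, ideal, half_width=0.16)
probe = np.array([0.05, -0.10])
print(epic_like(probe), toy_energy(probe))
```

In this sketch the expensive function is evaluated only during the fit; later queries within the sampled region cost one small matrix-vector product, whatever energy function produced the samples.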
