Thermodynamic prediction of protein neutrality.

We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 beta-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications.

[1]  C. Dobson,et al.  Sequence does specify protein conformation. , 1998, Trends in biochemical sciences.

[2]  P. Wolynes,et al.  Symmetry and the energy landscapes of biomolecules. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[3]  A. Fersht Structure and mechanism in protein science , 1998 .

[4]  W. Ebeling Stochastic Processes in Physics and Chemistry , 1995 .

[5]  E. Shakhnovich,et al.  Engineering of stable and fast-folding sequences of model proteins. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[6]  N. Oppenheimer,et al.  Structure and mechanism , 1989 .

[7]  Brian K Shoichet,et al.  Evolution of an antibiotic resistance enzyme constrained by stability and activity trade-offs. , 2002, Journal of molecular biology.

[8]  Fengzhu Sun,et al.  The Polymerase Chain Reaction and Branching Processes , 1995, J. Comput. Biol..

[9]  B K Shoichet,et al.  Enhancement of protein stability by the combination of point mutations in T4 lysozyme is additive. , 1995, Protein engineering.

[10]  D. Sherrington Stochastic Processes in Physics and Chemistry , 1983 .

[11]  E. Shakhnovich,et al.  Influence of point mutations on protein structure: probability of a neutral mutation. , 1991, Journal of theoretical biology.

[12]  Akinori Sarai,et al.  ProTherm, version 4.0: thermodynamic database for proteins and mutants , 2004, Nucleic Acids Res..

[13]  M. Levitt,et al.  Simulating protein evolution in sequence and structure space. , 2004, Current opinion in structural biology.

[14]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[15]  S. Bouvier,et al.  Systematic mutation of bacteriophage T4 lysozyme. , 1991, Journal of molecular biology.

[16]  R. Sauer,et al.  Bacteriophage lambda cro mutations: effects on activity and intracellular degradation. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[17]  E. Bornberg-Bauer,et al.  Modeling evolutionary landscapes: mutational stability, topology, and superfunnels in sequence space. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[18]  M. Levitt,et al.  Exploring conformational space with a simple lattice model for protein structure. , 1994, Journal of molecular biology.

[19]  Eugene I Shakhnovich,et al.  Structural determinant of protein designability. , 2002, Physical review letters.

[20]  R A Goldstein,et al.  Why are some proteins structures so common? , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[21]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[22]  B. Bainbridge,et al.  Genetics , 1981, Experientia.

[23]  R. Siegel,et al.  Generation of large libraries of random mutants in Bacillus subtilis by PCR-based plasmid multimerization. , 1997, BioTechniques.

[24]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[25]  D. Shortle,et al.  Genetic analysis of staphylococcal nuclease: identification of three intragenic "global" suppressors of nuclease-minus mutations. , 1985, Genetics.

[26]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[27]  C Cruz,et al.  Genetic studies of the lac repressor. XIV. Analysis of 4000 altered Escherichia coli lac repressors reveals essential and non-essential residues, as well as "spacers" which do not require a specific sequence. , 1994, Journal of molecular biology.

[28]  S. Bouvier,et al.  Alteration of T4 lysozyme structure by second‐site reversion of deleterious mutations , 1997, Protein science : a publication of the Protein Society.

[29]  E. Shakhnovich,et al.  STABILITY OF DESIGNED PROTEINS AGAINST MUTATIONS , 1998, cond-mat/9809410.

[30]  A. G. Day,et al.  Step-wise mutation of barnase to binase. A procedure for engineering increased stability of proteins and an experimental analysis of the evolution of protein stability. , 1993, Journal of molecular biology.

[31]  C. Tanford Macromolecules , 1994, Nature.

[32]  Frances H Arnold,et al.  Library analysis of SCHEMA‐guided protein recombination , 2003, Protein science : a publication of the Protein Society.

[33]  Hao Li,et al.  Designability and thermal stability of protein structures , 2003, cond-mat/0303600.

[34]  N. Kampen,et al.  Stochastic processes in physics and chemistry , 1981 .

[35]  P G Wolynes,et al.  Protein folding mechanisms and the multidimensional folding funnel , 1998, Proteins.

[36]  J. Wells,et al.  Additivity of mutational effects in proteins. , 1990, Biochemistry.

[37]  Richard A Goldstein,et al.  Why are proteins so robust to site mutations? , 2002, Journal of molecular biology.

[38]  C. Wilke,et al.  Evolution of mutational robustness. , 2003, Mutation research.

[39]  Juno Choe,et al.  Protein tolerance to random amino acid change. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Marianne Manchester,et al.  Complete mutagenesis of the HIV-1 protease , 1989, Nature.

[41]  M. Levitt,et al.  Funnel‐like organization in sequence space determines the distributions of protein stability and folding rate preferred by evolution , 2004, Proteins.

[42]  D Gilis,et al.  PoPMuSiC, an algorithm for predicting protein mutant stability changes: application to prion proteins. , 2000, Protein engineering.

[43]  Frances H Arnold,et al.  Why high-error-rate random mutagenesis libraries are enriched in functional and improved proteins. , 2004, Journal of molecular biology.

[44]  D. Axe Estimating the prevalence of protein sequences adopting functional enzyme folds. , 2004, Journal of molecular biology.

[45]  N. Wingreen,et al.  Emergence of Preferred Structures in a Simple Model of Protein Folding , 1996, Science.

[46]  F. Young Biochemistry , 1955, The Indian Medical Gazette.