Can contact potentials reliably predict stability of proteins?

The simplest approximation of interaction potential between amino acid residues in proteins is the contact potential, which defines the effective free energy of a protein conformation by a set of amino acid contacts formed in this conformation. Finding a contact potential capable of predicting free energies of protein states across a variety of protein families will aid protein folding and engineering in silico on a computationally tractable time-scale. We test the ability of contact potentials to accurately and transferably (across various protein families) predict stability changes of proteins upon mutations. We develop a new methodology to determine the contact potentials in proteins from experimental measurements of changes in protein's thermodynamic stabilities (DeltaDeltaG) upon mutations. We apply our methodology to derive sets of contact interaction parameters for a hierarchy of interaction models including solvation and multi-body contact parameters. We test how well our models reproduce experimental measurements by statistical tests. We evaluate the maximum accuracy of predictions obtained by using contact potentials and the correlation between parameters derived from different data-sets of experimental (DeltaDeltaG) values. We argue that it is impossible to reach experimental accuracy and derive fully transferable contact parameters using the contact models of potentials. However, contact parameters may yield reliable predictions of DeltaDeltaG for datasets of mutations confined to the same amino acid positions in the sequence of a single protein.

[1]  K Nishikawa,et al.  Experimental verification of the 'stability profile of mutant protein' (SPMP) data using mutant human lysozymes. , 1999, Protein engineering.

[2]  H. Scheraga,et al.  Accessible surface areas as a measure of the thermodynamic parameters of hydration of peptides. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[3]  J R Desjarlais,et al.  Computational protein design. , 2001, Current opinion in chemical biology.

[4]  W E Stites,et al.  In a staphylococcal nuclease mutant the side-chain of a lysine replacing valine 66 is fully buried in the hydrophobic core. , 1991, Journal of molecular biology.

[5]  D. Shortle The denatured state (the other half of the folding equation) and its role in protein stability , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[6]  Sv Silvia Nedea,et al.  Dynamic Monte Carlo Simulations , 1999 .

[7]  D. Shortle,et al.  Contributions of the ionizable amino acids to the stability of staphylococcal nuclease. , 1996, Biochemistry.

[8]  Michael R. Shirts,et al.  Simulation of folding of a small alpha-helical protein in atomistic detail using worldwide-distributed computing. , 2002, Journal of molecular biology.

[9]  J R Banavar,et al.  Learning effective amino acid interactions through iterative stochastic techniques , 2000, Proteins.

[10]  Zhou,et al.  First-Order Disorder-to-Order Transition in an Isolated Homopolymer Model. , 1996, Physical review letters.

[11]  Contributions of the ionizable amino acids to the stability of staphylococcal nuclease. , 1996 .

[12]  D Eisenberg,et al.  Hydrophobicity and amphiphilicity in protein structure , 1986, Journal of cellular biochemistry.

[13]  E. Shakhnovich,et al.  Understanding hierarchical protein evolution from first principles. , 2001, Journal of molecular biology.

[14]  A. Fersht,et al.  Rationally designing the accumulation of a folding intermediate of barnase by protein engineering. , 1993, Biochemistry.

[15]  Yu-Dong Cai,et al.  Is it a paradox or misinterpretation? , 2001, Proteins.

[16]  G P Zhou,et al.  Some insights into protein structural class prediction , 2001, Proteins.

[17]  L. H. Bradley,et al.  Protein design by binary patterning of polar and nonpolar amino acids. , 1993, Methods in molecular biology.

[18]  Seno,et al.  Optimal Protein Design Procedure. , 1996, Physical review letters.

[19]  William H. Press,et al.  Numerical recipes in C , 2002 .

[20]  N. Go Theoretical studies of protein folding. , 1983, Annual review of biophysics and bioengineering.

[21]  H Oschkinat,et al.  Improving the refolding yield of interleukin-4 through the optimization of local interactions. , 2000, Journal of biotechnology.

[22]  D. Shortle Denatured states of proteins and their roles in folding and stability , 1993 .

[23]  William H. Press,et al.  Numerical Recipes in C, 2nd Edition , 1992 .

[24]  J Skolnick,et al.  Dynamic Monte Carlo simulations of globular protein folding. Model studies of in vivo assembly of four helix bundles and four member beta-barrels. , 1990, Journal of molecular biology.

[25]  Deutsch,et al.  New algorithm for protein design. , 1995, Physical review letters.

[26]  H. Stanley,et al.  Discrete molecular dynamics studies of the folding of a protein-like model. , 1998, Folding & design.

[27]  K. Chou A novel approach to predicting protein structural classes in a (20–1)‐D amino acid composition space , 1995, Proteins.

[28]  G. Crippen,et al.  Contact potential that recognizes the correct folding of globular proteins. , 1992, Journal of molecular biology.

[29]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[30]  Eytan Domany,et al.  Protein folding in contact map space , 2000 .

[31]  Amos Maritan,et al.  Extraction of interaction potentials between amino acids from native protein structures , 2000 .

[32]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[33]  Protein folding: Think globally, (inter)act locally , 1998, Current Biology.

[34]  W E Stites,et al.  Contributions of the large hydrophobic amino acids to the stability of staphylococcal nuclease. , 1990, Biochemistry.

[35]  Werner Braun,et al.  Exact and efficient analytical calculation of the accessible surface areas and their gradients for macromolecules , 1998, J. Comput. Chem..

[36]  C. Tanford Macromolecules , 1994, Nature.

[37]  E. Shakhnovich Theoretical studies of protein-folding thermodynamics and kinetics. , 1997, Current opinion in structural biology.

[38]  D Baker,et al.  Simplified proteins: minimalist solutions to the 'protein folding problem'. , 1998, Current opinion in structural biology.

[39]  Akinori Sarai,et al.  ProTherm: Thermodynamic Database for Proteins and Mutants , 1999, Nucleic Acids Res..

[40]  J. Onuchic,et al.  The energy landscape theory of protein folding: insights into folding mechanisms and scenarios. , 2000, Advances in protein chemistry.

[41]  A. Tropsha,et al.  Four-body potentials reveal protein-specific correlations to stability changes caused by hydrophobic core mutations. , 2001, Journal of molecular biology.

[42]  J. Onuchic,et al.  Theory of protein folding: the energy landscape perspective. , 1997, Annual review of physical chemistry.

[43]  A. Roitberg,et al.  All-atom structure prediction and folding simulations of a stable protein. , 2002, Journal of the American Chemical Society.

[44]  K. Chou,et al.  Prediction of protein structural classes. , 1995, Critical reviews in biochemistry and molecular biology.

[45]  H W Hellinga,et al.  Rational protein design: combining theory and experiment. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[46]  V. Pande,et al.  Absolute comparison of simulated and experimental protein-folding dynamics , 2002, Nature.

[47]  M. Karplus,et al.  Effective energy functions for protein structure prediction. , 2000, Current opinion in structural biology.

[48]  E. Shakhnovich,et al.  Side‐chain dynamics and protein folding , 2001, Proteins.

[49]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[50]  M. Edgell,et al.  High-precision, high-throughput stability determinations facilitated by robotics and a semiautomated titrating fluorometer. , 2003, Biochemistry.

[51]  E. Domany,et al.  Pairwise contact potentials are unsuitable for protein folding , 1998 .

[52]  M. Levitt,et al.  Protein folding: the endgame. , 1997, Annual review of biochemistry.

[53]  Vijay S. Pande,et al.  Heteropolymer freezing and design: Towards physical models of protein folding , 2000 .

[54]  D. Shortle,et al.  Persistence of Native-Like Topology in a Denatured Protein in 8 M Urea , 2001, Science.

[55]  Y N Vorobjev,et al.  SIMS: computation of a smooth invariant molecular surface. , 1997, Biophysical journal.

[56]  S. Wodak,et al.  Automatic procedures for protein design. , 2001, Combinatorial chemistry & high throughput screening.

[57]  K. Chou,et al.  Prediction and classification of domain structural classes , 1998, Proteins.

[58]  D. Baker,et al.  Native protein sequences are close to optimal for their structures. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[59]  K. Dill,et al.  Inverse protein folding problem: designing polymer sequences. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[60]  M. Karplus,et al.  Folding of a model three-helix bundle protein: a thermodynamic and kinetic analysis. , 1999, Journal of molecular biology.

[61]  E I Shakhnovich,et al.  Protein design: a perspective from simple tractable models , 1998, Folding & design.

[62]  D Thirumalai,et al.  Factors governing the foldability of proteins , 1996, Proteins.

[63]  William Swope,et al.  Understanding folding and design: Replica-exchange simulations of ``Trp-cage'' miniproteins , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Nicolas E. Buchler,et al.  Effect of alphabet size and foldability requirements on protein structure designability , 1999, Proteins.

[65]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[66]  S. L. Mayo,et al.  De novo protein design: fully automated sequence selection. , 1997, Science.

[67]  Ke Fan,et al.  What is the minimum number of letters required to fold a protein? , 2003, Journal of molecular biology.

[68]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[69]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[70]  H. Stanley,et al.  Direct molecular dynamics observation of protein folding transition state ensemble. , 2002, Biophysical journal.

[71]  Hongyi Zhou,et al.  Stability scale and atomic solvation parameters extracted from 1023 mutation experiments , 2002, Proteins.

[72]  C. DeLisi,et al.  Prediction of protein structural class from the amino acid sequence , 1986, Biopolymers.

[73]  W. Jin,et al.  De novo design of foldable proteins with smooth folding funnel: automated negative design and experimental verification. , 2003, Structure.

[74]  W E Stites,et al.  Higher-order packing interactions in triple and quadruple mutants of staphylococcal nuclease. , 2001, Biochemistry.

[75]  P Prabakaran,et al.  Thermodynamic databases for proteins and protein-nucleic acid interactions. , 2001, Biopolymers.

[76]  W E Stites,et al.  Energetics of side chain packing in staphylococcal nuclease assessed by exchange of valines, isoleucines, and leucines. , 2001, Biochemistry.

[77]  M Levitt,et al.  From structure to sequence and back again. , 1996, Journal of molecular biology.

[78]  Cecilia Clementi,et al.  FOLDING, DESIGN, AND DETERMINATION OF INTERACTION POTENTIALS USING OFF-LATTICE DYNAMICS OF MODEL HETEROPOLYMERS , 1998 .

[79]  N. V. Dokholyan What is the protein design alphabet? , 2004, Proteins.

[80]  D. Shortle,et al.  Residual structure in large fragments of staphylococcal nuclease: effects of amino acid substitutions. , 1989, Biochemistry.