Deriving amino acid contact potentials from their frequencies of occurrence in proteins: a lattice model study

The possibility of deriving the contact potentials between amino acids from their frequencies of occurrence in proteins is discussed in evolutionary terms. This approach allows the use of traditional thermodynamics to describe such frequencies and, consequently, to develop a strategy to include in the calculations correlations due to the spatial proximity of the amino acids and to their overall tendency of being conserved in proteins. Making use of a lattice model to describe protein chains and defining a 'true' potential, we test these strategies by selecting a database of folding model sequences, deriving the contact potentials from such sequences and comparing them with the 'true' potential. Taking into account correlations allows for a markedly better prediction of the interaction potentials.

[1]  A. Godzik,et al.  Are proteins ideal mixtures of amino acids? Analysis of energy parameter sets , 1995, Protein science : a publication of the Protein Society.

[2]  P. Wolynes,et al.  Intermediates and barrier crossing in a random energy model , 1989 .

[3]  E. Shakhnovich,et al.  Topological determinants of protein folding , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[4]  C. J. Adkins Thermodynamics and statistical mechanics , 1972, Nature.

[5]  A. Godzik,et al.  A general method for the prediction of the three dimensional structure and folding pathway of globular proteins: Application to designed helical proteins , 1993 .

[6]  E. Shakhnovich,et al.  Proteins with selected sequences fold into unique native conformation. , 1994, Physical review letters.

[7]  R. Broglia,et al.  Hierarchy of events in the folding of model proteins , 2001 .

[8]  E. Shakhnovich,et al.  A new approach to the design of stable proteins. , 1993, Protein engineering.

[9]  C. Tanford Macromolecules , 1994, Nature.

[10]  J. Skolnick,et al.  Monte carlo simulations of protein folding. I. Lattice model and interaction scheme , 1994, Proteins.

[11]  K. Dill,et al.  Statistical potentials extracted from protein structures: how accurate are they? , 1996, Journal of molecular biology.

[12]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[13]  Cecilia Clementi,et al.  Determination of interaction potentials of amino acids from native protein structures: Tests on simple lattice models , 1999 .

[14]  A. Finkelstein,et al.  Why do protein architectures have boltzmann‐like statistics? , 1995, Proteins.

[15]  L A Mirny,et al.  How to derive a protein folding potential? A new approach to an old problem. , 1996, Journal of molecular biology.

[16]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[17]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[18]  E. Shakhnovich,et al.  Analysis of knowledge‐based protein‐ligand potentials using a self‐consistent method , 2008, Protein science : a publication of the Protein Society.

[19]  J Skolnick,et al.  How do potentials derived from structural databases relate to “true” potentials? , 1998, Protein science : a publication of the Protein Society.

[20]  D. Thirumalai,et al.  Pair potentials for protein folding: Choice of reference states and sensitivity of predicted native states to variations in the interaction schemes , 2008, Protein science : a publication of the Protein Society.

[21]  E I Shakhnovich,et al.  Specific nucleus as the transition state for protein folding: evidence from the lattice model. , 1994, Biochemistry.

[22]  M Vendruscolo,et al.  Can a pairwise contact potential stabilize native protein folds against decoys obtained by threading? , 2000, Proteins.

[23]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[24]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[25]  B. Rost,et al.  Protein structures sustain evolutionary drift. , 1997, Folding & design.

[26]  A. Ben-Naim STATISTICAL POTENTIALS EXTRACTED FROM PROTEIN STRUCTURES : ARE THESE MEANINGFUL POTENTIALS? , 1997 .

[27]  B. Derrida Random-energy model: An exactly solvable model of disordered systems , 1981 .