Extracting contact energies from protein structures: A study using a simplified model

In this study, we exploited an elementary 2‐dimensional square lattice model of HP polymers to test the premise of extracting contact energies from protein structures. Given a set of prespecified energies for H–H, H–P, and P–P contacts, all possible sequences of various lengths were exhaustively enumerated to find sequences that have unique lowest‐energy conformations. The lowest‐energy structures (or native structures) of such (native) sequences were used to extract contact energies using the Miyazawa‐Jernigan procedure and here‐defined reference state. The relative magnitudes of the original energies were restored reasonably well, but the extracted contact energies were independent of the absolute magnitudes of the initial energies. We turned to a more detailed characterization of the energy landscapes of the native sequences in light of a new theoretical framework on protein folding. Foldability of such sequences imposes two limits on the absolute value of the prespecified energies: a lower bound entailed by the minimum requirement for thermodynamic stability and an upper bound associated with the entrapment of the chain to local minima. We found that these two limits confine the prespecified energy values to a rather narrow range which, surprisingly, also contains the extracted energies in all the cases examined. These results indicate that the quasi‐chemical approximation can be used to connect quantitatively the occurrence of various residue–residue contacts in an ensemble of native structures with the energies of the contacts. More importantly, they suggest that the extracted contact energies do contain information on structural stability and can be used to estimate actual structural energetics. This study also encourages the use of structure‐derived contact energies in threading. The finding that there is a rather narrow range of energies that are optimal for folding a sequence also cautions the use of arbitrary energy Hamiltonion in minimal folding models. Proteins 31:299–308, 1998. © 1998 Wiley‐Liss, Inc.

[1]  R L Jernigan,et al.  Protein stability for single substitution mutants and the extent of local compactness in the denatured state. , 1994, Protein engineering.

[2]  R. Jernigan,et al.  Inter-residue potentials in globular proteins and the dominance of highly specific hydrophilic interactions at close separation. , 1997, Journal of molecular biology.

[3]  R. Friesner,et al.  Computer modeling of protein folding: conformational and energetic analysis of reduced and detailed protein models. , 1995, Journal of molecular biology.

[4]  M. Levitt,et al.  Exploring conformational space with a simple lattice model for protein structure. , 1994, Journal of molecular biology.

[5]  N. Wingreen,et al.  Emergence of Preferred Structures in a Simple Model of Protein Folding , 1996, Science.

[6]  Shoshana J. Wodak,et al.  Generating and testing protein folds , 1993 .

[7]  A. Godzik,et al.  Topology fingerprint approach to the inverse protein folding problem. , 1992, Journal of molecular biology.

[8]  M Karplus,et al.  Theoretical studies of protein folding and unfolding. , 1995, Current opinion in structural biology.

[9]  B. Derrida Random-energy model: An exactly solvable model of disordered systems , 1981 .

[10]  Temple F. Smith,et al.  Global optimum protein threading with gapped alignment and empirical pair score functions. , 1996, Journal of molecular biology.

[11]  G. Crippen,et al.  Contact potential that recognizes the correct folding of globular proteins. , 1992, Journal of molecular biology.

[12]  K. Dill,et al.  ‘‘Sequence space soup’’ of proteins and copolymers , 1991 .

[13]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[14]  M J Sippl,et al.  Knowledge-based potentials for proteins. , 1995, Current opinion in structural biology.

[15]  J L Cornette,et al.  Consistency in structural energetics of protein folding and peptide recognition , 1997, Protein science : a publication of the Protein Society.

[16]  Camacho Entropic Barriers, Frustration, and Order: Basic Ingredients in Protein Folding. , 1996, Physical review letters.

[17]  A. Finkelstein,et al.  Why do protein architectures have boltzmann‐like statistics? , 1995, Proteins.

[18]  K. Dill,et al.  Transition states and folding dynamics of proteins and heteropolymers , 1994 .

[19]  C DeLisi,et al.  Computational determination of side chain specificity for pockets in class I MHC molecules. , 1996, Molecular immunology.

[20]  D. Thirumalai,et al.  Modeling the role of disulfide bonds in protein folding: Entropic barriers and pathways , 1995, Proteins.

[21]  D. Covell,et al.  A role for surface hydrophobicity in protein‐protein recognition , 1994, Protein science : a publication of the Protein Society.

[22]  P. Wolynes,et al.  Spin glasses and the statistical mechanics of protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[23]  S. Wodak,et al.  Factors influencing the ability of knowledge-based potentials to identify native sequence-structure matches. , 1994, Journal of molecular biology.

[24]  M. Levitt,et al.  Energy functions that discriminate X-ray and near native folds from well-constructed decoys. , 1996, Journal of molecular biology.

[25]  J. Onuchic,et al.  Toward an outline of the topography of a realistic protein-folding funnel. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[26]  S. Sun,et al.  Reduced representation model of protein structure prediction: Statistical potential and genetic algorithms , 1993, Protein science : a publication of the Protein Society.

[27]  L A Mirny,et al.  How to derive a protein folding potential? A new approach to an old problem. , 1996, Journal of molecular biology.

[28]  J. Onuchic,et al.  KINETICS OF PROTEINLIKE MODELS : THE ENERGY LANDSCAPE FACTORS THAT DETERMINE FOLDING , 1995 .

[29]  K. Dill,et al.  Statistical potentials extracted from protein structures: how accurate are they? , 1996, Journal of molecular biology.

[30]  D. Covell,et al.  Conformations of folded proteins in restricted spaces. , 1990, Biochemistry.

[31]  D Thirumalai,et al.  Factors governing the foldability of proteins , 1996, Proteins.

[32]  M. Levitt A simplified representation of protein conformations for rapid simulation of protein folding. , 1976, Journal of molecular biology.

[33]  B. Lee,et al.  Protein folding by a biased Monte Carlo procedure in the dihedral angle space , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[34]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[35]  A. Godzik,et al.  Derivation and testing of pair potentials for protein folding. When is the quasichemical approximation correct? , 1997, Protein science : a publication of the Protein Society.

[36]  D Fischer,et al.  Assigning amino acid sequences to 3‐dimensional protein folds , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[37]  J. Onuchic,et al.  Protein folding funnels: a kinetic approach to the sequence-structure relationship. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[38]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[39]  C. DeLisi,et al.  Determination of atomic desolvation energies from the structures of crystallized proteins. , 1997, Journal of molecular biology.

[40]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[41]  J. Onuchic,et al.  Navigating the folding routes , 1995, Science.

[42]  R. Jernigan,et al.  Structure-derived potentials and protein simulations. , 1996, Current opinion in structural biology.

[43]  R. L. Baldwin The nature of protein folding pathways: The classical versus the new view , 1995, Journal of biomolecular NMR.

[44]  K. Dill,et al.  An iterative method for extracting energy-like quantities from protein structures. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[45]  K. Dill Theory for the folding and stability of globular proteins. , 1985, Biochemistry.

[46]  M. Karplus,et al.  How does a protein fold? , 1994, Nature.

[47]  J. Skolnick,et al.  Monte carlo simulations of protein folding. I. Lattice model and interaction scheme , 1994, Proteins.

[48]  J. Onuchic,et al.  Funnels, pathways, and the energy landscape of protein folding: A synthesis , 1994, Proteins.

[49]  P. Wolynes,et al.  Protein tertiary structure recognition using optimized Hamiltonians with local interactions. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[50]  P. Wolynes,et al.  Intermediates and barrier crossing in a random energy model , 1989 .

[51]  S. Bryant,et al.  An empirical energy function for threading protein sequence through the folding motif , 1993, Proteins.

[52]  D Thirumalai,et al.  Kinetics and thermodynamics of folding in model proteins. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[53]  J. Onuchic,et al.  Folding kinetics of proteinlike heteropolymers , 1994, cond-mat/9404001.

[54]  K. Dill,et al.  Folding kinetics of proteins and copolymers , 1992 .

[55]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[56]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.