A statistical mechanical method to optimize energy functions for protein folding.

We present a method for deriving energy functions for protein folding by maximizing the thermodynamic average of the overlap with the native state. The method has been tested by using the pairwise contact approximation of the energy function and generating alternative structures by threading sequences over a database of 1, 169 structures. With the derived energy function, most native structures: (i) have minimal energy and (ii) are thermodynamically rather stable, and (iii) the corresponding energy landscapes are smooth. Precisely, 92% of the 1,013 x-ray structures are stabilized. Most failures can be attributed to the neglect of interactions between chains forming polychain proteins and of interactions with cofactors. When these are considered, only nine cases remain unexplained. In contrast, 38% of NMR structures are not assigned properly.

[1]  C. Sander,et al.  Errors in protein structures , 1996, Nature.

[2]  G. Crippen,et al.  Contact potential that recognizes the correct folding of globular proteins. , 1992, Journal of molecular biology.

[3]  D. Thirumalai,et al.  Pair potentials for protein folding: Choice of reference states and sensitivity of predicted native states to variations in the interaction schemes , 2008, Protein science : a publication of the Protein Society.

[4]  I. Kanter,et al.  STATISTICAL PROPERTIES OF CONTACT MAPS , 1998, cond-mat/9810285.

[5]  P. Wolynes,et al.  Spin glasses and the statistical mechanics of protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Cecilia Clementi,et al.  Determination of interaction potentials of amino acids from native protein structures: Tests on simple lattice models , 1999 .

[7]  A. Kolinski,et al.  Simulations of the Folding of a Globular Protein , 1990, Science.

[8]  Knapp,et al.  Protein dynamics with off-lattice Monte Carlo moves. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[9]  L A Mirny,et al.  Universality and diversity of the protein folding scenarios: a comprehensive analysis with the aid of a lattice model. , 1996, Folding & design.

[10]  P G Wolynes,et al.  Protein folding mechanisms and the multidimensional folding funnel , 1998, Proteins.

[11]  A. Maritan,et al.  STABILITY THRESHOLD AS A SELECTION PRINCIPLE FOR PROTEIN DESIGN , 1997, cond-mat/9704167.

[12]  Walter Nadler,et al.  Testing a new Monte Carlo algorithm for protein folding , 1998 .

[13]  P. Kollman,et al.  Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. , 1998, Science.

[14]  M Karplus,et al.  Factors that affect the folding ability of proteins , 1998, Proteins.

[15]  M Vendruscolo,et al.  Recovery of protein structure from contact maps. , 1997, Folding & design.

[16]  V. Pande,et al.  On the theory of folding kinetics for short proteins. , 1997, Folding & design.

[17]  M. Hao,et al.  Designing potential energy functions for protein folding. , 1999, Current opinion in structural biology.

[18]  P. Gennes Scaling Concepts in Polymer Physics , 1979 .

[19]  G. Casari,et al.  Identification of native protein folds amongst a large number of incorrect models. The calculation of low energy conformations from potentials of mean force. , 1990, Journal of molecular biology.

[20]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[21]  M Vendruscolo,et al.  Can a pairwise contact potential stabilize native protein folds against decoys obtained by threading? , 2000, Proteins.

[22]  A. Godzik,et al.  Derivation and testing of pair potentials for protein folding. When is the quasichemical approximation correct? , 1997, Protein science : a publication of the Protein Society.

[23]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[24]  P. Wolynes,et al.  Optimal protein-folding codes from spin-glass theory. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[25]  L A Mirny,et al.  How to derive a protein folding potential? A new approach to an old problem. , 1996, Journal of molecular biology.

[26]  R A Goldstein,et al.  On the thermodynamic hypothesis of protein folding. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[27]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[28]  J. Banavar,et al.  Master Equation Approach to Protein Folding and Kinetic Traps , 1998, cond-mat/9803019.

[29]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[30]  K. Dill,et al.  An iterative method for extracting energy-like quantities from protein structures. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[31]  E. Domany,et al.  Pairwise contact potentials are unsuitable for protein folding , 1998 .

[32]  P. Wolynes,et al.  Protein tertiary structure recognition using optimized Hamiltonians with local interactions. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[33]  M. Hao,et al.  How optimization of potential functions affects protein folding. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[34]  M Karplus,et al.  "New view" of protein folding reconciled with the old through multiple unfolding simulations. , 1997, Science.

[35]  U. Hobohm,et al.  Enlarged representative set of protein structures , 1994, Protein science : a publication of the Protein Society.

[36]  Eugene I. Shakhnovich,et al.  Free energy landscape for protein folding kinetics: Intermediates, traps, and multiple pathways in theory and lattice model simulations , 1994 .

[37]  M Vendruscolo,et al.  Protein folding using contact maps. , 1999, Vitamins and hormones.