Determination of interaction potentials of amino acids from native protein structures: Tests on simple lattice models

We propose a novel method for the determination of the effective interaction potential between the amino acids of a protein. The strategy is based on the combination of a new optimization procedure and a geometrical argument, which also uncovers the shortcomings of any optimization scheme. The strategy can be applied on any data set of native structures such as those available from the Protein Data Bank. In this work, however, we explain and test our approach on simple lattice models, where the true interactions are known a priori and a Model Protein Data Bank (MPDB) can be generated by identifying proteins as amino acid sequences having a unique ground state conformation among all possible conformations. Excellent agreement is obtained between the extracted and the true potentials even for modest numbers of protein structures in the MPDB. Comparisons with other methods are also discussed.

[1]  G M Crippen Easily searched protein folding potentials. , 1996, Journal of molecular biology.

[2]  C. Tanford Macromolecules , 1994, Nature.

[3]  Cecilia Clementi,et al.  FOLDING, DESIGN, AND DETERMINATION OF INTERACTION POTENTIALS USING OFF-LATTICE DYNAMICS OF MODEL HETEROPOLYMERS , 1998 .

[4]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[5]  K. Dill,et al.  Statistical potentials extracted from protein structures: how accurate are they? , 1996, Journal of molecular biology.

[6]  E I Shakhnovich,et al.  Design of proteins with selected thermal properties. , 1996, Folding & design.

[7]  M. Nadeau Proteins : Structure , Function , and Genetics , 2022 .

[8]  H. Scheraga,et al.  Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. , 1976, Macromolecules.

[9]  G. Crippen Prediction of protein folding from amino acid sequence over discrete conformation spaces. , 1991, Biochemistry.

[10]  L A Mirny,et al.  How to derive a protein folding potential? A new approach to an old problem. , 1996, Journal of molecular biology.

[11]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[12]  M. Karplus,et al.  How does a protein fold? , 1994, Nature.

[13]  W. Krauth,et al.  Learning algorithms with optimal stability in neural networks , 1987 .

[14]  Sebastian Doniach Statistical mechanics, protein structure, and protein substrate interactions , 1994 .

[15]  K. Dill,et al.  An iterative method for extracting energy-like quantities from protein structures. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[16]  P. Wolynes,et al.  Toward Protein Tertiary Structure Recognition by Means of Associative Memory Hamiltonians , 1989, Science.

[17]  M J Sippl,et al.  Progress in fold recognition , 1995, Proteins.

[18]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[19]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[20]  Søren Brunak,et al.  Protein structure by distance analysis , 1994 .

[21]  G. Crippen,et al.  Contact potential that recognizes the correct folding of globular proteins. , 1992, Journal of molecular biology.

[22]  R. Srinivasan,et al.  LINUS: A hierarchic procedure to predict the fold of a protein , 1995, Proteins.

[23]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[24]  J R Banavar,et al.  Interaction potentials for protein folding , 1998, Proteins.

[25]  G. Crippen,et al.  Learning about protein folding via potential functions , 1994, Proteins.