Scoring functions in protein folding and design

We present an analysis of the assumptions behind some of the most commonly used methods for evaluating the goodness of the fit between a sequence and a structure. Our studies on a lattice model show that methods based on statistical considerations are easy to use and can capture some of the features of protein‐like sequences and their corresponding native states, but unfortunately are incapable of recognizing, with certainty, the native‐like conformation of a sequence among a set of decoys. Meanwhile, an optimization method, entailing the determination of the parameters of an effective free energy of interaction, is much more reliable in recognizing the native state of a sequence. However, the statistical method is shown to perform quite well in tests of protein design.

[1]  J R Banavar,et al.  Interaction potentials for protein folding , 1998, Proteins.

[2]  E. Domany,et al.  Pairwise contact potentials are unsuitable for protein folding , 1998 .

[3]  Cecilia Clementi,et al.  FOLDING, DESIGN, AND DETERMINATION OF INTERACTION POTENTIALS USING OFF-LATTICE DYNAMICS OF MODEL HETEROPOLYMERS , 1998 .

[4]  M. Levitt,et al.  Energy functions that discriminate X-ray and near native folds from well-constructed decoys. , 1996, Journal of molecular biology.

[5]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[6]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[7]  J. Skolnick,et al.  Monte carlo simulations of protein folding. I. Lattice model and interaction scheme , 1994, Proteins.

[8]  S. Doniach,et al.  A computer model to dynamically simulate protein folding: Studies with crambin , 1989, Proteins.

[9]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[10]  D. Thirumalai,et al.  Pair potentials for protein folding: Choice of reference states and sensitivity of predicted native states to variations in the interaction schemes , 2008, Protein science : a publication of the Protein Society.

[11]  G. Crippen,et al.  Contact potential that recognizes the correct folding of globular proteins. , 1992, Journal of molecular biology.

[12]  G. Casari,et al.  Identification of native protein folds amongst a large number of incorrect models. The calculation of low energy conformations from potentials of mean force. , 1990, Journal of molecular biology.

[13]  K. Dill,et al.  The Protein Folding Problem , 1993 .

[14]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[15]  D. Baker,et al.  Improved recognition of native‐like protein structures using a combination of sequence‐dependent and sequence‐independent features of proteins , 1999, Proteins.

[16]  L A Mirny,et al.  How to derive a protein folding potential? A new approach to an old problem. , 1996, Journal of molecular biology.

[17]  R. Sauer,et al.  Evolution of a protein fold in vitro. , 1999, Science.

[18]  M. Hao,et al.  Designing potential energy functions for protein folding. , 1999, Current opinion in structural biology.

[19]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[20]  A. Kolinski,et al.  Simulations of the Folding of a Globular Protein , 1990, Science.

[21]  M. Hao,et al.  How optimization of potential functions affects protein folding. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[22]  K. Dill,et al.  Statistical potentials extracted from protein structures: how accurate are they? , 1996, Journal of molecular biology.

[23]  H. Scheraga,et al.  Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. , 1976, Macromolecules.

[24]  P. Wolynes,et al.  Optimal protein-folding codes from spin-glass theory. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[25]  S Doniach,et al.  Computer simulation of antibody binding specificity , 1993, Proteins.

[26]  J Skolnick,et al.  How do potentials derived from structural databases relate to “true” potentials? , 1998, Protein science : a publication of the Protein Society.

[27]  K. Dill,et al.  An iterative method for extracting energy-like quantities from protein structures. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[28]  M J Rooman,et al.  Are database-derived potentials valid for scoring both forward and inverted protein folding? , 1995, Protein engineering.

[29]  Suganthi Balasubramanian,et al.  Protein alchemy: Changing β-sheet into α-helix , 1997, Nature Structural Biology.

[30]  S. Sun,et al.  Reduced representation model of protein structure prediction: Statistical potential and genetic algorithms , 1993, Protein science : a publication of the Protein Society.

[31]  A. Godzik,et al.  A general method for the prediction of the three dimensional structure and folding pathway of globular proteins: Application to designed helical proteins , 1993 .

[32]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[33]  Chao Zhang,et al.  Extracting contact energies from protein structures: A study using a simplified model , 1998, Proteins.

[34]  R A Goldstein,et al.  Optimizing energy potentials for success in protein tertiary structure prediction. , 1998, Folding & design.

[35]  E. Shakhnovich,et al.  Conserved residues and the mechanism of protein folding , 1996, Nature.

[36]  P. Wolynes,et al.  Self-consistently optimized energy functions for protein structure prediction by molecular dynamics. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[37]  P. Wolynes,et al.  Self‐consistently optimized statistical mechanical energy functions for sequence structure alignment , 1996, Protein science : a publication of the Protein Society.

[38]  M. Sippl,et al.  Detection of native‐like models for amino acid sequences of unknown three‐dimensional structure in a data base of known protein conformations , 1992, Proteins.

[39]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[40]  D. Eisenberg,et al.  Three-dimensional profiles from residue-pair preferences: identification of sequences with beta/alpha-barrel fold. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Cecilia Clementi,et al.  Determination of interaction potentials of amino acids from native protein structures: Tests on simple lattice models , 1999 .

[42]  Harold A. Scheraga,et al.  Optimizing Potential Functions for Protein Folding , 1996 .

[43]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[44]  S. Bryant,et al.  An empirical energy function for threading protein sequence through the folding motif , 1993, Proteins.

[45]  A. Godzik,et al.  Derivation and testing of pair potentials for protein folding. When is the quasichemical approximation correct? , 1997, Protein science : a publication of the Protein Society.

[46]  P. Wolynes,et al.  Protein tertiary structure recognition using optimized Hamiltonians with local interactions. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[47]  R. Jernigan,et al.  Self‐consistent estimation of inter‐residue protein contact energies based on an equilibrium mixture approximation of residues , 1999, Proteins.

[48]  A. Maritan,et al.  Protein Design in a Lattice Model of Hydrophobic and Polar Amino Acids , 1998 .

[49]  T. Creighton Proteins: Structures and Molecular Properties , 1986 .

[50]  C. DeLisi,et al.  Determination of atomic desolvation energies from the structures of crystallized proteins. , 1997, Journal of molecular biology.

[51]  W. Krauth,et al.  Learning algorithms with optimal stability in neural networks , 1987 .

[52]  N. Wingreen,et al.  Emergence of Preferred Structures in a Simple Model of Protein Folding , 1996, Science.

[53]  G. K. Gilbert DRAINAGE SYSTEM AND LOESS DISTRIBUTION OF EASTERN IOWA. , 1883, Science.