An assessment of protein secondary structure prediction methods based on amino acid sequence.

Five of the several secondary structure prediction methods based on protein amino acid sequence has been computerized, allowing the calculation of joint prediction histograms which have been shown to be superior to any individual prediction. The known structures of about 40 proteins experimentally determined by X-ray crystallography are compared with the predictions resulting from calculated histograms. The accuracy of the predictions for helices is generally much better than for both beta-sheet regions and for turns. The overall agreement between prediction and observation within the amino terminal half of the protein molecules is clearly superior to that for the carboxyl half, suggesting an amino nucleating core. Predictions for smaller proteins and thermally stable proteins are generally good, indicating the sensitivity of the methods to short-range but not long-range interactions. In less than half the cases tested were the predictions useful; there was no way of knowing ahead of time if a favorable prediction would result. Given the lack of dramatic improvement with an increase in data base for the schemes and the generally poor agreement factors, it appears that a perfect predictive algorithm must include a consideration of energy minimization, thermalization, and long-range interactions. Extreme caution is suggested in applying present prediction routines to unknown protein structures.

[1]  D. Blow Structure and Substrate Specificity in the Serine Enzymes , 1974 .

[2]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[3]  V. Lim Algorithms for prediction of α-helical and β-structural regions in globular proteins , 1974 .

[4]  J. Wootton The coenzyme-binding domains of glutamate dehydrogenases , 1974, Nature.

[5]  C. D. Barry,et al.  Comparison of predicted and experimentally determined secondary structure of adenyl kinase , 1974, Nature.

[6]  C. Fullmer,et al.  Calcium-binding protein of bovine intestine. The complete amino acid sequence. , 1975, The Journal of biological chemistry.

[7]  P. Y. Chou,et al.  Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins. , 1974, Biochemistry.

[8]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[9]  A Ikai,et al.  Kinetics of unfolding and refolding of proteins. II. Results for cytochrome c. , 1973, Journal of molecular biology.

[10]  Michael G. Rossmann,et al.  X-Ray Studies of Protein Interactions , 1974 .

[11]  K. Nagano Logical analysis of the mechanism of protein folding. I. Predictions of helices, loops and beta-structures from primary structure. , 1973, Journal of molecular biology.

[12]  K. Nagano,et al.  Logical analysis of the mechanism of protein folding II. The nucleation process. , 1974, Journal of molecular biology.

[13]  H. Scheraga,et al.  Analysis of Conformations of Amino Acid Residues and Prediction of Backbone Topography in Proteins , 1974 .

[14]  R. Kretsinger,et al.  Carp muscle calcium-binding protein. II. Structure determination and general description. , 1973, The Journal of biological chemistry.

[15]  R. Huber,et al.  Crystal and molecular structure of a dimer composed of the variable portions of the Bence-Jones protein REI. , 1974, European journal of biochemistry.

[16]  V. Lim Structural principles of the globular organization of protein chains. A stereochemical theory of globular protein secondary structure. , 1974, Journal of molecular biology.

[17]  C B Anfinsen,et al.  The formation and stabilization of protein structure. , 1972, The Biochemical journal.

[18]  P. Y. Chou,et al.  Conformational prediction and circular dichroism studies on the lac repressor. , 1975, Journal of molecular biology.

[19]  A. Mclachlan,et al.  Structural homology of myosin alkali light chains, troponin C and carp calcium binding protein , 1974, Nature.

[20]  R. Kretsinger,et al.  Troponin and parvalbumin calcium binding regions predicted in myosin light chain and T4 lysozyme. , 1975, Science.

[21]  M. Levitt,et al.  Computer simulation of protein folding , 1975, Nature.

[22]  G. Petsko,et al.  Structure of chicken muscle triose phosphate isomerase determined crystallographically at 2.5Å resolution: using amino acid sequence data , 1975, Nature.