Pattern recognition in the prediction of protein structural class

Algorithms that predict the secondary structure of individual amino acids based solely on their local sequence environment obtain at best 60% to 65% accuracy. The approach presented uses more global information as input to two methodologies, a modified Euclidean statistical clustering algorithm and a three-layer backpropagation network. Input to both these methods consists of the normalized frequency of the 20 amino acids as well as the frequency of six hydrophobic amino acid patterns. The average predictive accuracy for all test set proteins using the Euclidean statistic was 74.0%. The backpropagation network correctly predicted 76.2% of the test proteins. These results show that there exist patterns in the protein primary sequence that are useful for the prediction of protein structural class by certain statistical clustering algorithms and neural networks.<<ETX>>

[1]  I D Kuntz,et al.  Amino acid composition and hydrophobicity patterns of protein domains correlate with their structures , 1985, Biopolymers.

[2]  C. Epstein,et al.  The Genetic Control of Tertiary Protein Structure: Studies With Model Systems , 1963 .

[3]  J. Mesirov,et al.  Hybrid system for protein secondary structure prediction. , 1992, Journal of molecular biology.

[4]  K. Chou,et al.  An optimization approach to predicting protein structural class from amino acid composition , 1992, Protein science : a publication of the Protein Society.

[5]  G. Rose,et al.  Hydrophobicity of amino acid residues in globular proteins. , 1985, Science.

[6]  P. Y. Chou,et al.  Prediction of Protein Structural Classes from Amino Acid Compositions , 1989 .

[7]  Winona C. Barker,et al.  Protein sequence database. , 1990 .

[8]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[9]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[10]  M. Karplus,et al.  Protein secondary structure prediction with a neural network. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[11]  V. Lim Algorithms for prediction of α-helical and β-structural regions in globular proteins , 1974 .

[12]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[13]  J. Garnier,et al.  Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. , 1978, Journal of molecular biology.

[14]  G Deléage,et al.  An algorithm for protein secondary structure prediction based on class prediction. , 1987, Protein engineering.

[15]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[16]  T. Sejnowski,et al.  Predicting the secondary structure of globular proteins using neural network models. , 1988, Journal of molecular biology.

[17]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[18]  C. DeLisi,et al.  Prediction of protein structural class from the amino acid sequence , 1986, Biopolymers.

[19]  J. Gibrat,et al.  Secondary structure prediction: combination of three different methods. , 1988, Protein engineering.

[20]  G. Fasman Prediction of Protein Structure and the Principles of Protein Conformation , 2012, Springer US.

[21]  W. Kabsch,et al.  How good are predictions of protein secondary structure? , 1983, FEBS letters.

[22]  J. Gibrat,et al.  Further developments of protein secondary structure prediction using information theory. New parameters and consideration of residue pairs. , 1987, Journal of molecular biology.

[23]  T. Kohonen,et al.  Statistical pattern recognition with neural networks: benchmarking studies , 1988, IEEE 1988 International Conference on Neural Networks.

[24]  M. Schiffer,et al.  Use of helical wheels to represent the structures of proteins and to identify segments with helical potential. , 1967, Biophysical journal.

[25]  P. Klein,et al.  Prediction of protein structural class by discriminant analysis. , 1986, Biochimica et biophysica acta.

[26]  C. Anfinsen,et al.  The kinetics of formation of native ribonuclease during oxidation of the reduced polypeptide chain. , 1961, Proceedings of the National Academy of Sciences of the United States of America.

[27]  R Langridge,et al.  Improvements in protein secondary structure prediction by an enhanced neural network. , 1990, Journal of molecular biology.