Peptides secondary structure prediction with neural networks: a criterion for building appropriate learning sets

Artificial neural networks have been recently applied with success for protein secondary structure prediction. So far, one of the two main aspects on which neural net performance depends, the topology of the net, has been considered. The present work addresses the other main aspect, the building up of the learning set. The author presents a criterion to build up suitable learning sets based on the alpha -helix percentage. Starting from a set of several well known proteins, the author formed 7 groups of proteins with similar helix percentages and used them for the learning of the same neural net. The author found that the best secondary structure prediction for each of the tested proteins (not belonging to the initial set) was the one obtained using the learning set whose helix percentage was closest to that of the tested protein. The accuracy of correct prediction of the author's method on 3 types of secondary structure ( alpha -helix, beta -sheet and coil), has been compared with the accuracy of other secondary structure prediction methods.<<ETX>>

[1]  T. Sejnowski,et al.  Predicting the secondary structure of globular proteins using neural network models. , 1988, Journal of molecular biology.

[2]  P. Y. Chou,et al.  Empirical predictions of protein conformation. , 1978, Annual review of biochemistry.

[3]  A. Gronenborn,et al.  The polypeptide fold of the globular domain of histone H5 in solution. A study using nuclear magnetic resonance, distance geometry and restrained molecular dynamics. , 1987, The EMBO journal.

[4]  V. Lim Algorithms for prediction of α-helical and β-structural regions in globular proteins , 1974 .

[5]  J. Garnier,et al.  Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. , 1978, Journal of molecular biology.

[6]  W. Kabsch,et al.  How good are predictions of protein secondary structure? , 1983, FEBS letters.

[7]  G. Clore,et al.  Nuclear magnetic resonance study of the globular domain of chicken histone H5: resonance assignment and secondary structure. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[8]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[9]  M. Karplus,et al.  Protein secondary structure prediction with a neural network. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[10]  R. Sacile,et al.  Peptides Secondary Structure Prediction With Neural Networks Using "Weak Homologies" , 1991, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society Volume 13: 1991.

[11]  J. Richardson,et al.  Amino acid preferences for specific locations at the ends of alpha helices. , 1988, Science.

[12]  Malcolm J. McGregor,et al.  Prediction of ?-turns in proteins using neural network , 1989 .

[13]  R Langridge,et al.  Improvements in protein secondary structure prediction by an enhanced neural network. , 1990, Journal of molecular biology.

[14]  Janet M. Thornton,et al.  Prediction of protein structure from amino acid sequence , 1978, Nature.

[15]  K Nishikawa,et al.  Amino acid sequence homology applied to the prediction of protein secondary structures, and joint prediction with existing methods. , 1986, Biochimica et biophysica acta.

[16]  S. Brunak,et al.  Protein secondary structure and homology by neural networks The α‐helices in rhodopsin , 1988 .

[17]  M. Sternberg,et al.  Prediction of protein secondary structure and active sites using the alignment of homologous sequences. , 1987, Journal of molecular biology.

[18]  G. Rose,et al.  Helix signals in proteins. , 1988, Science.

[19]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[20]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[21]  Barry Robson,et al.  An algorithm for secondary structure determination in proteins based on sequence similarity , 1986, FEBS letters.

[22]  R M Sweet,et al.  Evolutionary similarity among peptide segments is a basis for prediction of protein folding , 1986, Biopolymers.

[23]  G Rauch A secondary structure evaluation of peptides based on statistics and heuristic rules. , 1991, Computer methods and programs in biomedicine.