The prediction of protein secondary structure with a Cascade Correlation Learning Architecture of neural networks

A Cascade Correlation Learning Architecture (CCLA) of neural networks is tested on the task of predicting the secondary structure of proteins. The results are compared with those obtained with Neural Networks (NN) trained with the back-propagation algorithm (BPNN) and generated with genetic algorithms. CCLA proceeds towards the global minimum of the error function more efficiently than BPNN. However, only a slight improvement in the average efficiency value is noticeable (61.82% as compared with 61.61% obtained with BPNN). The values of the three correlation coefficients for the discriminated secondary structures are also rather similar (Ct8,Cα,Cβ and Ccoil are 0.36, 0.29 and 0.36 with CCLA, and 0.36, 0.31 and 0.35 with BPNN). This indicates that the efficiency of the prediction does not depend upon the training algorithm, and confirms our previous observation that when single sequences are used as input code to the network system, different NN architectures can perform similarly.

[1]  P Stolorz,et al.  Predicting protein secondary structure using neural net and statistical methods. , 1992, Journal of molecular biology.

[2]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[3]  Steven M. Muskal,et al.  Predicting protein secondary structure content. A tandem neural network approach. , 1992, Journal of molecular biology.

[4]  Piero Fariselli,et al.  LGANN: a parallel system combining a local genetic algorithm and neural networks for the prediction of secondary structure of proteins , 1995, Comput. Appl. Biosci..

[5]  Stefano Pascarella,et al.  PRONET: a microcomputer program for predicting the secondary structure of proteins with a neural network , 1989, Comput. Appl. Biosci..

[6]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[7]  Georg E. Schulz,et al.  Principles of Protein Structure , 1979 .

[8]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[9]  Chris Bishop,et al.  Exact Calculation of the Hessian Matrix for the Multilayer Perceptron , 1992, Neural Computation.

[10]  Burkhard Rost,et al.  PHD - an automatic mail server for protein secondary structure prediction , 1994, Comput. Appl. Biosci..

[11]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[12]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[13]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[14]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[15]  M. Karplus,et al.  Protein secondary structure prediction with a neural network. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[16]  M J Sternberg,et al.  Prediction of structural and functional features of protein and nucleic acid sequences by artificial neural networks. , 1992, Biochemistry.

[17]  Piero Fariselli,et al.  Predicting secondary structures of membrane proteins with neural networks , 2004, European Biophysics Journal.

[18]  Scott R. Presnell,et al.  Artificial neural networks for pattern recognition in biochemical sequences. , 1993, Annual review of biophysics and biomolecular structure.

[19]  T. Sejnowski,et al.  Predicting the secondary structure of globular proteins using neural network models. , 1988, Journal of molecular biology.

[20]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[21]  B. Rost,et al.  Redefining the goals of protein secondary structure prediction. , 1994, Journal of molecular biology.

[22]  R Langridge,et al.  Improvements in protein secondary structure prediction by an enhanced neural network. , 1990, Journal of molecular biology.

[23]  D. Lipman,et al.  Rapid and sensitive protein similarity searches. , 1985, Science.