Protein alpha -helix region prediction based on stochastic-rule learning

The authors apply their previously introduced (1992) method (the MY method) to alpha -helix region prediction for a variety of proteins which are randomly selected from the Brookhaven Protein Data Bank. The MY method produces a stochastic rule which assigns, to any region in a sequence, the probability that it is an alpha -helix region. Optimal stochastic rules are obtained by using Laplace estimation of real-valued parameters and the minimum description length principle. The experimental results show that the MY method achieved an average prediction accuracy rate of more than 70%, on more than 3000 residues in the test sequences, even when only hemoglobin sequences were used to generate examples of alpha -helix regions.<<ETX>>

[1]  F. Schreiber,et al.  The Bayes-Laplace statistic of the multinomial distribution , 1985 .

[2]  W. Taylor,et al.  The classification of amino acid conservation. , 1986, Journal of theoretical biology.

[3]  Kenji Yamanishi,et al.  A learning criterion for stochastic rules , 1990, COLT '90.

[4]  J. Rissanen Stochastic Complexity in Statistical Inquiry Theory , 1989 .

[5]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[6]  T. Sejnowski,et al.  Predicting the secondary structure of globular proteins using neural network models. , 1988, Journal of molecular biology.

[7]  R. M. Abarbanel,et al.  Turn prediction in proteins using a pattern-matching approach. , 1986, Biochemistry.

[8]  Kenji Yamanishi,et al.  Protein Secondary Structure Prediction Based on Stochastic-Rule Learning , 1992, ALT.

[9]  K. Nagano,et al.  Triplet information in helix prediction applied to the analysis of super-secondary structures. , 1977, Journal of molecular biology.

[10]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[11]  J. Garnier,et al.  Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. , 1978, Journal of molecular biology.

[12]  V. Lim Algorithms for prediction of α-helical and β-structural regions in globular proteins , 1974 .

[13]  M J Sternberg,et al.  Machine learning approach for the prediction of protein secondary structure. , 1990, Journal of molecular biology.

[14]  P. Y. Chou,et al.  Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins. , 1974, Biochemistry.

[15]  S. Brunak,et al.  Protein secondary structure and homology by neural networks The α‐helices in rhodopsin , 1988 .

[16]  G. Fasman Prediction of Protein Structure and the Principles of Protein Conformation , 2012, Springer US.

[17]  M. Karplus,et al.  Protein secondary structure prediction with a neural network. , 1989, Proceedings of the National Academy of Sciences of the United States of America.