Comparative Analysis using Bayesian Approach to Neural Network of Translational Initiation Sites in Alternative Polymorphic Context

Widely accepted as an important signal for gene discovery, translation initiation sites (TIS) in weak context has been the main focus in this paper. Many TIS prediction programs have been developed for optimal context, but they fail to successfully predict the start codon if the contexts conditions are in weak positions. The objectives of this paper are to develop useful algorithms and to build a new classification model for the case study. The first approach of neural network includes training on algorithms of Resilient Backpropagation, Scaled Conjugate Gradient Backpropagation and Levenberg-Marquardt. The outputs are used in comparison with Bayesian Neural Network for efficiency comparison. The results showed that Resilient Backpropagation have the consistency in all measurement but performs less in accuracy. In second approach, the Bayesian Classifier_01 outperforms the Resilient Backpropagation by successfully increasing the overall prediction accuracy by 16.0%. The Bayesian Classifier_02 is built to improve the accuracy by adding new features of chemical properties as selected by the Information Gain Ratio method, and increasing the length of the window sequence to 201. The result shows that the built model successfully increases the accuracy by 96.0%. In comparison, the Bayesian model outperforms Tikole and Sankararamakrishnan (2008) by increasing the sensitivity by 10% and specificity by 26%.

[1]  Radford M. Neal Probabilistic Inference Using Markov Chain Monte Carlo Methods , 2011 .

[2]  Eukaryotic transcriptional control. , 1999 .

[3]  K. Weigel,et al.  Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat , 2011, BMC Genetics.

[4]  James W. Fickett,et al.  The Gene Identification Problem: An Overview for Developers , 1995, Comput. Chem..

[5]  Volker Brendel,et al.  MetWAMer: eukaryotic translation initiation site prediction , 2008, BMC Bioinformatics.

[6]  R. Kornberg,et al.  Eukaryotic transcriptional control. , 1999, Trends in cell biology.

[7]  M. Kozak An analysis of 5'-noncoding sequences from 699 vertebrate messenger RNAs. , 1987, Nucleic acids research.

[8]  A. Kochetov AUG codons at the beginning of protein coding sequences are frequent in eukaryotic mRNAs with a suboptimal start codon context , 2005, Bioinform..

[9]  Ramasubbu Sankararamakrishnan,et al.  Prediction of translation initiation sites in human mRNA sequences with AUG start codon in weak Kozak context: A neural network approach. , 2008, Biochemical and biophysical research communications.

[10]  A. Shatkin,et al.  Migration of 40 S ribosomal subunits on messenger RNA in the presence of edeine. , 1978, The Journal of biological chemistry.

[11]  Luciano Milanesi,et al.  Presence of ATG triplets in 5' untranslated regions of eukaryotic cDNAs correlates with a 'weak' context of the start codon , 2001, Bioinform..

[12]  M. Kozak The scanning model for translation: an update , 1989, The Journal of cell biology.

[13]  A. Kochetov,et al.  Alternative translation start sites and hidden coding potential of eukaryotic mRNAs. , 2008, BioEssays : news and reviews in molecular, cellular and developmental biology.

[14]  Caren Marzban,et al.  A Bayesian Neural Network for Severe-Hail Size Prediction , 2001 .

[15]  David J. C. MacKay,et al.  A Practical Bayesian Framework for Backpropagation Networks , 1992, Neural Computation.