Comparative Study of Back Propagation Neural Network and Support Vector Machine to Classify Hypertension Gene Sequences from Relative Codon Frequency

In this work, relative codon frequency has been considered to specify gene sequences. Implementing BPNN and SVM hypertension gene sequences have been categorized from others. A single gene sequence has been converted into 19 dimensional feature vector and then the Back Propagation Neural Network and Support Vector Machine have been trained using the feature vectors of all gene sequences. Since relative frequencies are not dependent on lengths, sequences with various lengths are used here. Accuracy rates of Support Vector Machine and Back Propagation Neural Network have been compared using different no. of samples at the training and testing phase. In this experiment, the accuracy rate has been increased proportionally with the size of samples. It is found that in most of the cases, BPNN is better than SVM

[1]  Paul M. Sharp,et al.  Codon usage in Caenorhabditis elegans: delineation of translational selection and mutational biases , 1994, Nucleic Acids Res..

[2]  F. Lisacek,et al.  Codon usage and gene function are related in sequences of Arabidopsis thaliana. , 1998, Gene.

[3]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[4]  A. Eyre-Walker,et al.  An analysis of codon usage in mammals: Selection or mutation bias? , 1991, Journal of Molecular Evolution.

[5]  M. Cerdán,et al.  Codon usage in Kluyveromyces lactis and in yeast cytochrome c-encoding genes. , 1994, Gene.

[6]  Jagath C. Rajapakse,et al.  Determination of the Relative Importance of Gene Function or Taxonomic Grouping to Codon Usage Bias Using Cluster Analysis and SVMs , 2006, 2006 IEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology.

[7]  Mevlut Ture,et al.  Comparing classification techniques for predicting essential hypertension , 2005, Expert Syst. Appl..

[8]  X. Pan,et al.  [Molecular evolution of MHC DQA genes. II. Phylogenetic analysis based on nucleotide substitution and SCU bias]. , 1997, Yi chuan xue bao = Acta genetica Sinica.

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  M. Gouy,et al.  Codon usage in bacteria: correlation with gene expressivity. , 1982, Nucleic acids research.

[11]  Xiao Sun,et al.  Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales , 2004, Virus Research.

[12]  Paul M. Sharp,et al.  Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes , 1986, Nucleic Acids Res..

[13]  Xiao Sun,et al.  Cluster analysis of the codon use frequency of MHC genes from different species. , 2002, Bio Systems.

[14]  B. Morton,et al.  Chloroplast DNA codon use: Evidence for selection at the psb A locus based on tRNA availability , 1993, Journal of Molecular Evolution.

[15]  A. Santhakumaran,et al.  Pre-Diagnosis of Hypertension Using Artificial Neural Network , 2011 .

[16]  Yann LeCun,et al.  Measuring the VC-Dimension of a Learning Machine , 1994, Neural Computation.