Correlated structural features and their applications to exon recognition in DNA seqeunces

This paper presents a new approach for short gene recognition in DNA sequences. Three DNA structural features are selected from an analysis of fourteen structural features. The feature values are mapped to new values. Three DNA signals are generated by the three sets of mapped feature values. Then the three DNA signals are normalized and combined into one signal. An auto-regressive (AR) model is used for power spectral density (PSD) estimation of the signal. The experiment result obtained by this method is shown to be comparable to existing exon detection methods which use digital signal processing (DSP). Also the computation complexity of the new method is only 1/3 of that of the method proposed previously.

[1]  Hong Yan,et al.  Autoregressive modeling of DNA features for short exon recognition , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[2]  P. P. Vaidyanathan,et al.  GENE AND EXON PREDICTION USING ALLPASS-BASED FILTERS , 2002 .

[3]  S. Tiwari,et al.  Prediction of probable genes by Fourier analysis of genomic sequences , 1997, Comput. Appl. Biosci..

[4]  H. Blöcker,et al.  Predicting DNA duplex stability from the base sequence. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Fickett Recognition of protein coding regions in DNA sequences. , 1982, Nucleic acids research.

[6]  Tuan D. Pham,et al.  Spectral estimation techniques for DNA sequence and microarray data analysis , 2007 .

[7]  Alan Wee-Chung Liew,et al.  DB-Curve: a novel 2D method of DNA sequence visualization and representation , 2003 .

[8]  Hong Yan,et al.  Short Exon Detection in DNA Sequences Based on Multifeature Spectral Analysis , 2011, EURASIP J. Adv. Signal Process..

[9]  Hong Yan,et al.  Studies of spectral properties of short genes using the wavelet subspace Hilbert–Huang transform (WSHHT) , 2008 .

[10]  A V Sivolob,et al.  Translational positioning of nucleosomes on DNA: the role of sequence-dependent isotropic DNA bending stiffness. , 1995, Journal of molecular biology.

[11]  M. A. El Hassan,et al.  Propeller-twisting of base-pairs and the conformational mobility of dinucleotide steps in DNA. , 1996, Journal of molecular biology.

[12]  Yvan Saeys,et al.  Large-scale structural analysis of the core promoter in mammalian and plant genomes , 2005, Nucleic acids research.

[13]  N. Sugimoto,et al.  Improved thermodynamic parameters and helix initiation factor to predict stability of DNA duplexes. , 1996, Nucleic acids research.

[14]  S. Lang,et al.  Frequency estimation with maximum entropy spectral estimators , 1980 .

[15]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[16]  Juan V. Lorenzo-Ginori,et al.  Digital Signal Processing in the Analysis of Genomic Sequences , 2009 .

[17]  Leonidas D. Iasemidis,et al.  Autoregressive Modeling and Feature Analysis of DNA Sequences , 2004, EURASIP J. Adv. Signal Process..

[18]  Hong Yan,et al.  Multi-scale parametric spectral analysis for exon detection in DNA sequences based on forward-backward linear prediction and singular value decomposition of the double-base curves , 2008, Bioinformation.

[19]  Tessamma Thomas,et al.  Discrete wavelet transform de-noising in eukaryotic gene splicing , 2010, BMC Bioinformatics.