Spectral analysis of coding and non-coding regions of a DNA sequence by Parametric method

The objective of this paper is to estimate the spectral content of coding and non-coding segments of DNA sequence by Parametric method. An attempt has been made so that by analyzing this estimated spectral content, the hidden internal properties of the DNA sequence can be brought into light in order to identify coding regions from non-coding ones. In this approach the DNA sequence from various Homo Sapien genes have been identified for sample test and assigned numerical values based on strong-weak hydrogen bonding of nucleotides. DSP analysis of DNA sequences by Parametric Spectral Estimation methods show satisfactory results.

[1]  Monson H. Hayes,et al.  Statistical Digital Signal Processing and Modeling , 1996 .

[2]  Dimitris Anastassiou,et al.  Frequency-domain analysis of biomolecular sequences , 2000, Bioinform..

[3]  M. Roy,et al.  Identification and analysis of coding and non-coding regions of a DNA sequence by positional frequency distribution of nucleotides (PFDN) algorithm , 2009, 2009 4th International Conference on Computers and Devices for Communication (CODEC).

[4]  Leonidas D. Iasemidis,et al.  Autoregressive Modeling and Feature Analysis of DNA Sequences , 2004, EURASIP J. Adv. Signal Process..

[5]  Wentian Li,et al.  Long-range correlation and partial 1/fα spectrum in a noncoding DNA sequence , 1992 .

[6]  Changchuan Yin,et al.  Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence. , 2007, Journal of theoretical biology.

[7]  P. P. Vaidyanathan,et al.  The role of signal-processing concepts in genomics and proteomics , 2004, J. Frankl. Inst..

[8]  R. Voss,et al.  Evolution of long-range fractal correlations and 1/f noise in DNA base sequences. , 1992, Physical review letters.

[9]  Hong Yan,et al.  Autoregressive Models for Spectral Analysis of Short Tandem Repeats in DNA Sequences , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[10]  S. Tiwari,et al.  Prediction of probable genes by Fourier analysis of genomic sequences , 1997, Comput. Appl. Biosci..

[11]  A. Nair,et al.  A coding measure scheme employing electron-ion interaction pseudopotential (EIIP) , 2006, Bioinformation.