Prediction of protein coding regions by combining Fourier and Wavelet Transform

Identifying protein coding regions in DNA sequences is an important step in locating genes. The major signal in the protein coding regions of most genomic sequences is three-base periodicity. This paper proposes a Fourier-wavelet-based method for predicting protein coding regions that builds on an earlier Fourier-based predictor. The validity of the approach is supported by theoretical analysis and experimental results. An increase in sensitivity and a decrease in specificity demonstrate the efficacy of the proposed predictor.
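
To make the three-base periodicity signal concrete, the following is a minimal sketch of the classical Fourier measure: each base is mapped to a binary indicator sequence, and the spectral power at frequency 1/3 is summed over the four bases within a sliding window. It illustrates only the general Fourier-based idea, not the Fourier-wavelet predictor proposed in the paper. The function names, the default window of 351 bases, and the step of 3 are assumptions chosen for illustration.

```python
import cmath
from typing import List, Tuple

BASES = "ACGT"


def period3_power(window: str) -> float:
    """Spectral power at frequency 1/3, summed over the four bases.

    For each base b, the binary indicator sequence u_b[n] is 1 where
    window[n] == b and 0 elsewhere. Its Fourier coefficient at frequency
    1/3 is sum_n u_b[n] * exp(-2*pi*i*n/3); the squared magnitudes of the
    four coefficients are added together.
    """
    total = 0.0
    for base in BASES:
        coeff = sum(
            cmath.exp(-2j * cmath.pi * n / 3)
            for n, symbol in enumerate(window)
            if symbol == base
        )
        total += abs(coeff) ** 2
    return total


def sliding_period3(
    sequence: str, window: int = 351, step: int = 3
) -> List[Tuple[int, float]]:
    """Return (start position, period-3 power) for each window.

    The window length and step are illustrative defaults (assumptions),
    not values taken from the paper.
    """
    seq = sequence.upper()
    return [
        (start, period3_power(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    # A perfectly period-3 toy sequence versus a homopolymer of equal length.
    periodic = "ATG" * 40
    flat = "A" * 120
    print(period3_power(periodic), period3_power(flat))
```

In a sketch like this, windows whose power stands well above the background would be flagged as candidate coding regions. The threshold is left to the reader, since the abstract does not specify one.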
