An integer period DFT for biological sequence processing

Detection of periodicity in symbolic sequences such as DNA is of considerable interest in a number of applications; however, fast, accurate algorithms are needed for measuring spectral content at multiple integer periods. This paper describes an integer period discrete Fourier transform (IPDFT), presents a new algorithm for its implementation, and discusses applications to DNA sequence analysis. Evaluations on DNA sequence data show that the IPDFT may be a more suitable tool for periodicity analysis than an existing, widely used correlation-based approach.
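
To make "spectral content at multiple integer periods" concrete, the following is a minimal sketch, not the paper's algorithm. It assumes one common convention: the sequence is split into binary indicator sequences, one per base, and the squared magnitude of the Fourier coefficient at frequency 1/p is summed over the bases for each integer period p. The function names `period_power` and `period_spectrum`, the normalization by sequence length, and the direct O(N) evaluation per period are illustrative assumptions; the abstract does not give the IPDFT definition or its fast implementation.

```python
import cmath


def period_power(seq, period, alphabet="ACGT"):
    """Spectral power of a symbolic sequence at one integer period.

    Assumption (not taken from the paper): each symbol is mapped to a
    binary indicator sequence, and the squared magnitudes of the Fourier
    coefficients at frequency 1/period are summed over all symbols.
    """
    n = len(seq)
    total = 0.0
    for sym in alphabet:
        # Fourier coefficient of the indicator sequence of `sym`,
        # evaluated directly at frequency 1/period.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * k / period)
            for k, s in enumerate(seq)
            if s == sym
        )
        total += abs(coeff) ** 2
    # Normalizing by length is an illustrative choice.
    return total / n


def period_spectrum(seq, max_period):
    """Power at every integer period from 2 up to max_period."""
    return {p: period_power(seq, p) for p in range(2, max_period + 1)}


if __name__ == "__main__":
    toy = "ACG" * 20  # synthetic sequence with an exact period of 3
    spectrum = period_spectrum(toy, 6)
    strongest = max(spectrum, key=spectrum.get)
    print("strongest period:", strongest)
```

On the synthetic sequence above, period 3 receives the largest value, as expected for a sequence that repeats every three symbols. Evaluating each period directly, as this sketch does, costs O(N) per period, which illustrates why the abstract asks for a fast algorithm when many integer periods are of interest.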
