The role of the symbolic-to-numerical mapping in the detection of DNA periodicities

The detection of various forms of periodicity in DNA sequences has been an active area of research in recent years. Most signal-processing-based methods use the simple Voss mapping to convert the symbolic DNA sequence into binary indicator sequences before computing some form of the so-called DNA spectrum to locate these repeats. A key research question that remains open, however, is whether the success of these techniques is specific to the Voss mapping. In this paper, we first propose a new, generic matrix-based framework that includes most of the widely used mappings in the literature as special cases. Using this framework, we then show that the standard DNA spectrum is in fact invariant under most of these mappings. Finally, we demonstrate that a number of potential new mappings follow naturally from the suggested framework.
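To make the Voss mapping and the DNA spectrum concrete, the following is a minimal sketch and not the paper's implementation. It assumes the commonly used definition of the DNA spectrum as the sum, over the four bases, of the squared DFT magnitudes of the binary indicator sequences. The function names (`voss_indicators`, `dft`, `dna_spectrum`) and the toy sequence are illustrative choices.

```python
import cmath

BASES = "ACGT"


def voss_indicators(seq):
    """Voss mapping: one binary indicator sequence per base.

    indicators[b][t] is 1 if seq[t] == b, else 0.
    """
    seq = seq.upper()
    return {b: [1 if s == b else 0 for s in seq] for b in BASES}


def dft(x):
    """Direct O(N^2) discrete Fourier transform (fine for short toy inputs)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def dna_spectrum(seq):
    """Assumed definition: S[k] = sum over bases b of |X_b[k]|^2."""
    indicators = voss_indicators(seq)
    transforms = [dft(indicators[b]) for b in BASES]
    return [sum(abs(X[k]) ** 2 for X in transforms) for k in range(len(seq))]


# Toy sequence with an exact period of 3 (hypothetical data, not from the paper).
seq = "ATG" * 8
S = dna_spectrum(seq)

# Strongest non-DC component in the first half of the spectrum.
peak = max(range(1, len(seq) // 2 + 1), key=lambda k: S[k])
print("peak index:", peak, "-> period:", len(seq) / peak)
```

For a sequence of length N with a period-3 repeat, the spectrum peaks at index k = N/3, which is how spectrum-based methods flag such periodicities. The direct DFT is used here only to keep the sketch dependency-free; an FFT routine would be the usual choice for real sequences.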
