Fractals and Hidden Symmetries in DNA

This paper deals with the digital complex representation of a DNA sequence and the analysis of existing correlations by wavelets. The symbolic DNA sequence is mapped into a nonlinear time series. By studying this time series the existence of fractal shapes and symmetries will be shown. At first step, the indicator matrix enables us to recognize some typical patterns of nucleotide distribution. The DNA sequence, of the influenza virus A (H1N1), is investigated by using the complex representation, together with the corresponding walks on DNA; in particular, it is shown that DNA walks are fractals. Finally, by using the wavelet analysis, the existence of symmetries is proven.

[1]  Ming Li,et al.  On the Predictability of Long-Range Dependent Series , 2010 .

[2]  Hanspeter Herzel,et al.  Interpreting correlations in biosequences , 1998 .

[3]  Wentian Li The Measure of Compositional Heterogeneity in DNA Sequences Is Related to Measures of Complexity , 1997, adap-org/9709007.

[4]  P. Bernaola-Galván,et al.  Compositional segmentation and long-range fractal correlations in DNA sequences. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[5]  R. Polozov,et al.  Wavelet analysis of DNA sequences. , 1996, Genetic analysis : biomolecular engineering.

[6]  C. Peng,et al.  Mosaic organization of DNA nucleotides. , 1994, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[7]  Wentian Li,et al.  Long-range correlation and partial 1/fα spectrum in a noncoding DNA sequence , 1992 .

[8]  Alain Arneodo,et al.  Long-range correlations between DNA bending sites: relation to the structure and dynamics of nucleosomes. , 2002, Journal of molecular biology.

[9]  Wentian Li The complexity of DNA , 1997 .

[10]  J. Fitch,et al.  Genomic engineering: moving beyond DNA sequence to function , 2000, Proceedings of the IEEE.

[11]  Dimitris Anastassiou,et al.  Frequency-domain analysis of biomolecular sequences , 2000, Bioinform..

[12]  Eivind Coward,et al.  Equivalence of two Fourier methods for biological sequences , 1997 .

[13]  Emmanuel Bacry,et al.  What can we learn with wavelets about DNA sequences , 1998 .

[14]  Branko Borštnik,et al.  Analysis of Apparent 1/fα Spectrum in DNA Sequences , 1993 .

[15]  R. Voss,et al.  Evolution of long-range fractal correlations and 1/f noise in DNA base sequences. , 1992, Physical review letters.

[16]  S Karlin,et al.  Patchiness and correlations in DNA sequences , 1993, Science.

[17]  C. Peng,et al.  Long-range correlations in nucleotide sequences , 1992, Nature.

[18]  Emmanuel Bacry,et al.  Wavelet based fractal analysis of DNA sequences , 1996 .

[19]  Amir Niknejad,et al.  DNA sequence representation without degeneracy. , 2003, Nucleic acids research.

[20]  Carlo Cattani,et al.  Complex Representation of DNA Sequences , 2008, BIRD.

[21]  P. Vandergheynst,et al.  Fourier and wavelet transform analysis, a tool for visualizing regular patterns in DNA sequences. , 2000, Journal of theoretical biology.

[22]  Carlo Cattani,et al.  Wavelet and Wave Analysis as Applied to Materials with Micro or Nanostructure , 2007, Series on Advances in Mathematics for Applied Sciences.

[23]  Ming Li Fractal Time Series—A Tutorial Review , 2010 .

[24]  E. Bacry,et al.  Characterizing long-range correlations in DNA sequences from wavelet analysis. , 1995, Physical review letters.

[25]  Carlo Cattani Haar wavelets based technique in evolution problems , 2004, Proceedings of the Estonian Academy of Sciences. Physics. Mathematics.

[26]  Paul Dan Cristea,et al.  Large scale features in DNA genomic signals , 2003, Signal Process..

[27]  Tsonis,et al.  Wavelet analysis of DNA sequences. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[28]  Ming Li,et al.  Power spectrum of generalized Cauchy process , 2010, Telecommun. Syst..

[29]  Denise Gorse,et al.  Wavelet transforms for the characterization and detection of repeating motifs. , 2002, Journal of molecular biology.

[30]  Carlo Cattani,et al.  Wavelet Algorithms for DNA Analysis , 2010 .

[31]  R. Mantegna,et al.  Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[32]  Richard F. Voss,et al.  LONG-RANGE FRACTAL CORRELATIONS IN DNA INTRONS AND EXONS , 1994 .

[33]  H Herzel,et al.  Correlations in protein sequences and property codes. , 1998, Journal of theoretical biology.

[34]  Henry Gee A journey into the genome: what's there , 2001 .

[35]  Carlo Cattani,et al.  Harmonic wavelet approximation of random, fractal and high frequency signals , 2010, Telecommun. Syst..

[36]  Alessandro Neri,et al.  Visualization and analysis of DNA sequences using DNA walks , 2004, J. Frankl. Inst..

[37]  Wentian Li,et al.  The Study of Correlation Structures of DNA Sequences: A Critical Review , 1997, Comput. Chem..

[38]  P. P. Vaidyanathan,et al.  The role of signal-processing concepts in genomics and proteomics , 2004, J. Frankl. Inst..

[39]  B. Wang,et al.  Correlation property of length sequences based on global structure of the complete genome. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  Carlo Cattani,et al.  Haar wavelet-based technique for sharp jumps classification , 2004 .