Chargaff's "Grammar of Biology": New Fractal-like Rules

Chargaff once said, "I saw before me in dark contours the beginning of a grammar of Biology." In linguistics, a "grammar" is the set of rules of a natural language, but we do not know for sure what Chargaff meant by a "grammar" of Biology. Nevertheless, adopting the metaphor, Chargaff himself began a "grammar of Biology" by discovering the so-called Chargaff's rules. In this work, we further develop his grammar. Using new concepts, we were able to discover new genomic rules that seem to be invariant across a large set of organisms and that show a fractal-like property: no matter the scale, the same pattern is observed (self-similarity). We hope that these new invariant genomic rules may be used in different contexts, from detecting bias in short-read data to assessing genome assembly quality.
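As background for the rules discussed above, the following minimal Python sketch illustrates only the classical Chargaff second parity rule extended to k-mers: within a single strand, a k-mer and its reverse complement tend to occur about equally often. It does not implement the new rules of this work, which the abstract does not define. The function names (`reverse_complement`, `kmer_counts`, `parity_deviation`) and the deviation score are illustrative assumptions, not taken from the paper.

```python
from collections import Counter

# Translation table mapping each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")
VALID_BASES = set("ACGT")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string over A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


def kmer_counts(seq, k):
    """Count overlapping k-mers in one strand, skipping any with non-ACGT symbols."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= VALID_BASES:
            counts[kmer] += 1
    return counts


def parity_deviation(seq, k):
    """Hypothetical score in [0, 1]: 0 means every k-mer is exactly as frequent
    as its reverse complement; 1 means no k-mer's reverse complement occurs."""
    counts = kmer_counts(seq, k)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    # Include reverse complements that never occur, so each pair is seen from both sides.
    kmers = set(counts) | {reverse_complement(kmer) for kmer in counts}
    diff = sum(abs(counts[kmer] - counts[reverse_complement(kmer)]) for kmer in kmers)
    # Each (k-mer, reverse complement) pair was counted twice in diff.
    return diff / (2 * total)


if __name__ == "__main__":
    # Toy sequence built as s + reverse_complement(s); a real genome would be read from a file.
    s = "AACG"
    toy = s + reverse_complement(s)
    for k in (1, 2, 3):
        print(k, parity_deviation(toy, k))
```

Running the same score for several values of k on a real genome is one simple way to check whether a symmetry holds at more than one scale; the choice of k values and of this particular score is an assumption made for illustration.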
