Comparative Genomic Study Reveals a Transition from TA Richness in Invertebrates to GC Richness in Vertebrates at CpG Flanking Sites: An Indication for Context-Dependent Mutagenicity of Methylated CpG Sites

Vertebrate genomes are characterized by CpG deficiency, particularly in GC-poor regions. This GC content-related CpG deficiency is probably caused by context-dependent deamination of methylated CpG sites. We examined this hypothesis by comparing nucleotide frequencies at CpG flanking positions among invertebrate and vertebrate genomes. We found a transition in nucleotide preference from 5′ T to 5′ A at the invertebrate-vertebrate boundary, indicating that a large number of CpG sites with a 5′ T were depleted after global DNA methylation developed in vertebrates. At the genome level, we investigated CpG observed/expected (obs/exp) values in 500 bp fragments and found that higher CpG obs/exp values occur in GC-poor regions of invertebrate genomes (except sea urchin) but in GC-rich regions of vertebrate genomes. We next compared GC content at CpG flanking positions with the genomic average: it is lower than the average in invertebrate genomes but higher than the average in vertebrate genomes. These results indicate that although 5′ T and 5′ A differ in how strongly they induce deamination of methylated CpG sites, GC content affects the deamination rate even more. In all tests, the results for sea urchin resemble those for vertebrates, perhaps because of its fractional DNA methylation. We therefore suggest that CpG deficiency is mainly a result of high mutation rates of methylated CpG sites in GC-poor regions.
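The two measurements described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The abstract does not state how obs/exp was computed, so the sketch assumes the widely used definition, obs/exp = (number of CpG × fragment length) / (number of C × number of G). It also assumes non-overlapping windows and takes "flanking positions" to mean the single bases immediately 5′ and 3′ of each CpG. The function names are hypothetical.

```python
def cpg_obs_exp(fragment):
    """CpG observed/expected ratio for one fragment.

    Assumed definition (not given in the abstract):
    (count of CG * fragment length) / (count of C * count of G).
    Returns None when the fragment has no C or no G.
    """
    fragment = fragment.upper()
    c = fragment.count("C")
    g = fragment.count("G")
    if c == 0 or g == 0:
        return None
    return fragment.count("CG") * len(fragment) / (c * g)


def gc_fraction(fragment):
    """Fraction of bases in the fragment that are G or C."""
    fragment = fragment.upper()
    if not fragment:
        return None
    return (fragment.count("G") + fragment.count("C")) / len(fragment)


def window_stats(seq, size=500):
    """Yield (GC fraction, CpG obs/exp) for non-overlapping windows.

    A trailing window shorter than `size` is skipped (an assumption).
    """
    for start in range(0, len(seq) - size + 1, size):
        window = seq[start:start + size]
        yield gc_fraction(window), cpg_obs_exp(window)


def flank_gc_fraction(seq):
    """GC fraction over the bases directly 5' and 3' of every CpG.

    Flanks that fall outside the sequence are ignored.
    Returns None when no flanking base is available.
    """
    seq = seq.upper()
    gc = total = 0
    i = seq.find("CG")
    while i != -1:
        for pos in (i - 1, i + 2):  # 5' flank, 3' flank
            if 0 <= pos < len(seq):
                total += 1
                gc += seq[pos] in "GC"
        i = seq.find("CG", i + 1)
    return gc / total if total else None
```

Under these assumptions, the comparison in the abstract amounts to relating the per-window obs/exp values to the per-window GC fraction, and comparing `flank_gc_fraction(seq)` with `gc_fraction(seq)` for each genome.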
