Investigating the relationship of DNA methylation with mutation rate and allele frequency in the human genome

BackgroundDNA methylation, which mainly occurs at CpG dinucleotides, is a dynamic epigenetic regulation mechanism in most eukaryotic genomes. It is already known that methylated CpG dinucleotides can lead to a high rate of C to T mutation at these sites. However, less is known about whether and how the methylation level causes a different mutation rate, especially at the single-base resolution.ResultsIn this study, we used genome-wide single-base resolution methylation data to perform a comprehensive analysis of the mutation rate of methylated cytosines from human embryonic stem cell. Through the analysis of the density of single nucleotide polymorphisms, we first confirmed that the mutation rate in methylated CpG sites is greater than that in unmethylated CpG sites. Then, we showed that among methylated CpG sites, the mutation rate is markedly increased in low-intermediately (20-40% methylation level) to intermediately methylated CpG sites (40-60% methylation level) of the human genome. This mutation pattern was observed regardless of DNA strand direction and the sequence coverage over the site on which the methylation level was calculated. Moreover, this highly non-random mutation pattern was found more apparent in intergenic and intronic regions than in promoter regions and CpG islands. Our investigation suggested this pattern appears primarily in autosomes rather than sex chromosomes. Further analysis based on human-chimpanzee divergence confirmed these observations. Finally, we observed a significant correlation between the methylation level and cytosine allele frequency.ConclusionsOur results showed a high mutation rate in low-intermediately to intermediately methylated CpG sites at different scales, from the categorized genomic region, whole chromosome, to the whole genome level, thereby providing the first supporting evidence of mutation rate variation at human methylated CpG sites using the genome-wide sing-base resolution methylation data.

[1]  Lee E. Edsall,et al.  Human DNA methylomes at base resolution show widespread epigenomic differences , 2009, Nature.

[2]  Keith C. Norris,et al.  DNA cytosine methylation and heat-induced deamination , 1986, Bioscience reports.

[3]  Alan Hodgkinson,et al.  Variation in the mutation rate across mammalian genomes , 2011, Nature Reviews Genetics.

[4]  Leng Han,et al.  CpG island density and its correlations with genomic features in mammalian genomes , 2008, Genome Biology.

[5]  Kateryna Makova,et al.  Male-driven evolution. , 2002, Current opinion in genetics & development.

[6]  Laurent Duret,et al.  The Impact of Recombination on Nucleotide Substitutions in the Human Genome , 2008, PLoS genetics.

[7]  Zhongming Zhao,et al.  Functional complementation between transcriptional methylation regulation and post-transcriptional microRNA regulation in the human genome , 2011, BMC Genomics.

[8]  Taro L. Saito,et al.  Genome-wide genetic variations are highly correlated with proximal DNA methylation patterns , 2012, Genome research.

[9]  H. Ellegren,et al.  Substitution rate variation at human CpG sites correlates with non-CpG divergence, methylation level and GC content , 2011, Genome Biology.

[10]  Daiya Takai,et al.  Comprehensive analysis of CpG islands in human chromosomes 21 and 22 , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[11]  R. Durbin,et al.  A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis , 2008, Nature Biotechnology.

[12]  Junfeng Xia,et al.  Do MicroRNAs Preferentially Target the Genes with Low DNA Methylation Level at the Promoter Region? , 2011, ICIC.

[13]  Zhongming Zhao,et al.  Contrast features of CpG islands in the promoter and other regions in the dog genome. , 2009, Genomics.

[14]  Zhongming Zhao,et al.  Conservation and divergence of DNA methylation in eukaryotes , 2011, Epigenetics.

[15]  K. J. Fryxell,et al.  CpG mutation rates in the human genome are highly dependent on local GC content. , 2005, Molecular biology and evolution.

[16]  J. Stamatoyannopoulos,et al.  Human mutation rate associated with DNA replication timing , 2009, Nature Genetics.

[17]  Zhongming Zhao,et al.  Features of Methylation and Gene Expression in the Promoter-Associated CpG Islands Using Human Methylome Data , 2012, Comparative and functional genomics.

[18]  I. Weissman,et al.  Stem cells, cancer, and cancer stem cells , 2001, Nature.

[19]  Zhongming Zhao,et al.  CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome? , 2009, BMC Bioinformatics.

[20]  Zhongming Zhao,et al.  Mutational spectrum in the recent human genome inferred by single nucleotide polymorphisms. , 2006, Genomics.

[21]  Laurent Farinelli,et al.  Impact of replication timing on non-CpG and CpG substitution rates in mammalian genomes. , 2010, Genome research.

[22]  Zhongming Zhao,et al.  Methylation-dependent transition rates are dependent on local sequence lengths and genomic regions. , 2007, Molecular biology and evolution.

[23]  Zhongming Zhao,et al.  CpG islands: algorithms and applications in methylation studies. , 2009, Biochemical and biophysical research communications.

[24]  David N. Cooper,et al.  The CpG dinucleotide and human genetic disease , 1988, Human Genetics.

[25]  Leng Han,et al.  Features and trend of loss of promoter-associated CpG islands in the human and mouse genomes. , 2007, Molecular biology and evolution.