A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data

Background
DNA methylation is an epigenetic modification that plays an important role in gene regulation. Whole-genome bisulfite sequencing and reduced representation bisulfite sequencing make it possible to measure DNA methylation at single-CpG resolution. A primary goal in analyzing such data is to test for methylation differences between two biological conditions. However, the high cost and complexity of these sequencing experiments limit the number of biological replicates, which poses challenges for the development of statistical methods.

Results
Bayesian modeling is well known for borrowing strength across the genome, which makes it a powerful tool for high-dimensional, low-sample-size data. To identify differentially methylated loci accurately, especially in low-coverage data, we propose a full Bayesian partition model that detects differentially methylated loci between two conditions. Because hypo-methylation and hyper-methylation have distinct biological implications, it is desirable to distinguish between these two types of differential methylation. The advantage of our Bayesian model is that it classifies each locus as equal-, hypo- or hyper-methylated in a single step, without further post-hoc analysis. An R package named MethyBayes, which implements the proposed full Bayesian partition model, will be submitted to the Bioconductor website upon publication of the manuscript.

Conclusions
Based on simulation studies and real data analysis, including bioinformatics analysis, the proposed full Bayesian partition model outperforms existing methods in terms of power while maintaining a low false discovery rate.
