Detection of differentially methylated regions in whole genome bisulfite sequencing data using local Getis-Ord statistics

MOTIVATION DNA methylation is an important epigenetic modification that has essential role in gene regulation, cell differentiation and cancer development. Bisulfite sequencing is a widely used technique to obtain genome-wide DNA methylation profiles, and one of the key tasks of analyzing bisulfite sequencing data is to detect differentially methylated regions (DMRs) among samples under different treatment conditions. Although numerous tools have been proposed to detect differentially methylated single CpG site (DMC) between samples, methods for direct DMR detection, especially for complex study designs, are largely limited. RESULTS We present a new software, GetisDMR, for direct DMR detection. We use beta-binomial regression to model the whole-genome bisulfite sequencing data, where variations in methylation levels and confounding effects have been accounted for. We employ a region-wise test statistic, which is derived from local Getis-Ord statistics and considers the spatial correlation between nearby CpG sites, to detect DMRs. Unlike existing methods, that attempt to infer DMRs from DMCs based on empirical criteria, we provide statistical inference for direct DMR detection. Through extensive simulations and an application to two mouse datasets, we demonstrate that GetisDMR achieves better sensitivities, positive predictive values, more exact locations and better agreement of DMRs with current biological knowledge. AVAILABILITY AND IMPLEMENTATION It is available at https://github.com/DMU-lilab/GetisDMR CONTACTS: y.wen@auckland.ac.nz or zhiguangli@dlmedu.edu.cnSupplementary information: Supplementary data are available at Bioinformatics online.

[1]  Andrew D. Smith,et al.  Using beta-binomial regression for high-precision differential methylation analysis in multifactor whole-genome bisulfite sequencing experiments , 2014, BMC Bioinformatics.

[2]  Yongseok Park,et al.  MethylSig: a whole genome DNA methylation analysis pipeline , 2014, Bioinform..

[3]  Francine E. Garrett-Bakelman,et al.  methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles , 2012, Genome Biology.

[4]  J. Rogers,et al.  DNA methylation profiling of human chromosomes 6, 20 and 22 , 2006, Nature Genetics.

[5]  Toutai Mituyama,et al.  Bisulfighter: accurate detection of methylated cytosines and differentially methylated regions , 2014, Nucleic acids research.

[6]  A. Bird,et al.  CpG islands and the regulation of transcription. , 2011, Genes & development.

[7]  G. Hon,et al.  Adult tissue methylomes harbor epigenetic memory at embryonic enhancers , 2013, Nature Genetics.

[8]  Yurij S. Kharin,et al.  Robust estimation and forecasting for beta-mixed hierarchical models of grouped binary data , 2004 .

[9]  K. Conneely,et al.  A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data , 2014, Nucleic acids research.

[10]  Rafael A Irizarry,et al.  Comprehensive high-throughput arrays for relative methylation (CHARM). , 2008, Genome research.

[11]  Martin Dugas,et al.  Detection of significantly differentially methylated regions in targeted bisulfite sequencing data , 2013, Bioinform..

[12]  Wendy Dean,et al.  Dynamic reprogramming of DNA methylation in the early mouse embryo. , 2002, Developmental biology.

[13]  M. Ehrlich,et al.  DNA methylation in cancer: too much, but also too little , 2002, Oncogene.

[14]  M. Marra,et al.  Next generation sequencing based approaches to epigenomics. , 2010, Briefings in functional genomics.

[15]  A. Getis The Analysis of Spatial Association by Use of Distance Statistics , 2010 .

[16]  K. Patterson,et al.  DNA Methylation: Bisulphite Modification and Analysis , 2011, Journal of visualized experiments : JoVE.

[17]  Gavin Giovannoni,et al.  Guthrie card methylomics identifies temporally stable epialleles that are present at birth in humans , 2012, Genome research.

[18]  Lee E. Edsall,et al.  Human DNA methylomes at base resolution show widespread epigenomic differences , 2009, Nature.

[19]  Gouri Sankar Bhunia,et al.  Spatial and temporal variation and hotspot detection of kala-azar disease in Vaishali district (Bihar), India , 2013, BMC Infectious Diseases.

[20]  Matthew D. Schultz,et al.  Global Epigenomic Reconfiguration During Mammalian Brain Development , 2013, Science.

[21]  J. Ord,et al.  Local Spatial Autocorrelation Statistics: Distributional Issues and an Application , 2010 .

[22]  Jeffrey T Leek,et al.  Significance analysis and statistical dissection of variably methylated regions. , 2012, Biostatistics.

[23]  B. Langmead,et al.  BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions , 2012, Genome Biology.

[24]  René S. Kahn,et al.  The Relationship of DNA Methylation with Age, Gender and Genotype in Twins and Healthy Controls , 2009, PloS one.

[25]  Jeffrey B. Cheng,et al.  Estimating absolute methylation levels at single-CpG resolution from methylation enrichment and restriction enzyme sequencing methods , 2013, RECOMB.

[26]  Peter F. Stadler,et al.  Fast and sensitive mapping of bisulfite-treated sequencing data , 2012, Bioinform..

[27]  J. Ord,et al.  Testing for Local Spatial Autocorrelation in the Presence of Global Autocorrelation , 2001 .

[28]  Matthew D. Schultz,et al.  'Leveling' the playing field for analyses of single-base resolution DNA methylomes. , 2012, Trends in genetics : TIG.

[29]  A. Bird,et al.  DNA methylation landscapes: provocative insights from epigenomics , 2008, Nature Reviews Genetics.

[30]  Chia-Lin Wei,et al.  Dynamic changes in the human methylome during differentiation. , 2010, Genome research.

[31]  C. Bock Analysing and interpreting DNA methylation data , 2012, Nature Reviews Genetics.

[32]  Rudolf Jaenisch,et al.  Role for DNA methylation in genomic imprinting , 1993, Nature.

[33]  Wei Li,et al.  MOABS: model based analysis of bisulfite sequencing data , 2014, Genome Biology.

[34]  Peter A. Jones,et al.  Epigenetics in cancer. , 2010, Carcinogenesis.