MethylSig: a whole genome DNA methylation analysis pipeline

MOTIVATION DNA methylation plays critical roles in gene regulation and cellular specification without altering DNA sequences. The wide application of reduced representation bisulfite sequencing (RRBS) and whole genome bisulfite sequencing (bis-seq) opens the door to study DNA methylation at single CpG site resolution. One challenging question is how best to test for significant methylation differences between groups of biological samples in order to minimize false positive findings. RESULTS We present a statistical analysis package, methylSig, to analyse genome-wide methylation differences between samples from different treatments or disease groups. MethylSig takes into account both read coverage and biological variation by utilizing a beta-binomial approach across biological samples for a CpG site or region, and identifies relevant differences in CpG methylation. It can also incorporate local information to improve group methylation level and/or variance estimation for experiments with small sample size. A permutation study based on data from enhanced RRBS samples shows that methylSig maintains a well-calibrated type-I error when the number of samples is three or more per group. Our simulations show that methylSig has higher sensitivity compared with several alternative methods. The use of methylSig is illustrated with a comparison of different subtypes of acute leukemia and normal bone marrow samples. AVAILABILITY methylSig is available as an R package at http://sartorlab.ccmb.med.umich.edu/software. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Peter W. Laird,et al.  Interplay between the Cancer Genome and Epigenome , 2013, Cell.

[2]  R. Schneider,et al.  Chatting histone modifications in mammals. , 2010, Briefings in functional genomics.

[3]  Ram C. Tripathi,et al.  Estimation of parameters in the beta binomial model , 1994 .

[4]  Zachary D. Smith,et al.  Preparation of reduced representation bisulfite sequencing libraries for genome-scale DNA methylation profiling , 2011, Nature Protocols.

[5]  P. Laird,et al.  Epigenetic stem cell signature in cancer , 2007, Nature Genetics.

[6]  Jianqing Fan,et al.  Generalized likelihood ratio statistics and Wilks phenomenon , 2001 .

[7]  J. Tchinda,et al.  Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer. , 2006, Science.

[8]  O. Rando,et al.  Reduced-representation methylation mapping , 2008, Genome Biology.

[9]  Griffiths Da Maximum likelihood estimation for the beta-binomial distribution and an application to the household distribution of the total number of cases of a disease. , 1973 .

[10]  K. Conneely,et al.  A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data , 2014, Nucleic acids research.

[11]  J. Licht,et al.  Leukemic IDH1 and IDH2 mutations result in a hypermethylation phenotype, disrupt TET2 function, and impair hematopoietic differentiation. , 2010, Cancer cell.

[12]  B. Langmead,et al.  BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions , 2012, Genome Biology.

[13]  K. Patterson,et al.  DNA Methylation: Bisulphite Modification and Analysis , 2011, Journal of visualized experiments : JoVE.

[14]  Christofer L. Bäcklin,et al.  Genome-wide signatures of differential DNA methylation in pediatric acute lymphoblastic leukemia , 2013, Genome Biology.

[15]  Francine E. Garrett-Bakelman,et al.  methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles , 2012, Genome Biology.

[16]  Peter A. Jones,et al.  Targeting DNA methylation for epigenetic therapy. , 2010, Trends in pharmacological sciences.

[17]  J. Issa,et al.  Targeting DNA Methylation , 2009, Clinical Cancer Research.

[18]  P. Laird Principles and challenges of genome-wide DNA methylation analysis , 2010, Nature Reviews Genetics.

[19]  Rajyalakshmi Luthra,et al.  Acute myeloid leukemia with IDH1 or IDH2 mutation: frequency and clinicopathologic features. , 2011, American journal of clinical pathology.

[20]  M. Hottiger,et al.  Crosstalk between SET7/9-dependent methylation and ARTD1-mediated ADP-ribosylation of histone H1.4 , 2013, Epigenetics & Chromatin.

[21]  Zachary D. Smith,et al.  Genome-scale DNA methylation mapping of clinical samples at single-nucleotide resolution , 2010, Nature Methods.

[22]  E. Maher,et al.  DNA methylation: a form of epigenetic control of gene expression , 2010 .

[23]  Fabien Campagne,et al.  DNA methylation signatures identify biologically distinct subtypes in acute myeloid leukemia. , 2010, Cancer cell.

[24]  Li Yu,et al.  [DNA methylation and cancer]. , 2005, Zhonghua nei ke za zhi.

[25]  Francine E. Garrett-Bakelman,et al.  Base-Pair Resolution DNA Methylation Sequencing Reveals Profoundly Divergent Epigenetic Landscapes in Acute Myeloid Leukemia , 2012, PLoS genetics.

[26]  W H DECKER,et al.  Some effects of relaxin in obstetrics. , 1958, Obstetrics and Gynecology.

[27]  R. Lister,et al.  Finding the fifth base: genome-wide sequencing of cytosine methylation. , 2009, Genome research.

[28]  Wei Li,et al.  MOABS: model based analysis of bisulfite sequencing data , 2014, Genome Biology.

[29]  Peter A. Jones,et al.  Epigenetics in cancer. , 2010, Carcinogenesis.

[30]  Z. Herceg,et al.  Epigenetic interplay between histone modifications and DNA methylation in gene silencing. , 2008, Mutation research.

[31]  J. Herman,et al.  Analysis of Promoter CpG Island Hypermethylation in Cancer: Location, Location, Location! , 2011, Clinical Cancer Research.

[32]  A. Gnirke,et al.  Reduced representation bisulfite sequencing for comparative high-resolution DNA methylation analysis , 2005, Nucleic acids research.

[33]  Kevin Petrie,et al.  AML1/ETO, a promiscuous fusion oncoprotein , 2007 .

[34]  L. Aravind,et al.  Impaired hydroxylation of 5-methylcytosine in myeloid cancers with mutant TET2 , 2010, Nature.

[35]  Bin Wang,et al.  Oncometabolite 2-hydroxyglutarate is a competitive inhibitor of α-ketoglutarate-dependent dioxygenases. , 2011, Cancer cell.

[36]  Steven Henikoff,et al.  ISWI and CHD chromatin remodelers bind to promoters but act in gene bodies , 2013, Epigenetics & Chromatin.

[37]  D. Griffiths Maximum likelihood estimation for the beta-binomial distribution and an application to the household distribution of the total number of cases of a disease. , 1973, Biometrics.

[38]  Felix Krueger,et al.  Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications , 2011, Bioinform..

[39]  Lee E. Edsall,et al.  Human DNA methylomes at base resolution show widespread epigenomic differences , 2009, Nature.

[40]  Tao Wang,et al.  RRBS‐Analyser: A Comprehensive Web Server for Reduced Representation Bisulfite Sequencing Data Analysis , 2013, Human mutation.

[41]  A. Feinberg,et al.  Increased methylation variation in epigenetic domains across cancer types , 2011, Nature Genetics.