Detect differentially methylated regions using non-homogeneous hidden Markov model for methylation array data

Motivation DNA methylation is an important epigenetic mechanism in gene regulation and the detection of differentially methylated regions (DMRs) is enthralling for many disease studies. There are several aspects that we can improve over existing DMR detection methods: (i) methylation statuses of nearby CpG sites are highly correlated, but this fact has seldom been modelled rigorously due to the uneven spacing; (ii) it is practically important to be able to handle both paired and unpaired samples; and (iii) the capability to detect DMRs from a single pair of samples is demanded. Results We present DMRMark (DMR detection based on non-homogeneous hidden Markov model), a novel Bayesian framework for detecting DMRs from methylation array data. It combines the constrained Gaussian mixture model that incorporates the biological knowledge with the non-homogeneous hidden Markov model that models spatial correlation. Unlike existing methods, our DMR detection is achieved without predefined boundaries or decision windows. Furthermore, our method can detect DMRs from a single pair of samples and can also incorporate unpaired samples. Both simulation studies and real datasets from The Cancer Genome Atlas showed the significant improvement of DMRMark over other methods. Availability and implementation DMRMark is freely available as an R package at the CRAN R package repository. Contact xfan@cuhk.edu.hk. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[2]  Carsten Wiuf,et al.  Comprehensive Genome Methylation Analysis in Bladder Cancer: Identification and Validation of Novel Methylated Genes and Application of These as Urinary Tumor Markers , 2011, Clinical Cancer Research.

[3]  Xiaofei Yang,et al.  Comparative pan-cancer DNA methylation analysis reveals cancer common and specific patterns , 2016, Briefings Bioinform..

[4]  Jayantha Gunaratne,et al.  YBX1 gene silencing inhibits migratory and invasive potential via CORO1C in breast cancer in vitro , 2017, BMC Cancer.

[5]  A. Bird,et al.  DNA methylation landscapes: provocative insights from epigenomics , 2008, Nature Reviews Genetics.

[6]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[7]  Roland Eils,et al.  DNA methylome analysis in Burkitt and follicular lymphomas identifies differentially methylated regions linked to somatic mutation and transcriptional control , 2015, Nature Genetics.

[8]  Adrian E. Raftery,et al.  mclust Version 4 for R : Normal Mixture Modeling for Model-Based Clustering , Classification , and Density Estimation , 2012 .

[9]  J. Keilwagen,et al.  Area under Precision-Recall Curves for Weighted and Unweighted Data , 2014, PloS one.

[10]  Tamar Sofer,et al.  A-clustering: a novel method for the detection of co-regulated methylation regions, and regions associated with exposure , 2013, Bioinform..

[11]  Rafael A Irizarry,et al.  Comprehensive high-throughput arrays for relative methylation (CHARM). , 2008, Genome research.

[12]  Janine E. Deakin,et al.  In Vivo Function and Evolution of the Eutherian-Specific Pluripotency Marker UTF1 , 2013, PloS one.

[13]  Michael Q. Zhang,et al.  High definition profiling of mammalian DNA methylation by array capture and single molecule bisulfite sequencing , 2009 .

[14]  Sergio Fonda,et al.  Promoter methylation and downregulated expression of the TBX15 gene in ovarian carcinoma. , 2016, Oncology letters.

[15]  Satoshi Yamashita,et al.  Estimation of the Fraction of Cancer Cells in a Tumor DNA Sample Using DNA Methylation , 2013, PloS one.

[16]  Yvonne Vergouwe,et al.  FGFR3, TERT and OTX1 as a Urinary Biomarker Combination for Surveillance of Patients with Bladder Cancer in a Large Prospective Multicenter Study , 2017, The Journal of urology.

[17]  Andrew Glass,et al.  Discovery and validation of methylation markers for endometrial cancer , 2014, International journal of cancer.

[18]  S. Tavazoie,et al.  PTPRN2 and PLCβ1 promote metastatic breast cancer cell migration through PI(4,5)P2‐dependent actin remodeling , 2015, The EMBO journal.

[19]  Jun Wang,et al.  Predicting tumor purity from methylation microarray data , 2015, Bioinform..

[20]  Wonyul Lee,et al.  Identification of differentially methylated loci using wavelet-based functional mixed models , 2016, Bioinform..

[21]  Wei Jiang,et al.  High-throughput DNA methylation profiling using universal bead arrays. , 2006, Genome research.

[22]  J R Ecker,et al.  Human DNA methylomes of neurodegenerative diseases show common epigenomic patterns , 2016, Translational psychiatry.

[23]  Saurabh Baheti,et al.  Identification of differentially methylated regions in new genes associated with knee osteoarthritis. , 2016, Gene.

[24]  Toutai Mituyama,et al.  Bisulfighter: accurate detection of methylated cytosines and differentially methylated regions , 2014, Nucleic acids research.

[25]  C-C Huang,et al.  Abstract P4-09-23: Gene expression signatures of microcalcifications among Taiwanese breast cancers: , 2016 .

[26]  Shufeng Zhou,et al.  Hsa-microRNA-181a is a regulator of a number of cancer genes and a biomarker for endometrial carcinoma in patients: a bioinformatic and clinical study and the therapeutic implication , 2015, Drug design, development and therapy.

[27]  H. Cui,et al.  MicroRNA-663 targets TGFB1 and regulates lung cancer proliferation. , 2011, Asian Pacific journal of cancer prevention : APJCP.

[28]  Tobias Rydén,et al.  EM versus Markov chain Monte Carlo for estimation of hidden Markov models: a computational perspective , 2008 .

[29]  Peter L Molloy,et al.  De novo identification of differentially methylated regions in the human genome , 2015, Epigenetics & Chromatin.

[30]  D. Balding,et al.  Epigenome-wide association studies for common human diseases , 2011, Nature Reviews Genetics.

[31]  W. Reik,et al.  Selective impairment of methylation maintenance is the major cause of DNA methylation reprogramming in the early embryo , 2015, Epigenetics & Chromatin.

[32]  Peter A. Jones,et al.  DNA methylation: The nuts and bolts of repression , 2007, Journal of cellular physiology.

[33]  Wei Li,et al.  MOABS: model based analysis of bisulfite sequencing data , 2014, Genome Biology.

[34]  Zhaohui S. Qin,et al.  Detection of differentially methylated regions from whole-genome bisulfite sequencing data without replicates , 2015, Nucleic acids research.

[35]  Stephan Beck,et al.  Probe Lasso: A novel method to rope in differentially methylated regions with 450K DNA methylation data , 2015, Methods.

[36]  Hao Wu,et al.  Estimating and accounting for tumor purity in the analysis of DNA methylation data from cancer studies , 2017, Genome Biology.

[37]  Sven Laur,et al.  seqlm: an MDL based method for identifying differentially methylated regions in high density methylation array data , 2016, Bioinform..

[38]  Gerry Melino,et al.  p73 in Cancer. , 2011, Genes & cancer.

[39]  Yair Lotan,et al.  Detection of Bladder Cancer Using Novel DNA Methylation Biomarkers in Urine Sediments , 2011, Cancer Epidemiology, Biomarkers & Prevention.

[40]  Shin Ishii,et al.  Optimal Aggregation of Binary Classifiers for Multiclass Cancer Diagnosis Using Gene Expression Profiles , 2009, TCBB.

[41]  Hua Yu,et al.  COHCAP: an integrative genomic pipeline for single-nucleotide resolution DNA methylation analysis , 2013, Nucleic acids research.

[42]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[43]  W. Richard McCombie,et al.  Sperm Methylation Profiles Reveal Features of Epigenetic Inheritance and Evolution in Primates , 2011, Cell.

[44]  V. Bae-Jump,et al.  Association between uterine serous carcinoma and breast cancer. , 2004, Gynecologic oncology.

[45]  R. Weksberg,et al.  Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray , 2013, Epigenetics.

[46]  Uwe Wagner,et al.  A transcriptome-based global map of signaling pathways in the ovarian cancer microenvironment associated with clinical outcome , 2016, Genome Biology.

[47]  Wendy A Bickmore,et al.  Redistribution of H3K27me3 upon DNA hypomethylation results in de-repression of Polycomb target genes , 2013, Genome Biology.

[48]  Ying-Chao Lin,et al.  Methods for identifying differentially methylated regions for sequence- and array-based data. , 2016, Briefings in functional genomics.

[49]  Jeffrey T Leek,et al.  Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. , 2012, International journal of epidemiology.

[50]  David Bender,et al.  An alternative ZMIZ1 promoter exhibits higher gene expression in epithelial ovarian cancer that is p53-independent , 2012 .

[51]  R. Jaenisch,et al.  Tracing Dynamic Changes of DNA Methylation at Single-Cell Resolution , 2015, Cell.

[52]  J. Barrett,et al.  Suppressed tumorigenicity of human endometrial cancer cells by the restored expression of the DCC gene , 2000, British Journal of Cancer.

[53]  David A. Cox,et al.  Solving Polynomial Equations: Foundations, Algorithms, and Applications (Algorithms and Computation in Mathematics) , 2005 .

[54]  Xiao Zhang,et al.  Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis , 2010, BMC Bioinformatics.

[55]  Sanja Sever,et al.  Anks1a regulates COPII-mediated anterograde transport of receptor tyrosine kinases critical for tumorigenesis , 2016, Nature Communications.

[56]  Yuhui Zheng,et al.  A Rough Set Bounded Spatially Constrained Asymmetric Gaussian Mixture Model for Image Segmentation , 2017, PloS one.

[57]  Steven J. M. Jones,et al.  Comprehensive molecular characterization of urothelial bladder carcinoma , 2014, Nature.

[58]  Peter A. Jones,et al.  The role of DNA methylation in directing the functional organization of the cancer epigenome , 2015, Genome research.

[59]  Alexandros Laios,et al.  Suppression of cancer stemness p21-regulating mRNA and microRNA signatures in recurrent ovarian cancer patient samples , 2012, Journal of Ovarian Research.

[60]  Masaki Kitajima,et al.  Identification of a new breast cancer-related gene by restriction landmark genomic scanning. , 2006, Anticancer research.

[61]  Kemp H. Kernstine,et al.  DNA methylation biomarkers for lung cancer , 2011, Tumor Biology.

[62]  The Cancer Genome Atlas Research Network,et al.  Comprehensive molecular characterization of urothelial bladder carcinoma , 2014, Nature.

[63]  Yue Lu,et al.  Abstract B22: Genome-wide methylation analysis reveals an independently validated CpG island methylator phenotype associated with favorable prognosis in acute myeloid leukemia. , 2015 .

[64]  Oliver Sieber,et al.  A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data , 2010, Genome Biology.

[65]  Rafael A. Irizarry,et al.  Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays , 2014, Bioinform..