DM-BLD: differential methylation detection using a hierarchical Bayesian model exploiting local dependency

Motivation: The advent of high-throughput DNA methylation profiling techniques has enabled the possibility of accurate identification of differentially methylated genes for cancer research. The large number of measured loci facilitates whole genome methylation study, yet posing great challenges for differential methylation detection due to the high variability in tumor samples. Results: We have developed a novel probabilistic approach, differential methylation detection using a hierarchical Bayesian model exploiting local dependency (DM-BLD), to detect differentially methylated genes based on a Bayesian framework. The DM-BLD approach features a joint model to capture both the local dependency of measured loci and the dependency of methylation change in samples. Specifically, the local dependency is modeled by Leroux conditional autoregressive structure; the dependency of methylation changes is modeled by a discrete Markov random field. A hierarchical Bayesian model is developed to fully take into account the local dependency for differential analysis, in which differential states are embedded as hidden variables. Simulation studies demonstrate that DM-BLD outperforms existing methods for differential methylation detection, particularly when the methylation change is moderate and the variability of methylation in samples is high. DM-BLD has been applied to breast cancer data to identify important methylated genes (such as polycomb target genes and genes involved in transcription factor activity) associated with breast cancer recurrence. Availability and Implementation: A Matlab package of DM-BLD is available at http://www.cbil.ece.vt.edu/software.htm. Contact: Xuan@vt.edu Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[2]  Sudha Ananth,et al.  Molecular Mechanism of SLC5A8 Inactivation in Breast Cancer , 2013, Molecular and Cellular Biology.

[3]  Peter L Molloy,et al.  De novo identification of differentially methylated regions in the human genome , 2015, Epigenetics & Chromatin.

[4]  Yun Yen,et al.  Methylation of homeobox genes is a frequent and early epigenetic event in breast cancer , 2009, Breast Cancer Research.

[5]  Ellen R. Laird,et al.  Molecular basis for interaction of the protein tyrosine kinase ZAP-70 with the T-cell receptor , 2007, Nature.

[6]  Jeffrey T Leek,et al.  Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. , 2012, International journal of epidemiology.

[7]  A. Feinberg,et al.  The history of cancer epigenetics , 2004, Nature Reviews Cancer.

[8]  K. Gunderson,et al.  High density DNA methylation array with single CpG site resolution. , 2011, Genomics.

[9]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[10]  Jun Wang,et al.  Methylation profiling of 48 candidate genes in tumor and matched normal tissues from breast cancer patients , 2015, Breast Cancer Research and Treatment.

[11]  Wei Pan,et al.  Network‐based genomic discovery: application and comparison of Markov random‐field models , 2010, Journal of the Royal Statistical Society. Series C, Applied statistics.

[12]  M. Esteller,et al.  DNA methylation and cancer. , 2010, Advances in genetics.

[13]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[14]  V. Ganapathy,et al.  The plasma membrane transporter SLC5A8 suppresses tumour progression through depletion of survivin without involving its transport function. , 2013, The Biochemical journal.

[15]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[16]  J. Atchison,et al.  Logistic-normal distributions:Some properties and uses , 1980 .

[17]  D. Clayton,et al.  Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. , 1987, Biometrics.

[18]  A. Feinberg Phenotypic plasticity and the epigenetics of human disease , 2007, Nature.

[19]  M. Esteller,et al.  Validation of a DNA methylation microarray for 450,000 CpG sites in the human genome , 2011, Epigenetics.

[20]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[21]  A. Meissner Epigenetic modifications in pluripotent and differentiated cells , 2010, Nature Biotechnology.

[22]  Wei Zheng,et al.  CMTM3, located at the critical tumor suppressor locus 16q22.1, is silenced by CpG methylation in carcinomas and inhibits tumor cell growth through inducing apoptosis. , 2009, Cancer research.

[23]  S Ramchandani,et al.  DNA methylation is a reversible biological signal. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Duncan Lee,et al.  A comparison of conditional autoregressive models used in Bayesian disease mapping. , 2011, Spatial and spatio-temporal epidemiology.

[25]  D. Berry,et al.  Statistical models in epidemiology, the environment, and clinical trials , 2000 .

[26]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[27]  Peter A. Jones,et al.  A decade of exploring the cancer epigenome — biological and translational implications , 2011, Nature Reviews Cancer.

[28]  A. Bird DNA methylation patterns and epigenetic memory. , 2002, Genes & development.

[29]  F. H. Garrison The history of cancer. , 2012, Bulletin of the New York Academy of Medicine.

[30]  Michael Rose,et al.  OASIS/CREB3L1 is epigenetically silenced in human bladder cancer facilitating tumor cell spreading and migration in vitro , 2014, Epigenetics.

[31]  Stephan Beck,et al.  Probe Lasso: A novel method to rope in differentially methylated regions with 450K DNA methylation data , 2015, Methods.

[32]  Dan Wang,et al.  IMA: an R package for high-throughput analysis of Illumina's 450K Infinium methylation data , 2012, Bioinform..

[33]  David Euhus,et al.  Identification of 5 novel genes methylated in breast and other epithelial cancers , 2010, Molecular Cancer.

[34]  Norman E. Breslow,et al.  Estimation of Disease Rates in Small Areas: A new Mixed Model for Spatial Dependence , 2000 .

[35]  Wendy Kennedy,et al.  Negative regulation of G1/S transition by the candidate bladder tumour suppressor gene DBCCR1 , 2001, Oncogene.

[36]  Cory Y. McLean,et al.  GREAT improves functional interpretation of cis-regulatory regions , 2010, Nature Biotechnology.

[37]  Brent S. Pedersen,et al.  Comb-p: software for combining, analyzing, grouping and correcting spatially correlated P-values , 2012, Bioinform..