BIMMER: a novel algorithm for detecting differential DNA methylation regions from MBDCap-seq data

DNA methylation is a common epigenetic marker that regulates gene expression. A robust and cost-effective way for measuring whole genome methylation is Methyl-CpG binding domain-based capture followed by sequencing (MBDCap-seq). In this study, we proposed BIMMER, a Hidden Markov Model (HMM) for differential Methylation Regions (DMRs) identification, where HMMs were proposed to model the methylation status in normal and cancer samples in the first layer and another HMM was introduced to model the relationship between differential methylation and methylation statuses in normal and cancer samples. To carry out the prediction for BIMMER, an Expectation-Maximization algorithm was derived. BIMMER was validated on the simulated data and applied to real MBDCap-seq data of normal and cancer samples. BIMMER revealed that 8.83% of the breast cancer genome are differentially methylated and the majority are hypo-methylated in breast cancer.

[1]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[2]  Howard Louis Yudkin,et al.  Channel state testing in information decoding , 1965 .

[3]  Mark S. Doderer,et al.  CMS: A Web-Based System for Visualization and Analysis of Genome-Wide Methylation Data of Human Cancers , 2013, PloS one.

[4]  M. Neuberger,et al.  CTNNBL1 Is a Novel Nuclear Localization Sequence-binding Protein That Recognizes RNA-splicing Factors CDC5L and Prp31 , 2011, The Journal of Biological Chemistry.

[5]  Jingmei Li,et al.  Confirmation of the reduction of hormone replacement therapy-related breast cancer risk for carriers of the HSD17B1_937_G variant , 2013, Breast Cancer Research and Treatment.

[6]  E. Villa-Moruzzi,et al.  PTPN12 controls PTEN and the AKT signalling to FAK and HER2 in migrating ovarian cancer cells , 2012, Molecular and Cellular Biochemistry.

[7]  J. Issa,et al.  A simple method for estimating global DNA methylation using bisulfite PCR of repetitive DNA elements. , 2004, Nucleic acids research.

[8]  Jia Fu,et al.  Decreased Expression of PTPN12 Correlates with Tumor Recurrence and Poor Survival of Patients with Hepatocellular Carcinoma , 2014, PloS one.

[9]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[10]  Y. Kagami,et al.  Inhibition of Bcl3 gene expression mediates the anti-proliferative action of estrogen in pituitary lactotrophs in primary culture , 2011, Molecular and Cellular Endocrinology.

[11]  K. Devriendt,et al.  Rearrangement of the human CDC5L gene by a t(6;19)(p21;q13.1) in a patient with multicystic renal dysplasia. , 1998, Genomics.

[12]  P. Park,et al.  Design and analysis of ChIP-seq experiments for DNA-binding proteins , 2008, Nature Biotechnology.

[13]  Juan Li,et al.  Association of Genetic Polymorphisms in HSD17B1, HSD17B2 and SHBG Genes with Hepatocellular Carcinoma Risk , 2014, Pathology & Oncology Research.

[14]  M. Ladanyi,et al.  Cell Cycle Regulator Gene CDC5L, a Potential Target for 6p12-p21 Amplicon in Osteosarcoma , 2008, Molecular Cancer Research.

[15]  E. Zabarovsky,et al.  Tumor suppressor Alpha B-crystallin (CRYAB) associates with the cadherin/catenin adherens junction and impairs NPC progression-associated properties , 2012, Oncogene.

[16]  Hee June Choi,et al.  Bcl3-dependent stabilization of CtBP1 is crucial for the inhibition of apoptosis and tumor progression in breast cancer. , 2010, Biochemical and biophysical research communications.

[17]  Zhaohui S. Qin,et al.  HPeak: an HMM-based algorithm for defining read-enriched regions in ChIP-Seq data , 2010, BMC Bioinformatics.

[18]  C. Orlando,et al.  Quantitative evaluation of DNA methylation by optimization of a differential-high resolution melt analysis protocol , 2009, Nucleic acids research.

[19]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[20]  Feng Lin,et al.  An HMM approach to genome-wide identification of differential histone modification sites from ChIP-seq data , 2008, Bioinform..

[21]  Li Yu,et al.  [DNA methylation and cancer]. , 2005, Zhonghua nei ke za zhi.

[22]  Dario Strbenac,et al.  Evaluation of affinity-based genome-wide DNA methylation data: effects of CpG density, amplification bias, and copy number variation. , 2010, Genome research.

[23]  M. Rao,et al.  Activation of Multiple Proto-oncogenic Tyrosine Kinases in Breast Cancer via Loss of the PTPN12 Phosphatase , 2011, Cell.

[24]  Pearlly Yan,et al.  Comparative study on ChIP-seq data: normalization and binding pattern characterization , 2009, Bioinform..

[25]  Michael Seifert,et al.  MeDIP-HMM: genome-wide identification of distinct DNA methylation states from high-density tiling arrays , 2012, Bioinform..

[26]  Bernard M. E. Moret,et al.  ChIPnorm: A Statistical Method for Normalizing and Identifying Differential Regions in Histone Modification ChIP-seq Libraries , 2012, PloS one.

[27]  E. Zabarovsky,et al.  Cysteine-rich intestinal protein 2 (CRIP2) acts as a repressor of NF-κB–mediated proangiogenic cytokine transcription to suppress tumorigenesis and angiogenesis , 2011, Proceedings of the National Academy of Sciences.

[28]  C. Lofton-Day,et al.  Comparative DNA methylation analysis in normal and tumour tissues and in cancer cell lines using differential methylation hybridisation. , 2007, The international journal of biochemistry & cell biology.

[29]  Natalie Jäger,et al.  Genome-wide mapping of DNA methylation: a quantitative technology comparison , 2010, Nature Biotechnology.

[30]  Brian J. Stevenson,et al.  Global DNA hypomethylation coupled to repressive chromatin domain formation and gene silencing in breast cancer. , 2012, Genome research.

[31]  W. Muller,et al.  Bcl3 selectively promotes metastasis of ERBB2-driven mammary tumors. , 2013, Cancer research.

[32]  B. Kuster,et al.  Functional analysis of the human CDC5L complex and identification of its components by mass spectrometry , 2000, The EMBO journal.