MethylMix: an R package for identifying DNA methylation-driven genes

Summary: DNA methylation is an important mechanism regulating gene transcription, and its role in carcinogenesis has been extensively studied. Hyper and hypomethylation of genes is an alternative mechanism to deregulate gene expression in a wide range of diseases. At the same time, high-throughput DNA methylation assays have been developed generating vast amounts of genome wide DNA methylation measurements. Yet, few tools exist that can formally identify hypo and hypermethylated genes that are predictive of transcription and thus functionally relevant for a particular disease. To accommodate this lack of tools, we developed MethylMix, an algorithm implemented in R to identify disease specific hyper and hypomethylated genes. MethylMix is based on a beta mixture model to identify methylation states and compares them with the normal DNA methylation state. MethylMix introduces a novel metric, the ‘Differential Methylation value’ or DM-value defined as the difference of a methylation state with the normal methylation state. Finally, matched gene expression data are used to identify, besides differential, transcriptionally predictive methylation states by focusing on methylation changes that effect gene expression. Availability and implementation: MethylMix was implemented as an R package and is available in bioconductor. Contact: olivier.gevaert@stanford.edu

[1]  S. Plevritis,et al.  Glioblastoma multiforme: exploratory radiogenomic analysis by using quantitative image features. , 2014, Radiology.

[2]  R. Tibshirani,et al.  Pancancer analysis of DNA methylation-driven genes using MethylMix , 2015, Genome Biology.

[3]  J. Mosser,et al.  DNA methylation in glioblastoma: impact on gene expression and clinical outcome , 2010, BMC Genomics.

[4]  S. Plevritis,et al.  Identification of ovarian cancer driver genes by using module network integration of multi-omics data , 2013, Interface Focus.

[5]  C. Ricketts,et al.  Genome-wide DNA methylation profiling of CpG islands in breast cancer identifies novel genes associated with tumorigenicity. , 2011, Cancer research.

[6]  Dan Wang,et al.  IMA: an R package for high-throughput analysis of Illumina's 450K Infinium methylation data , 2012, Bioinform..

[7]  Luca Regli,et al.  Clinical Trial Substantiates the Predictive Value of O-6-Methylguanine-DNA Methyltransferase Promoter Methylation in Glioblastoma Patients Treated with Temozolomide , 2004, Clinical Cancer Research.

[8]  Hua Yu,et al.  COHCAP: an integrative genomic pipeline for single-nucleotide resolution DNA methylation analysis , 2013, Nucleic acids research.

[9]  Olivier Gevaert,et al.  Identifying Master Regulators of Cancer and Their Downstream Targets by Integrating Genomic and Epigenomic Features , 2012, Pacific Symposium on Biocomputing.

[10]  Rafael A. Irizarry,et al.  Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays , 2014, Bioinform..

[11]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.