Convolutional Neural Networks In Classifying Cancer Through DNA Methylation

DNA Methylation has been the most extensively studied epigenetic mark. Usually a change in the genotype, DNA sequence, leads to a change in the phenotype, observable characteristics of the individual. But DNA methylation, which happens in the context of CpG (cytosine and guanine bases linked by phosphate backbone) dinucleotides, does not lead to a change in the original DNA sequence but has the potential to change the phenotype. DNA methylation is implicated in various biological processes and diseases including cancer. Hence there is a strong interest in understanding the DNA methylation patterns across various epigenetic related ailments in order to distinguish and diagnose the type of disease in its early stages. In this work, the relationship between methylated versus unmethylated CpG regions and cancer types is explored using Convolutional Neural Networks (CNNs). A CNN based Deep Learning model that can classify the cancer of a new DNA methylation profile based on the learning from publicly available DNA methylation datasets is then proposed.

[1]  Lee E. Edsall,et al.  Human DNA methylomes at base resolution show widespread epigenomic differences , 2009, Nature.

[2]  Zhongwei Si,et al.  Learning Deep Features for DNA Methylation Data Analysis , 2016, IEEE Access.

[3]  Xiao Zhang,et al.  Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis , 2010, BMC Bioinformatics.

[4]  W. Reik,et al.  Epigenetic Reprogramming in Mammalian Development , 2001, Science.

[5]  A. Feinberg,et al.  Increased methylation variation in epigenetic domains across cancer types , 2011, Nature Genetics.

[6]  En Li,et al.  De novo methylation of MMLV provirus in embryonic stem cells: CpG versus non-CpG methylation. , 2002, Gene.

[7]  Martin J Aryee,et al.  Differential methylation of tissue- and cancer-specific CpG island shores distinguishes human induced pluripotent stem cells, embryonic stem cells and fibroblasts , 2009, Nature Genetics.

[8]  Frank Lyko,et al.  LUMA (LUminometric Methylation Assay)--a high throughput method to the analysis of genomic DNA methylation. , 2006, Experimental cell research.

[9]  B. H. Miller,et al.  Epigenetic mechanisms in schizophrenia. , 2015, Progress in biophysics and molecular biology.

[10]  Howard Cedar,et al.  Programming of DNA methylation patterns. , 2012, Annual review of biochemistry.

[11]  A. Bird CpG-rich islands and the function of DNA methylation , 1986, Nature.

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  Robert S. Illingworth,et al.  Orphan CpG Islands Identify Numerous Conserved Promoters in the Mammalian Genome , 2010, PLoS genetics.

[14]  J. L. Paternáin,et al.  Specific gene hypomethylation and cancer: New insights into coding region feature trends , 2009, Bioinformation.

[15]  S. Horvath DNA methylation age of human tissues and cell types , 2013, Genome Biology.

[16]  Marco Masseroli,et al.  TCGA2BED: extracting, extending, integrating, and querying The Cancer Genome Atlas , 2017, BMC Bioinformatics.

[17]  D. Balding,et al.  Epigenome-wide association studies for common human diseases , 2011, Nature Reviews Genetics.

[18]  Ivo G Gut,et al.  DNA methylation analysis by pyrosequencing , 2007, Nature Protocols.

[19]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[20]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[21]  Liguo Song,et al.  Specific method for the determination of genomic DNA methylation by liquid chromatography-electrospray ionization tandem mass spectrometry. , 2005, Analytical chemistry.

[22]  T. Mikkelsen,et al.  Genome-scale DNA methylation maps of pluripotent and differentiated cells , 2008, Nature.

[23]  Michael B. Stadler,et al.  Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome , 2007, Nature Genetics.

[24]  J. Arand,et al.  Epigenetic Reprogramming in Mammalian Development , 2012 .

[25]  A. Bird DNA methylation patterns and epigenetic memory. , 2002, Genes & development.

[26]  J. Kaprio,et al.  Occurrence of rheumatoid arthritis in a nationwide series of twins. , 1986, The Journal of rheumatology.

[27]  Nithya Ramakrishnan,et al.  Analysis of healthy and tumour DNA methylation distributions in kidney-renal-clear-cell-carcinoma using Kullback-Leibler and Jensen-Shannon distance measures. , 2017, IET systems biology.