Prediction of DNA methylation for every CpG loci in chromosome 18 in embryonic stem cells using support vector machine

Embryonic stem cells (ESC) are pluripotent cells, had been obtained from an embryo in a few days old in which all 200+ kinds of tissue in the human body originate a fact which makes them potentially important in medicine. DNA methylation on the other hand plays an important role in gene modification leading to change in biological processes as pluripotent and differentiated cells or reasoned for many diseases. Cytosines in CpG dinucleotides can be methylated leading to concentrate to predict methylation in CpG loci inside DNA by helping of results coming from whole genome bisulfite sequencing (WGBS) experiment but very expensive and Illumina 450K array experiment that covers less than 2% of CpG locus. A machine learning model has been implemented using support vector machine classifier (SVM) that is trained on chromosome 18 of the human H1 ESC with both 450K array data and WGBS. The SVM gives an accuracy of 99% when testing on chromosome 18 of the human H9 ESC to predict methyl or un-methyl for each CpG loci in the chromosome separately.

[1]  T. Spector,et al.  Predicting genome-wide DNA methylation using methylation marks, genomic position, and DNA regulatory elements , 2013, Genome Biology.

[2]  G. Fan,et al.  DNA Methylation and Its Basic Function , 2013, Neuropsychopharmacology.

[3]  U. Lim,et al.  Dietary and lifestyle factors of DNA methylation. , 2012, Methods in molecular biology.

[4]  Daiya Takai,et al.  Comprehensive analysis of CpG islands in human chromosomes 21 and 22 , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[5]  S. Roberts,et al.  Genome-wide profiling of DNA methylation and gene expression in Crassostrea gigas male gametes , 2014, Front. Physiol..

[6]  Peter A. Jones Functions of DNA methylation: islands, start sites, gene bodies and beyond , 2012, Nature Reviews Genetics.

[7]  Li Zhang,et al.  Genome-Wide Profiling of DNA Methylation Reveals a Class of Normally Methylated CpG Island Promoters , 2007, PLoS genetics.

[8]  Dong Xu,et al.  Predicting DNA Methylation State of CpG Dinucleotide Using Genome Topological Features and Deep Networks , 2016, Scientific Reports.

[9]  Daiya Takai,et al.  The CpG Island Searcher: A new WWW resource , 2003, Silico Biol..

[10]  Kimberly P Keil,et al.  DNA methylation: a mechanism linking environmental chemical exposures to risk of autism spectrum disorders? , 2016, Environmental epigenetics.

[11]  Mengchi Wang,et al.  Computationally expanding infinium HumanMethylation450 BeadChip array data to reveal distinct DNA methylation patterns of rheumatoid arthritis , 2016, Bioinform..

[12]  Timothy J. Peters,et al.  Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling , 2016, Genome Biology.

[13]  Michael Q. Zhang,et al.  Computational prediction of methylation status in human genomic sequences. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[14]  H. Prydz,et al.  CpG islands as gene markers in the human genome. , 1992, Genomics.

[15]  Wei Wang,et al.  Predicting CpG methylation levels by integrating Infinium HumanMethylation450 BeadChip array data. , 2016, Genomics.