MM-6mAPred: identifying DNA N6-methyladenine sites based on Markov model

MOTIVATION Recent studies have shown that DNA N6-methyladenine (6mA) plays an important role in epigenetic modification of eukaryotic organisms. It has been found that 6mA is closely related to embryonic development, stress response, and so on. Developing a new algorithm to quickly and accurately identify 6mA sites in genomes is important for explore their biological functions. RESULTS In this paper, we proposed a new classification method called MM-6mAPred based on a Markov model which makes use of the transition probability between adjacent nucleotides to identify 6mA site. The sensitivity and specificity of our method are 89.32% and 90.11%, respectively. The overall accuracy of our method is 89.72%, which is 6.59% higher than that of the previous method i6mA-Pred. It indicated that, compared with the 41 nucleotide chemical properties used by i6mA-Pred, the transition probability between adjacent nucleotides can capture more discriminant sequence information. AVAILABILITY The web server of MM-6mAPred is freely accessible at http://www.insect-genome.com/MM-6mAPred/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Peng Jin,et al.  DNA N6-methyladenine is dynamically regulated in the mouse brain following environmental stress , 2017, Nature Communications.

[2]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[3]  Wei Chen,et al.  i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome , 2019, Bioinform..

[4]  H Almagor,et al.  A Markov analysis of DNA sequences. , 1983, Journal of theoretical biology.

[5]  Chuan He,et al.  Abundant DNA 6mA methylation during early embryogenesis of zebrafish and pig , 2016, Nature Communications.

[6]  James A. Swenberg,et al.  DNA methylation on N6-adenine in mammalian embryonic stem cells , 2016, Nature.

[7]  Minghui He,et al.  N6-Methyladenine DNA Modification in the Human Genome. , 2018, Molecular cell.

[8]  Shunmin He,et al.  N6-Methyladenine DNA Modification in Drosophila , 2015, Cell.

[9]  Yu Zhao,et al.  Identification and analysis of adenine N6-methylation sites in the rice genome , 2018, Nature Plants.

[10]  Jonathan D. Wren,et al.  Markov model recognition and classification of DNA/protein sequences within large text databases , 2005, Bioinform..

[11]  L. Doré,et al.  N 6-Methyldeoxyadenosine Marks Active Transcription Start Sites in Chlamydomonas , 2015, Cell.

[12]  Elmar Nöth,et al.  Interpolated markov chains for eukaryotic promoter recognition , 1999, Bioinform..

[13]  Harry Venner,et al.  Nachweis von Minoritätsbasen in Sperma-Desoxyribonucleinsäure , 1966 .

[14]  David Haussler,et al.  Improved splice site detection in Genie , 1997, RECOMB '97.

[15]  L. Aravind,et al.  DNA Methylation on N6-Adenine in C. elegans , 2015, Cell.

[16]  Hao Liu,et al.  Rice Information GateWay: A Comprehensive Bioinformatics Platform for Indica Rice Genomes. , 2017, Molecular plant.

[17]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[18]  M. Borodovsky,et al.  Detection of new genes in a bacterial genome using Markov models for three gene classes. , 1995, Nucleic acids research.

[19]  A Janulaitis,et al.  Cytosine modification in DNA by BcnI methylase yields N 4‐methylcytosine , 1983, FEBS letters.