Sequence analysis iDNA 4 mC : identifying DNA N 4-methylcytosine sites based on nucleotide chemical properties

Motivation: DNA N-methylcytosine (4mC) is an epigenetic modification. The knowledge about the distribution of 4mC is helpful for understanding its biological functions. Although experimental methods have been proposed to detect 4mC sites, they are expensive for performing genome-wide detections. Thus, it is necessary to develop computational methods for predicting 4mC sites. Results: In this work, we developed iDNA4mC, the first webserver to identify 4mC sites, in which DNA sequences are encoded with both nucleotide chemical properties and nucleotide frequency. The predictive results of the rigorous jackknife test and cross species test demonstrated that the performance of iDNA4mC is quite promising and holds high potential to become a useful tool for identifying 4mC sites. Availability and implementation: The user-friendly web-server, iDNA4mC, is freely accessible at http://lin.uestc.edu.cn/server/iDNA4mC. Contact: chenweiimu@gmail.com or hlin@uestc.edu.cn

[1]  A. Jeltsch,et al.  The Cytosine N4-Methyltransferase M.PvuII Also Modifies Adenine Residues , 2001, Biological chemistry.

[2]  Wei Chen,et al.  iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition , 2014, Nucleic acids research.

[3]  Zhengwei Zhu,et al.  CD-HIT: accelerated for clustering the next-generation sequencing data , 2012, Bioinform..

[4]  Zhi Xie,et al.  MethSMRT: an integrative database for DNA N6-methyladenine and N4-methylcytosine generated by single-molecular real-time sequencing , 2016, Nucleic Acids Res..

[5]  Huiping Wang,et al.  Analysis and identification of toxin targets by topological properties in protein-protein interaction network. , 2014, Journal of theoretical biology.

[6]  Dongwan D. Kang,et al.  The Epigenomic Landscape of Prokaryotes , 2016, PLoS genetics.

[7]  Maria Strazzullo,et al.  DNA methylation 40 years later: Its role in human health and disease , 2005, Journal of cellular physiology.

[8]  Robert J. Schmitz,et al.  Base-resolution detection of N4-methylcytosine in genomic DNA using 4mC-Tet-assisted-bisulfite- sequencing , 2015, Nucleic acids research.

[9]  P. Modrich,et al.  Mechanisms and biological effects of mismatch repair. , 1991, Annual review of genetics.

[10]  R. Blumenthal,et al.  Finding a basis for flipping bases. , 1996, Structure.

[11]  Shunmin He,et al.  N6-Methyladenine DNA Modification in Drosophila , 2015, Cell.

[12]  M. Esteller,et al.  An Adenine Code for DNA: A Second Life for N6-Methyladenine , 2015, Cell.

[13]  Tyson A. Clark,et al.  Direct detection of DNA methylation during single-molecule, real-time sequencing , 2010, Nature Methods.

[14]  W. Messer,et al.  Timing and targeting: The biological functions of Dam methylation in E. coli , 1988, Cell.

[15]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[16]  Ho-Jin Choi,et al.  DNA Encoding for Splice Site Prediction in Large DNA Sequence , 2013, DASFAA Workshops.

[17]  R. Roberts,et al.  Hhal methyltransferase flips its target base out of the DNA helix , 1994, Cell.

[18]  Matthew K Waldor,et al.  Entering the era of bacterial epigenomics with single molecule real time DNA sequencing. , 2013, Current opinion in microbiology.

[19]  K. Chou Some remarks on protein attribute prediction and pseudo amino acid composition , 2010, Journal of Theoretical Biology.

[20]  K. Chou,et al.  iACP: a sequence-based tool for identifying anticancer peptides , 2016, Oncotarget.

[21]  R. Blumenthal,et al.  Structure of pvu II DNA-(cytosine N4) methyltransferase, an example of domain permutation and protein fold assignment. , 1997, Nucleic acids research.

[22]  Qi Zhao,et al.  GPS-SUMO: a tool for the prediction of sumoylation sites and SUMO-interaction motifs , 2014, Nucleic Acids Res..

[23]  L. Doré,et al.  N 6-Methyldeoxyadenosine Marks Active Transcription Start Sites in Chlamydomonas , 2015, Cell.

[24]  Xiaodong Cheng,et al.  DNA modification by methyltransferases. , 1995, Current opinion in structural biology.

[25]  Wei Chen,et al.  iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences , 2016, Oncotarget.

[26]  L. Aravind,et al.  DNA Methylation on N 6-Adenine in C . elegans Graphical Abstract Highlights , 2015 .

[27]  K. Chou,et al.  iRNA-Methyl: Identifying N(6)-methyladenosine sites using pseudo nucleotide composition. , 2015, Analytical biochemistry.

[28]  K. Zhao,et al.  Lsh, chromatin remodeling family member, modulates genome-wide cytosine methylation patterns at nonrepeat sequences , 2011, Proceedings of the National Academy of Sciences.

[29]  Wei Chen,et al.  iRNA-PseU: Identifying RNA pseudouridine sites , 2016, Molecular therapy. Nucleic acids.

[30]  R J Roberts,et al.  On base flipping , 1995, Cell.

[31]  K. Chou Impacts of bioinformatics to medicinal chemistry. , 2015, Medicinal chemistry (Shariqah (United Arab Emirates)).

[32]  L. Aravind,et al.  DNA Methylation on N6-Adenine in C. elegans , 2015, Cell.

[33]  S. Turner,et al.  Going beyond five bases in DNA sequencing. , 2012, Current opinion in structural biology.

[34]  Richard J. Roberts,et al.  REBASE—a database for DNA restriction and modification: enzymes, genes and genomes , 2009, Nucleic Acids Res..

[35]  Howard Cedar,et al.  Epigenetic mechanisms that regulate antigen receptor gene expression. , 2003, Current opinion in immunology.