Identifying N6-methyladenosine sites in the Arabidopsis thaliana transcriptome

N6-Methyladenosine (m6A) plays important roles in many biological processes. The knowledge of the distribution of m6A is helpful for understanding its regulatory roles. Although the experimental methods have been proposed to detect m6A, the resolutions of these methods are still unsatisfying especially for Arabidopsis thaliana. Benefitting from the experimental data, in the current work, a support vector machine-based method was proposed to identify m6A sites in A. thaliana transcriptome. The proposed method was validated on a benchmark dataset using jackknife test and was also validated by identifying strain-specific m6A sites in A. thaliana. The obtained predictive results indicate that the proposed method is quite promising. For the convenience of experimental biologists, an online webserver for the proposed method was built, which is freely available at http://lin.uestc.edu.cn/server/M6ATH. These results indicate that the proposed method holds a potential to become an elegant tool in identifying m6A site in A. thaliana.

[1]  Ian H. Witten,et al.  Data mining in bioinformatics using Weka , 2004, Bioinform..

[2]  Chengqi Yi,et al.  N6-Methyladenosine in Nuclear RNA is a Major Substrate of the Obesity-Associated FTO , 2011, Nature chemical biology.

[3]  K. Chou Some remarks on protein attribute prediction and pseudo amino acid composition , 2010, Journal of Theoretical Biology.

[4]  Jef Rozenski,et al.  The RNA modification database, RNAMDB: 2011 update , 2010, Nucleic Acids Res..

[5]  Wei Chen,et al.  Prediction of replication origins by calculating DNA structural properties , 2012, FEBS letters.

[6]  Zhengwei Zhu,et al.  CD-HIT: accelerated for clustering the next-generation sequencing data , 2012, Bioinform..

[7]  M. Kupiec,et al.  Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq , 2012, Nature.

[8]  Renzhi Cao,et al.  SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines , 2013, BMC Bioinformatics.

[9]  Schraga Schwartz,et al.  High-Resolution Mapping Reveals a Conserved, Widespread, Dynamic mRNA Methylation Program in Yeast Meiosis , 2013, Cell.

[10]  Wei Chen,et al.  iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition , 2013, Nucleic acids research.

[11]  Hui Ding,et al.  AcalPred: A Sequence-Based Tool for Discriminating between Acidic and Alkaline Enzymes , 2013, PloS one.

[12]  Zheng Wang,et al.  Designing and evaluating the MULTICOM protein local and global model quality prediction methods in the CASP10 experiment , 2014, BMC Structural Biology.

[13]  T. Nilsen Internal mRNA Methylation Finally Finds Functions , 2014, Science.

[14]  Miao Yu,et al.  A METTL3-METTL14 complex mediates mammalian nuclear RNA N6-adenosine methylation , 2013, Nature chemical biology.

[15]  Wei Chen,et al.  Predicting the Types of J-Proteins Using Clustered Amino Acids , 2014, BioMed research international.

[16]  Pengmian Feng,et al.  Prediction of DNase I Hypersensitive Sites by Using Pseudo Nucleotide Compositions , 2014, TheScientificWorldJournal.

[17]  K. Chou,et al.  iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition , 2014, BioMed research international.

[18]  Wei Chen,et al.  iTIS-PseTNC: a sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition. , 2014, Analytical biochemistry.

[19]  Samie R. Jaffrey,et al.  The dynamic epitranscriptome: N6-methyladenosine and gene expression control , 2014, Nature Reviews Molecular Cell Biology.

[20]  Zhike Lu,et al.  Unique Features of the m6A Methylome in Arabidopsis thaliana , 2014, Nature Communications.

[21]  K. Chou,et al.  iRNA-Methyl: Identifying N(6)-methyladenosine sites using pseudo nucleotide composition. , 2015, Analytical biochemistry.

[22]  Qi Zhou,et al.  m(6)A RNA methylation is regulated by microRNAs and promotes reprogramming to pluripotency. , 2015, Cell stem cell.

[23]  Erez Y. Levanon,et al.  m6A mRNA methylation facilitates resolution of naïve pluripotency toward differentiation , 2015, Science.

[24]  Wei Chen,et al.  Identification and analysis of the N6-methyladenosine in the Saccharomyces cerevisiae transcriptome , 2015, Scientific Reports.

[25]  Christopher E. Mason,et al.  Single-nucleotide resolution mapping of m6A and m6Am throughout the transcriptome , 2015, Nature Methods.

[26]  Wei Chen,et al.  iRNA-PseU: Identifying RNA pseudouridine sites , 2016, Molecular therapy. Nucleic acids.

[27]  Miao Yu,et al.  A METTL 3-METTL 14 complex mediates mammalian nuclear RNA N 6-adenosine methylation , 2016 .

[28]  Q. Cui,et al.  SRAMP: prediction of mammalian N6-methyladenosine (m6A) sites based on sequence-derived features , 2016, Nucleic acids research.

[29]  Wei Chen,et al.  Identifying 2'-O-methylationation sites by integrating nucleotide chemical properties and nucleotide compositions. , 2016, Genomics.

[30]  Wei Chen,et al.  MethyRNA: a web server for identification of N6-methyladenosine sites , 2017, Journal of biomolecular structure & dynamics.