Dm6A-TSVM: detection of N6-methyladenosine (m6A) sites from RNA transcriptomes using the twin support vector machines

N6-methyladenosine (m6A) is closely related to various life processes and diseases. The detection of genomic-level m6A sites plays an important role in explaining its biological mechanism. However, the current mainstream m6A sites detection method has limited precision. In this paper, a novel m6A sites detection method called “Dm6A-TSVM” is proposed. The feature vectors are firstly extracted from the mRNA sequences according to their nucleotide chemical property and position statistical distribution characteristics. Then the two kinds of features are combined, and the m6A sites detection model is constructed by the twin support vector machines method. Finally, based on the standard yeast dataset, the cross-validation experimental method is used to verify Dm6A-TSVM. The results demonstrate that the Dm6A-TSVM method is significantly better than the current mainstream m6A sites detection method, and its accuracy (ACC) value reaches 82.81%.

[1]  Kwonho Hong,et al.  Emerging function of N6-methyladenosine in cancer. , 2018, Oncology letters.

[2]  K. Chou,et al.  iRNA-Methyl: Identifying N(6)-methyladenosine sites using pseudo nucleotide composition. , 2015, Analytical biochemistry.

[3]  D. McGuinness,et al.  m 6 a RNA Methylation: The Implications for Health and Disease , 2014 .

[4]  Yudong Zhang,et al.  Cerebral micro‐bleeding identification based on a nine‐layer convolutional neural network with stochastic pooling , 2019, Concurr. Comput. Pract. Exp..

[5]  Tao Pan,et al.  Dynamic RNA Modifications in Gene Expression Regulation , 2017, Cell.

[6]  Javaid A. Sheikh,et al.  An efficient watermarking technique for tamper detection and localization of medical images , 2018, J. Ambient Intell. Humaniz. Comput..

[7]  Yudong Zhang,et al.  Single slice based detection for Alzheimer’s disease via wavelet entropy and multilayer perceptron trained by biogeography-based optimization , 2018, Multimedia Tools and Applications.

[8]  Schraga Schwartz,et al.  High-Resolution Mapping Reveals a Conserved, Widespread, Dynamic mRNA Methylation Program in Yeast Meiosis , 2013, Cell.

[9]  Wei Chen,et al.  Identification and analysis of the N6-methyladenosine in the Saccharomyces cerevisiae transcriptome , 2015, Scientific Reports.

[10]  Reshma Khemchandani,et al.  Twin Support Vector Machines for Pattern Classification , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Ho-Jin Choi,et al.  DNA Encoding for Splice Site Prediction in Large DNA Sequence , 2013, DASFAA Workshops.

[12]  Jalal A. Nasiri,et al.  LightTwinSVM: A Simple and Fast Implementation of Standard Twin Support Vector Machine Classifier , 2019, J. Open Source Softw..

[13]  Ran Su,et al.  Identifying N6-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine , 2017, Scientific Reports.

[14]  Ran Su,et al.  M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning , 2018, Molecular therapy. Nucleic acids.

[15]  Wei Chen,et al.  Detecting N6-methyladenosine sites from RNA transcriptomes using ensemble Support Vector Machines , 2017, Scientific Reports.

[16]  Christopher E. Mason,et al.  Single-nucleotide resolution mapping of m6A and m6Am throughout the transcriptome , 2015, Nature Methods.

[17]  Chuan He,et al.  Post-transcriptional gene regulation by mRNA modifications , 2016, Nature Reviews Molecular Cell Biology.

[18]  Massimo Pappalardo,et al.  Algorithms for Equilibria , 2018, Nonlinear Programming Techniques for Equilibria.

[19]  Junding Sun,et al.  High Performance Multiple Sclerosis Classification by Data Augmentation and AlexNet Transfer Learning Model , 2019, J. Medical Imaging Health Informatics.

[20]  O. Elemento,et al.  Comprehensive Analysis of mRNA Methylation Reveals Enrichment in 3′ UTRs and near Stop Codons , 2012, Cell.

[21]  Ming Yang,et al.  Multivariate Approach for Alzheimer's Disease Detection Using Stationary Wavelet Entropy and Predator-Prey Particle Swarm Optimization. , 2018, Journal of Alzheimer's disease : JAD.

[22]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[23]  Jalal A. Nasiri,et al.  KNN-based least squares twin support vector machine for pattern classification , 2018, Applied Intelligence.