m6aViewer: software for the detection, analysis, and visualization of N6-methyladenosine peaks from m6A-seq/ME-RIP sequencing data

Recent methods for transcriptome-wide N6-methyladenosine (m6A) profiling have facilitated investigations into the RNA methylome and established m6A as a dynamic modification that has critical regulatory roles in gene expression and may play a role in human disease. However, bioinformatics resources available for the analysis of m6A sequencing data are still limited. Here, we describe m6aViewer-a cross-platform application for analysis and visualization of m6A peaks from sequencing data. m6aViewer implements a novel m6A peak-calling algorithm that identifies high-confidence methylated residues with more precision than previously described approaches. The application enables data analysis through a graphical user interface, and thus, in contrast to other currently available tools, does not require the user to be skilled in computer programming. m6aViewer and test data can be downloaded here: http://dna2.leeds.ac.uk/m6a.

[1]  Arne Klungland,et al.  ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. , 2013, Molecular cell.

[2]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[3]  Qi Zhou,et al.  m(6)A RNA methylation is regulated by microRNAs and promotes reprogramming to pluripotency. , 2015, Cell stem cell.

[4]  Sean R Eddy,et al.  What is a hidden Markov model? , 2004, Nature Biotechnology.

[5]  Schraga Schwartz,et al.  Perturbation of m6A writers reveals two distinct classes of mRNA methylation at internal and 5' sites. , 2014, Cell reports.

[6]  M. Kupiec,et al.  Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq , 2012, Nature.

[7]  Christopher E. Mason,et al.  Single-nucleotide resolution mapping of m6A and m6Am throughout the transcriptome , 2015, Nature Methods.

[8]  Jianxin Wu Hidden Markov model , 2018 .

[9]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[10]  Ana Kozomara,et al.  miRBase: annotating high confidence microRNAs using deep sequencing data , 2013, Nucleic Acids Res..

[11]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[12]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[13]  Xiaodong Cui,et al.  Exome-based analysis for RNA epigenome sequencing data , 2013, Bioinform..

[14]  Robert Petryszak,et al.  ArrayExpress update—simplifying data submissions , 2014, Nucleic Acids Res..

[15]  S. Salzberg,et al.  Microbial gene identification using interpolated Markov models. , 1998, Nucleic acids research.

[16]  Sanjeev Galande,et al.  One size does not fit all: On how Markov model order dictates performance of genomic sequence analyses , 2012, Nucleic acids research.

[17]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[18]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[19]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[20]  Yufei Huang,et al.  HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data , 2015, 2013 IEEE International Workshop on Genomic Signal Processing and Statistics.

[21]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[22]  S. Tavazoie,et al.  N6-methyladenosine marks primary microRNAs for processing , 2015, Nature.

[23]  O. Elemento,et al.  Comprehensive Analysis of mRNA Methylation Reveals Enrichment in 3′ UTRs and near Stop Codons , 2012, Cell.

[24]  A. Raftery,et al.  The Mixture Transition Distribution Model for High-Order Markov Chains and Non-Gaussian Time Series , 2002 .

[25]  Qi Xu,et al.  An association study of the m6A genes with major depressive disorder in Chinese Han population. , 2015, Journal of affective disorders.

[26]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[27]  Yufei Huang,et al.  A protocol for RNA methylation differential analysis with MeRIP-Seq data and exomePeak R/Bioconductor package. , 2014, Methods.

[28]  Samie R. Jaffrey,et al.  The dynamic epitranscriptome: N6-methyladenosine and gene expression control , 2014, Nature Reviews Molecular Cell Biology.

[29]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[30]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[31]  S. Salzberg,et al.  Interpolated Markov models for eukaryotic gene finding. , 1999, Genomics.

[32]  Xiaodong Cui,et al.  A novel algorithm for calling mRNA m6A peaks by modeling biological variances in MeRIP-seq data , 2016, Bioinform..

[33]  Wei Chen,et al.  Identification and analysis of the N6-methyladenosine in the Saccharomyces cerevisiae transcriptome , 2015, Scientific Reports.

[34]  Volodymyr Melnykov,et al.  Initializing the EM algorithm in Gaussian mixture models with an unknown number of components , 2012, Comput. Stat. Data Anal..

[35]  Q. Cui,et al.  SRAMP: prediction of mammalian N6-methyladenosine (m6A) sites based on sequence-derived features , 2016, Nucleic acids research.

[36]  Chuong B Do,et al.  What is the expectation maximization algorithm? , 2008, Nature Biotechnology.

[37]  Ranjan Maitra Initializing Partition-Optimization Algorithms , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[38]  Jun Yu,et al.  MeRIP-PF: An Easy-to-use Pipeline for High-resolution Peak-finding in MeRIP-Seq Data , 2013, Genom. Proteom. Bioinform..

[39]  Charles Elkan,et al.  Expectation Maximization Algorithm , 2010, Encyclopedia of Machine Learning.

[40]  Chuan He,et al.  FTO-dependent demethylation of N6-methyladenosine regulates mRNA splicing and is required for adipogenesis , 2014, Cell Research.

[41]  H WittenIan,et al.  The WEKA data mining software , 2009 .

[42]  A. Raftery A model for high-order Markov chains , 1985 .

[43]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.