Methylartist: tools for visualizing modified bases from nanopore sequence data

Methylartist is a consolidated suite of tools for processing, visualising, and analysing nanopore methylation data derived from modified basecalling methods. All detectable methylation types (e.g. 5mCpG, 5hmC, 6mA) are supported, enabling integrated study of base pairs when modified naturally or as part of an experimental protocol. Background Covalent modification of nucleobases is an important component of genomic regulatory regimes across all domains of life [1–3] and is harnessed by various genomic footprinting assays, including DamID[4], SMAC-seq[5], and NOMe-seq[6]. Nanopore sequencing offers comprehensive assessment of base modifications from arbitrarily long sequence reads through analysis of electrical current profiles, generally through machine learning models trained to discriminate between modified and unmodified bases [7]. An increasing number of computational tools have been developed or enhanced for calling modified bases [8], including nanopolish [7], megalodon [9], and guppy [10], along with an increasing number of available pre-trained models.

[1]  L. Coin,et al.  Porpoise: a new approach for accurate prediction of RNA pseudouridine sites. , 2021, Briefings in bioinformatics.

[2]  Leszek P. Pryszcz,et al.  Quantitative profiling of pseudouridylation dynamics in native RNAs with nanopore sequencing , 2021, Nature Biotechnology.

[3]  Leszek P. Pryszcz,et al.  ModPhred: an integrative toolkit for the analysis and storage of nanopore sequencing DNA and RNA modification data , 2021, bioRxiv.

[4]  D. Cook,et al.  NanoMethViz: An R/Bioconductor package for visualizing long-read methylation data , 2021, bioRxiv.

[5]  Michael L. Waskom,et al.  Seaborn: Statistical Data Visualization , 2021, J. Open Source Softw..

[6]  G. Faulkner,et al.  Nanopore Sequencing Enables Comprehensive Transposable Element Epigenomic Profiling. , 2020, Molecular cell.

[7]  Dennis McNevin,et al.  Systematic benchmarking of tools for CpG methylation detection from nanopore sequencing , 2020, Nature Communications.

[8]  Leszek P. Pryszcz,et al.  Decoding ribosomal RNA modification dynamics at single molecule resolution , 2020, bioRxiv.

[9]  Jaime Fern'andez del R'io,et al.  Array programming with NumPy , 2020, Nature.

[10]  G. Faulkner,et al.  Nanopore sequencing enables comprehensive transposable element epigenomic profiling , 2020, bioRxiv.

[11]  Anton J. Enright,et al.  RNA modifications detection by comparative Nanopore direct RNA sequencing , 2019, Nature Communications.

[12]  Wouter De Coster,et al.  Methplotlib: analysis of modified nucleotides from nanopore sequencing , 2019, bioRxiv.

[13]  Svenn-Arne Dragly,et al.  Perineuronal nets stabilize the grid cell network , 2019, bioRxiv.

[14]  Johannes L. Schönberger,et al.  SciPy 1.0: fundamental algorithms for scientific computing in Python , 2019, Nature Methods.

[15]  J. Simpson,et al.  Simultaneous profiling of chromatin accessibility and methylation on human cell lines with nanopore sequencing , 2018, Nature Methods.

[16]  Nicholas A. Sinnott-Armstrong,et al.  Long-range single-molecule mapping of chromatin accessibility in eukaryotes , 2018, Nature Methods.

[17]  Feng Luo,et al.  DeepSignal: detecting DNA methylation state from Nanopore sequencing reads using deep-learning , 2018, bioRxiv.

[18]  Joshua M. Dempster,et al.  Genetic and transcriptional evolution alters cancer cell line drug response , 2018, Nature.

[19]  A. Lindås,et al.  The DNA Methylome of the Hyperthermoacidophilic Crenarchaeon Sulfolobus acidocaldarius , 2018, Front. Microbiol..

[20]  Richard,et al.  The non-specific adenine DNA methyltransferase M.EcoGII , 2017, Nucleic acids research.

[21]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[22]  Winston Timp,et al.  Detecting DNA cytosine methylation using nanopore sequencing , 2017, Nature Methods.

[23]  Giovanna Ambrosini,et al.  The eukaryotic promoter database in its 30th year: focus on non-vertebrate organisms , 2016, Nucleic Acids Res..

[24]  Hao Wu,et al.  Differential methylation analysis for BS-seq data under general experimental design , 2016, Bioinform..

[25]  Dongwan D. Kang,et al.  The Epigenomic Landscape of Prokaryotes , 2016, PLoS genetics.

[26]  Marius Raica,et al.  The Story of MCF-7 Breast Cancer Cell Line: 40 years of Experience in Research. , 2015, Anticancer research.

[27]  Leo van Iersel,et al.  WhatsHap: Weighted Haplotype Assembly for Future-Generation Sequencing Reads , 2015, J. Comput. Biol..

[28]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[29]  D. Zilberman,et al.  Genome-Wide Evolutionary Analysis of Eukaryotic DNA Methylation , 2010, Science.

[30]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[31]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[32]  S. Henikoff,et al.  Identification of in vivo DNA targets of chromatin proteins using tethered Dam methyltransferase , 2000, Nature Biotechnology.

[33]  A. Harris,et al.  Differential expression of human Wnt genes 2, 3, 4, and 7B in human breast cell lines and normal and disease states of human breast tissue. , 1994, Cancer research.

[34]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.