NanoMod: a computational tool to detect DNA modifications using Nanopore long-read sequencing data

Background Recent advances in single-molecule sequencing techniques, such as Nanopore sequencing, improved read length, increased sequencing throughput, and enabled direct detection of DNA modifications through the analysis of raw signals. These DNA modifications include naturally occurring modifications such as DNA methylations, as well as modifications that are introduced by DNA damage or through synthetic modifications to one of the four standard nucleotides. Methods To improve the performance of detecting DNA modifications, especially synthetically introduced modifications, we developed a novel computational tool called NanoMod. NanoMod takes raw signal data on a pair of DNA samples with and without modified bases, extracts signal intensities, performs base error correction based on a reference sequence, and then identifies bases with modifications by comparing the distribution of raw signals between two samples, while taking into account of the effects of neighboring bases on modified bases (“neighborhood effects”). Results We evaluated NanoMod on simulation data sets, based on different types of modifications and different magnitudes of neighborhood effects, and found that NanoMod outperformed other methods in identifying known modified bases. Additionally, we demonstrated superior performance of NanoMod on an E. coli data set with 5mC (5-methylcytosine) modifications. Conclusions In summary, NanoMod is a flexible tool to detect DNA modifications with single-base resolution from raw signals in Nanopore sequencing, and will greatly facilitate large-scale functional genomics experiments in the future that use modified nucleotides.

[1]  Ji Eun Lee,et al.  De novo Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing , 2017, bioRxiv.

[2]  E. Schadt,et al.  Single molecule-level detection and long read-based phasing of epigenetic variations in bacterial methylomes , 2014, Nature Communications.

[3]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .

[4]  Richard J. Roberts,et al.  Characterization of DNA methyltransferase specificities using single-molecule, real-time DNA sequencing , 2011, Nucleic acids research.

[5]  O. Elemento,et al.  Comprehensive Analysis of mRNA Methylation Reveals Enrichment in 3′ UTRs and near Stop Codons , 2012, Cell.

[6]  I. Derrington,et al.  Detection and mapping of 5-methylcytosine and 5-hydroxymethylcytosine with nanopore MspA , 2013, Proceedings of the National Academy of Sciences.

[7]  Tyson A. Clark,et al.  Direct detection of DNA methylation during single-molecule, real-time sequencing , 2010, Nature Methods.

[8]  Winston Timp,et al.  Detecting DNA cytosine methylation using nanopore sequencing , 2017, Nature Methods.

[9]  Jonas Korlach,et al.  Enhanced 5-methylcytosine detection in single-molecule, real-time sequencing via Tet1 oxidation , 2012, BMC Biology.

[10]  Coby Viner,et al.  DNAmod: the DNA modification database , 2016, bioRxiv.

[11]  F. Miura,et al.  Amplification-free whole-genome bisulfite sequencing by post-bisulfite adaptor tagging , 2012, Nucleic acids research.

[12]  Christian A. Ross,et al.  A role for the bacterial GATC methylome in antibiotic stress survival , 2016, Nature Genetics.

[13]  Samuel Kilcher,et al.  Tracking viral genomes in host cells at single-molecule resolution. , 2013, Cell host & microbe.

[14]  Timothy J. Mitchison,et al.  A chemical method for fast and sensitive detection of DNA synthesis in vivo , 2008, Proceedings of the National Academy of Sciences.

[15]  Kyle N. Klein,et al.  Genome-Wide Identification of Early-Firing Human Replication Origins by Optical Replication Mapping , 2017, bioRxiv.

[16]  M. Akeson,et al.  Nanopores Discriminate among Five C5-Cytosine Variants in DNA , 2014, Journal of the American Chemical Society.

[17]  Mark Akeson,et al.  Error rates for nanopore discrimination among cytosine, methylcytosine, and hydroxymethylcytosine along individual DNA strands , 2013, Proceedings of the National Academy of Sciences.

[18]  R. Castro,et al.  Epigenetic modifications: basic mechanisms and role in cardiovascular disease. , 2011, Circulation.

[19]  Jonas Korlach,et al.  The birth of the Epitranscriptome: deciphering the function of RNA modifications , 2012, Genome Biology.

[20]  Sheng Li,et al.  Nanopore detection of bacterial DNA base modifications , 2017, bioRxiv.

[21]  Jordan M. Eizenga,et al.  Mapping DNA Methylation with High Throughput Nanopore Sequencing , 2017, Nature Methods.

[22]  Gang Fang,et al.  Detecting DNA Modifications from SMRT Sequencing Data by Modeling Sequence Context Dependence of Polymerase Kinetic , 2013, PLoS Comput. Biol..

[23]  M. Kupiec,et al.  Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq , 2012, Nature.

[24]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[25]  Harold Riethman,et al.  CRISPR-CAS9 D10A nickase target-specific fluorescent labeling of double strand DNA for whole genome mapping and structural variation analysis , 2015, Nucleic acids research.

[26]  Matthew K Waldor,et al.  Entering the era of bacterial epigenomics with single molecule real time DNA sequencing. , 2013, Current opinion in microbiology.