Error correction enables use of Oxford Nanopore technology for reference-free transcriptome analysis

The authors have withdrawn their manuscript as the author list was changed after publication of the paper. The listed authors wish that the published version should be considered/cited as a reference for the work reported, which has the correct author list. Please refer to and cite the published work. Besides the change in the author list, the only differences are in the Acknowledgements, Competing Interests, and Data Availability sections. If you have any questions, please contact the corresponding author.

[1]  Mark Akeson,et al.  Reading canonical and modified nucleobases in 16S ribosomal RNA using nanopore native RNA sequencing , 2019, PloS one.

[2]  Maurits J. J. Dijkstra,et al.  Multiple Sequence Alignment. , 2017, Methods in molecular biology.

[3]  Feng Luo,et al.  MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads , 2017, Nature Methods.

[4]  Xiandong Meng,et al.  Widespread Polycistronic Transcripts in Fungi Revealed by Single-Molecule mRNA Sequencing , 2015, PloS one.

[5]  Charles Cole,et al.  Realizing the potential of full-length transcriptome sequencing , 2019, Philosophical Transactions of the Royal Society B.

[6]  Pierre Peterlongo,et al.  De novo clustering of long reads by gene from transcriptomics data , 2018, Nucleic acids research.

[7]  Paul Medvedev,et al.  De Novo Clustering of Long-Read Transcriptome Data Using a Greedy, Quality-Value Based Algorithm , 2019, RECOMB.

[8]  Rayan Chikhi,et al.  Comparative assessment of long-read error correction software applied to Nanopore RNA-sequencing data , 2019, Briefings Bioinform..

[9]  Christopher J. Lee Generating Consensus Sequences from Partial Order Multiple Sequence Alignment Graphs , 2003, Bioinform..

[10]  K. Laidre,et al.  Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) , 2019, bioRxiv.

[11]  Kateryna D. Makova,et al.  Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon , 2018, Nature Communications.

[12]  Justin Chu,et al.  NanoSim: nanopore sequence read simulator based on statistical characterization , 2016, bioRxiv.

[13]  J. Ragoussis,et al.  Transcript Profiling Using Long-Read Sequencing Technologies. , 2018, Methods in molecular biology.

[14]  Eduardo Eyras,et al.  ReorientExpress: reference-free orientation of nanopore cDNA reads with deep learning , 2019, Genome Biology.

[15]  I. Nookaew,et al.  Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN.PK113-7D , 2018, Nucleic acids research.

[16]  Joel A. Indi,et al.  ReorientExpress: reference-free orientation of nanopore cDNA reads with deep learning , 2019, Genome Biology.

[17]  Ali Mortazavi,et al.  TranscriptClean: variant-aware correction of indels, mismatches and splice junctions in long-read transcripts , 2018, Bioinform..

[18]  Robert C. Edgar,et al.  Multiple sequence alignment. , 2006, Current opinion in structural biology.

[19]  Esko Ukkonen,et al.  Accurate self-correction of errors in long reads using de Bruijn graphs , 2016, Bioinform..

[20]  Shilin Chen,et al.  IDP-denovo: de novo transcriptome assembly and isoform annotation by hybrid sequencing , 2018, Bioinform..

[21]  W Brad Barbazuk,et al.  Detecting alternatively spliced transcript isoforms from single‐molecule long‐read sequences without a reference genome , 2017, Molecular ecology resources.

[22]  Michael Roberts,et al.  Reducing storage requirements for biological sequence comparison , 2004, Bioinform..

[23]  Sven Rahmann,et al.  SimLoRD: Simulation of Long Read Data , 2016, Bioinform..

[24]  T. Derrien,et al.  Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules , 2019, Scientific Reports.

[25]  S. Koren,et al.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation , 2016, bioRxiv.

[26]  Martin Sosic,et al.  Edlib: a C/C++ library for fast, exact sequence alignment using edit distance , 2016, bioRxiv.

[27]  Jean-Marc Aury,et al.  Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules , 2019, Scientific Reports.

[28]  Paul J. Harrison,et al.  Long-read sequencing reveals the complex splicing profile of the psychiatric risk gene CACNA1C in human brain , 2019, Molecular Psychiatry.

[29]  Christopher J. Lee,et al.  Multiple sequence alignment using partial order graphs , 2002, Bioinform..

[30]  Angela N. Brooks,et al.  Nanopore native RNA sequencing of a human poly(A) transcriptome , 2018, bioRxiv.

[31]  Paul Medvedev,et al.  Data structures to represent sets of k-long DNA sequences , 2019, ArXiv.

[32]  David W. Burt,et al.  Illuminating the dark side of the human transcriptome with TAMA Iso-Seq analysis , 2019, bioRxiv.

[33]  Thomas Hackl,et al.  proovread: large-scale high-accuracy PacBio correction through iterative short read consensus , 2014, Bioinform..

[34]  Eugene W. Myers,et al.  Non Hybrid Long Read Consensus Using Local De Bruijn Graph Assembly , 2017, bioRxiv.

[35]  I. Nookaew,et al.  Decoding the epitranscriptional landscape from native RNA sequences , 2020, Nucleic acids research.

[36]  Renmin Han,et al.  DeepSimulator: a deep simulator for Nanopore sequencing , 2017, bioRxiv.

[37]  Nanopore native RNA sequencing of a human poly(A) transcriptome , 2019, Nature Methods.

[38]  Pascal Barbry,et al.  High throughput error corrected Nanopore single cell transcriptome sequencing , 2019, Nature Communications.

[39]  Barbara J. Wold,et al.  A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification , 2019, bioRxiv.

[40]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[41]  C. Vollmers,et al.  Complete characterization of the human immune cell transcriptome using accurate full-length cDNA sequencing , 2019, bioRxiv.

[42]  Arnaud Lefebvre,et al.  CONSENT: Scalable self-correction of long reads with multiple sequence alignment , 2019, bioRxiv.

[43]  Niranjan Nagarajan,et al.  Fast and accurate de novo genome assembly from long uncorrected reads. , 2017, Genome research.

[44]  Hugh E. Olsen,et al.  Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells , 2017, Nature Communications.

[45]  I. Nookaew,et al.  Decoding the epitranscriptional landscape from native RNA sequences , 2018, bioRxiv.

[46]  Richard E. Green,et al.  Improving nanopore read accuracy with the R2C2 method enables the sequencing of highly multiplexed full-length single-cell cDNA , 2018, Proceedings of the National Academy of Sciences.

[47]  Aaron A. Klammer,et al.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data , 2013, Nature Methods.

[48]  Anton J. Enright,et al.  RNA modifications detection by comparative Nanopore direct RNA sequencing , 2019, Nature Communications.

[49]  Colin R. Janssen,et al.  Evaluating the potential of direct RNA nanopore sequencing: Metatranscriptomics highlights possible seasonal differences in a marine pelagic crustacean zooplankton community. , 2019, Marine environmental research.

[50]  Yasuko Mori,et al.  Direct RNA sequencing on nanopore arrays redefines the transcriptional complexity of a viral pathogen , 2019, Nature Communications.

[51]  Joel A. Indi,et al.  RATTLE: reference-free reconstruction and quantification of transcriptomes from Nanopore sequencing , 2020, bioRxiv.

[52]  Daniel R. Garalde,et al.  Highly parallel direct RNA sequencing on an array of nanopores , 2016, Nature Methods.

[53]  Lennart Martens,et al.  1 SQANTI : extensive characterization of long read transcript sequences for quality control in 1 full-length transcriptome identification and quantification 2 3 , 2017 .