CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing

Background: Long-read nanopore sequencing technology is of particular significance for taxonomic identification at or below the species level. For many environmental samples, the total extractable DNA is far below the current input requirements of nanopore sequencing, preventing “sample to sequence” metagenomics from low-biomass or recalcitrant samples.

Results: Here we address this problem by employing carrier sequencing, a method to sequence low-input DNA by preparing the target DNA with a genomic carrier to achieve ideal library preparation and sequencing stoichiometry without amplification. We then use CarrierSeq, a sequence analysis workflow, to identify the low-input target reads from the genomic carrier. We tested CarrierSeq experimentally by sequencing a combination of 0.2 ng Bacillus subtilis ATCC 6633 DNA in a background of 1000 ng Enterobacteria phage λ DNA. After filtering out carrier, low-quality, and low-complexity reads, we detected target reads (B. subtilis), contamination reads, and “high quality noise reads” (HQNRs) that do not map to the carrier, the target, or known lab contaminants. These reads appear to be artifacts of the nanopore sequencing process, as they are associated with specific channels (pores).

Conclusion: By treating sequencing as a Poisson arrival process, we implement a statistical test to reject data from channels dominated by HQNRs while retaining low-input target reads.
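The channel-rejection idea can be sketched as follows. This is a minimal illustration, not the CarrierSeq implementation. It assumes that, after filtering, genuine reads arrive uniformly across channels, so each channel's read count is approximately Poisson with mean `total_reads / n_channels`. The function names, the default of 512 channels, the significance level `alpha`, and the Bonferroni correction are all assumptions made for this example and are not taken from the abstract.

```python
import math
from collections import Counter


def poisson_sf(k, lam):
    """Upper tail P(X >= k) for X ~ Poisson(lam), using only the stdlib."""
    if k <= 0:
        return 1.0
    term = math.exp(-lam)      # P(X = 0)
    cdf = term
    for i in range(1, k):      # accumulate P(X = 1) .. P(X = k - 1)
        term *= lam / i
        cdf += term
    return max(0.0, 1.0 - cdf)


def flag_channels(read_channels, n_channels=512, alpha=0.05):
    """Return channels with more reads than a uniform Poisson model explains.

    read_channels: one channel ID per read that survived filtering
    (hypothetical input format for this sketch).
    """
    counts = Counter(read_channels)
    lam = len(read_channels) / n_channels   # expected reads per channel
    threshold = alpha / n_channels          # Bonferroni correction (assumed)
    return {ch for ch, k in counts.items() if poisson_sf(k, lam) < threshold}


# Example: 100 channels with one read each, plus one channel with 50 extra reads.
reads = list(range(1, 101)) + [7] * 50
flagged = flag_channels(reads)
```

Reads from the flagged channels would then be discarded as likely HQNRs, while reads from the remaining channels are kept as candidate target reads. Note that the rate estimate here includes the reads from the noisy channels themselves, which makes the test somewhat conservative.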
