A Sample-to-Report Solution for Taxonomic Identification of Cultured Bacteria in the Clinical Setting Based on Nanopore Sequencing

Amplicon sequencing of the 16S rRNA gene is commonly used for the identification of bacterial isolates in diagnostic laboratories and mostly relies on the Sanger sequencing method. The latter, however, suffers from a number of limitations, with the most significant being the inability to resolve mixed amplicons when closely related species are coamplified from a mixed culture. This often leads to either increased turnaround time or absence of usable sequence data. Short-read next-generation sequencing (NGS) technologies could solve the mixed amplicon issue but would lack both cost efficiency at low throughput and fast turnaround times. ABSTRACT Amplicon sequencing of the 16S rRNA gene is commonly used for the identification of bacterial isolates in diagnostic laboratories and mostly relies on the Sanger sequencing method. The latter, however, suffers from a number of limitations, with the most significant being the inability to resolve mixed amplicons when closely related species are coamplified from a mixed culture. This often leads to either increased turnaround time or absence of usable sequence data. Short-read next-generation sequencing (NGS) technologies could solve the mixed amplicon issue but would lack both cost efficiency at low throughput and fast turnaround times. Nanopore sequencing developed by Oxford Nanopore Technologies (ONT) could solve those issues by enabling a flexible number of samples per run and an adjustable sequencing time. Here, we report on the development of a standardized laboratory workflow combined with a fully automated analysis pipeline LORCAN (long read consensus analysis), which together provide a sample-to-report solution for amplicon sequencing and taxonomic identification of the resulting consensus sequences. Validation of the approach was conducted on a panel of reference strains and on clinical samples consisting of single or mixed rRNA amplicons associated with various bacterial genera by direct comparison to the corresponding Sanger sequences. Additionally, simulated read and amplicon mixtures were used to assess LORCAN’s behavior when dealing with samples with known cross-contamination levels. We demonstrate that by combining ONT amplicon sequencing results with LORCAN, the accuracy of Sanger sequencing can be closely matched (>99.6% sequence identity) and that mixed samples can be resolved at the single-base resolution level. The presented approach has the potential to significantly improve the flexibility, reliability, and availability of amplicon sequencing in diagnostic settings.

[1]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[2]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[3]  Aaron Pomerantz,et al.  Real-time DNA barcoding in a rainforest using nanopore sequencing: opportunities for rapid biodiversity assessments and local capacity building , 2018, GigaScience.

[4]  Szymon T Calus,et al.  NanoAmpli-Seq: a workflow for amplicon sequencing for mixed microbial communities on the nanopore sequencing platform , 2018, bioRxiv.

[5]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[6]  Niranjan Nagarajan,et al.  A MinION-based pipeline for fast and cost-effective DNA barcoding , 2018, bioRxiv.

[7]  Susanna K. P. Lau,et al.  Usefulness of the MicroSeq 500 16S Ribosomal DNA-Based Bacterial Identification System for Identification of Clinically Significant Bacterial Isolates with Ambiguous Biochemical Profiles , 2003, Journal of Clinical Microbiology.

[8]  Massimo Delledonne,et al.  A rapid and accurate MinION-based workflow for tracking species biodiversity in the field , 2019 .

[9]  A. von Haeseler,et al.  IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies , 2014, Molecular biology and evolution.

[10]  Wei Qian,et al.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. , 2000, Molecular biology and evolution.

[11]  A. Psaroulaki,et al.  Use of MALDI-TOF mass spectrometry in the battle against bacterial infectious diseases: recent achievements and future perspectives , 2017, Expert review of proteomics.

[12]  T. Imanishi,et al.  Rapid bacterial identification by direct PCR amplification of 16S rRNA genes using the MinION™ nanopore sequencer , 2018, bioRxiv.

[13]  J P Flandrois,et al.  16S rRNA sequencing in routine bacterial identification: a 30-month experiment. , 2006, Journal of microbiological methods.

[14]  Helen Sutton,et al.  Compilation of a MALDI-TOF mass spectral database for the rapid screening and characterisation of bacteria implicated in human infectious diseases. , 2004, Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases.

[15]  B. Chain,et al.  The sequence of sequencers: The history of sequencing DNA , 2016, Genomics.

[16]  R. Kirkegaard,et al.  Enabling high-accuracy long-read amplicon sequences using unique molecular identifiers and Nanopore sequencing , 2019, bioRxiv.

[17]  Y. Sanz,et al.  Species-level resolution of 16S rRNA gene amplicons sequenced through the MinION™ portable nanopore sequencer , 2015, bioRxiv.

[18]  Cesare Centomo,et al.  On site DNA barcoding by nanopore sequencing , 2017, PloS one.

[19]  T. Peto,et al.  Detection of Viral Pathogens With Multiplex Nanopore MinION Sequencing: Be Careful With Cross-Talk , 2018, bioRxiv.

[20]  Shuiquan Tang,et al.  Ultra-deep, long-read nanopore sequencing of mock microbial community standards , 2018 .

[21]  L. Kaplan,et al.  The Human Microbiome and Obesity: Moving beyond Associations. , 2017, Cell host & microbe.

[22]  Yan Li,et al.  SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation , 2016, PloS one.

[23]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[24]  C. Ahrens,et al.  Long-read based de novo assembly of low-complexity metagenome samples results in finished genomes and reveals insights into strain diversity and an active phage system , 2018, BMC Microbiology.

[25]  Niranjan Nagarajan,et al.  Fast and accurate de novo genome assembly from long uncorrected reads. , 2017, Genome research.

[26]  F. Corpet Multiple sequence alignment with hierarchical clustering. , 1988, Nucleic acids research.

[27]  Eoin L. Brodie,et al.  Use of 16S rRNA Gene for Identification of a Broad Range of Clinically Relevant Bacterial Pathogens , 2015, PloS one.

[28]  Guy Perrière,et al.  leBIBIQBPP: a set of databases and a webtool for automatic phylogenetic analysis of prokaryotic sequences , 2015, BMC Bioinformatics.

[29]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[30]  J. J. Abellán,et al.  Assessing Gut Microbial Diversity from Feces and Rectal Mucosa , 2010, Microbial Ecology.

[31]  Didier Raoult,et al.  16S Ribosomal DNA Sequence Analysis of a Large Collection of Environmental and Clinical Unidentifiable Bacterial Isolates , 2000, Journal of Clinical Microbiology.

[32]  M. Brent,et al.  A tale of two templates: automatically resolving double traces has many applications, including efficient PCR-based elucidation of alternative splices. , 2007, Genome research.

[33]  S. Abbott,et al.  16S rRNA Gene Sequencing for Bacterial Identification in the Diagnostic Laboratory: Pluses, Perils, and Pitfalls , 2007, Journal of Clinical Microbiology.

[34]  N. Dovichi,et al.  Use of non-cross-linked polyacrylamide for four-color DNA sequencing by capillary electrophoresis separation of fragments up to 640 bases in length in two hours. , 1995, Analytical chemistry.