Nanopore sequencing enables high-resolution analysis of resistance determinants and mobile elements in the human gut microbiome

The analysis of information rich whole-metagenome datasets acquired from complex microbial communities is often restricted by the fragmented nature of assembly from short-read sequencing. The availability of long-reads from third-generation sequencing technologies (e.g. PacBio or Oxford Nanopore) can help improve assembly quality in principle, but high error rates and low throughput have limited their application in metagenomics. In this work, we describe the first hybrid metagenomic assembler which combines the advantages of short and long-read technologies, providing an order of magnitude improvement in contiguity compared to short read assemblies, and high base-pair level accuracy. The proposed approach (OPERA-MS) integrates a novel assembly-based metagenome clustering technique with an exact scaffolding algorithm that can efficiently assemble repeat rich sequences. Based on evaluations with defined in vitro communities and virtual gut microbiomes, we show that it is possible to assemble near complete genomes from metagenomes with as little as 9× long read coverage, thus enabling high quality assembly of lowly abundant species (<1%). Furthermore, OPERA-MS’s fine-grained clustering is able to deconvolute and assemble multiple genomes of the same species in a single sample, allowing us to study the complex dynamics of the human microbiome at the sub-species level. Applying nanopore sequencing to gut metagenomes of patients undergoing antibiotic treatment, we show that long reads can be obtained from stool samples in clinical studies to produce more meaningful metagenomic assemblies (up to 200× improvement over short-read assemblies), including the closed assembly of >80 putative plasmid/phage sequences and a 263kbp jumbo phage. Our results highlight that high-quality hybrid assemblies provide an unprecedented view of the gut resistome in these patients, including strain dynamics and identification of novel plasmid sequences.

[1]  Petra F. G. Wolffs,et al.  The human microbiome as a reservoir of antimicrobial resistance , 2013, Front. Microbiol..

[2]  Brian C. Thomas,et al.  Measurement of bacterial replication rates in microbial communities , 2016, Nature Biotechnology.

[3]  J. Rolain,et al.  ARG-ANNOT, a New Bioinformatic Tool To Discover Antibiotic Resistance Genes in Bacterial Genomes , 2013, Antimicrobial Agents and Chemotherapy.

[4]  Rick L. Stevens,et al.  The RAST Server: Rapid Annotations using Subsystems Technology , 2008, BMC Genomics.

[5]  Duy Tin Truong,et al.  MetaPhlAn2 for enhanced metagenomic taxonomic profiling , 2015, Nature Methods.

[6]  S. Koren,et al.  Nanopore sequencing and assembly of a human genome with ultra-long reads , 2017, bioRxiv.

[7]  Yuki Moriya,et al.  KAAS: an automatic genome annotation and pathway reconstruction server , 2007, Nucleic Acids Res..

[8]  Gintaras Deikus,et al.  Metagenomic binning and association of plasmids with bacterial host genomes using DNA methylation , 2017, Nature Biotechnology.

[9]  Ryan R. Wick,et al.  Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads , 2016, bioRxiv.

[10]  A. Raftery Bayes Factors and BIC , 1999 .

[11]  Siu-Ming Yiu,et al.  IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth , 2012, Bioinform..

[12]  M. Mirzaei,et al.  Ménage à trois in the human gut: interactions between host, bacteria and phages , 2017, Nature Reviews Microbiology.

[13]  Aaron R. Quinlan,et al.  Poretools: a toolkit for analyzing nanopore sequence data , 2014, bioRxiv.

[14]  Jacques Schrenzel,et al.  Clinical metagenomics of bone and joint infections: a proof of concept study , 2016 .

[15]  E. Trecarichi,et al.  Therapeutic options for carbapenem-resistant Enterobacteriaceae infections , 2017, Virulence.

[16]  Donovan Parks,et al.  GroopM: an automated tool for the recovery of population genomes from related metagenomes , 2014, PeerJ.

[17]  M. Wagner,et al.  Complete nitrification by Nitrospira bacteria , 2015, Nature.

[18]  K. Pollard,et al.  Toward Accurate and Quantitative Comparative Metagenomics , 2016, Cell.

[19]  Philip D. Blood,et al.  Critical Assessment of Metagenome Interpretation—a benchmark of metagenomics software , 2017, Nature Methods.

[20]  Peer Bork,et al.  Microbial abundance, activity and population genomic profiling with mOTUs2 , 2019, Nature Communications.

[21]  Ronan M. T. Fleming,et al.  Generation of genome-scale metabolic reconstructions for 773 members of the human gut microbiota , 2016, Nature Biotechnology.

[22]  Niranjan Nagarajan,et al.  Fast and accurate de novo genome assembly from long uncorrected reads. , 2017, Genome research.

[23]  Wasserman,et al.  Bayesian Model Selection and Model Averaging. , 2000, Journal of mathematical psychology.

[24]  J. Banfield,et al.  dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication , 2017, The ISME Journal.

[25]  Serafim Batzoglou,et al.  High-quality genome sequences of uncultured microbes by assembly of read clouds , 2018, Nature Biotechnology.

[26]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[27]  Anders F. Andersson,et al.  Binning metagenomic contigs by coverage and composition , 2014, Nature Methods.

[28]  K. Kupkova,et al.  Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics , 2016, Computational and structural biotechnology journal.

[29]  Heng Li,et al.  Fast and accurate long-read assembly with wtdbg2 , 2019, Nature Methods.

[30]  Dmitry Antipov,et al.  hybridSPAdes: an algorithm for hybrid assembly of short and long reads , 2016, Bioinform..

[31]  Rob Knight,et al.  ConStrains identifies microbial strains in metagenomic datasets , 2015, Nature Biotechnology.

[32]  K. Bush,et al.  Carbapenemases: the Versatile β-Lactamases , 2007, Clinical Microbiology Reviews.

[33]  Oon Tek Ng,et al.  Clinical and Molecular Epidemiology of Carbapenem-Resistant Enterobacteriaceae Among Adult Inpatients in Singapore. , 2017, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[34]  S. Partridge,et al.  Analysis of antibiotic resistance regions in Gram-negative bacteria. , 2011, FEMS microbiology reviews.

[35]  Jillian F Banfield,et al.  Strain-resolved analysis of hospital rooms and infants reveals overlap between the human and room microbiome , 2017, Nature Communications.

[36]  S. Salzberg,et al.  Versatile and open software for comparing large genomes , 2004, Genome Biology.

[37]  O. Gascuel,et al.  New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. , 2010, Systematic biology.

[38]  Niranjan Nagarajan,et al.  Gut microbiome recovery after antibiotic usage is mediated by specific bacterial species , 2018, bioRxiv.

[39]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[40]  Yohei Doi,et al.  Therapy of Infections due to Carbapenem-Resistant Gram-Negative Pathogens , 2014, Infection & chemotherapy.

[41]  Henry C. Lin,et al.  Phage therapy: An alternative to antibiotics in the age of multi-drug resistance , 2017, World journal of gastrointestinal pharmacology and therapeutics.

[42]  N. Loman,et al.  A culture-independent sequence-based metagenomics approach to the investigation of an outbreak of Shiga-toxigenic Escherichia coli O104:H4. , 2013, JAMA.

[43]  Johannes Alneberg,et al.  DESMAN: a new tool for de novo extraction of strains from metagenomes , 2017, Genome Biology.

[44]  N. Loman,et al.  A complete bacterial genome assembled de novo using only nanopore sequencing data , 2015, Nature Methods.

[45]  Shabhonam Caim,et al.  Rapid MinION metagenomic profiling of the preterm infant gut microbiota to aid in pathogen diagnostics , 2017 .

[46]  Kunihiko Sadakane,et al.  MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph , 2014, Bioinform..

[47]  P. B. Pope,et al.  Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data , 2015, Scientific Reports.

[48]  Aaron A. Klammer,et al.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data , 2013, Nature Methods.

[49]  Glenn Tesler,et al.  Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory , 2012, BMC Bioinformatics.

[50]  Richard A. Davis,et al.  Remarks on Some Nonparametric Estimates of a Density Function , 2011 .

[51]  S. Koren,et al.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation , 2016, bioRxiv.

[52]  Edoardo Pasolli,et al.  Extensive Unexplored Human Microbiome Diversity Revealed by Over 150,000 Genomes from Metagenomes Spanning Age, Geography, and Lifestyle , 2019, Cell.

[53]  Yihui Yuan,et al.  Jumbo Bacteriophages: An Overview , 2017, Front. Microbiol..

[54]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[55]  Niranjan Nagarajan,et al.  Fast and sensitive mapping of nanopore sequencing reads with GraphMap , 2016, Nature Communications.

[56]  J. Gilbert,et al.  Recovering complete and draft population genomes from metagenome datasets , 2016, Microbiome.

[57]  G. Meletis,et al.  Double- and multi-carbapenemase-producers: the excessively armored bacilli of the current decade , 2015, European Journal of Clinical Microbiology & Infectious Diseases.

[58]  Peter D Karp,et al.  Metabolic pathways for the whole community , 2014, BMC Genomics.

[59]  P. Pevzner,et al.  metaSPAdes: a new versatile metagenomic assembler. , 2017, Genome research.

[60]  Connor T. Skennerton,et al.  CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes , 2015, Genome research.

[61]  M. Schatz,et al.  Hybrid error correction and de novo assembly of single-molecule sequencing reads , 2012, Nature Biotechnology.

[62]  Geir Skogerbø,et al.  The human microbiome: a hot spot of microbial horizontal gene transfer. , 2012, Genomics.

[63]  Baoli Zhu,et al.  Human gut microbiome: the second genome of human body , 2010, Protein & Cell.

[64]  Jens Roat Kultima,et al.  Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes , 2014, Nature Biotechnology.

[65]  Brian D. Ondov,et al.  Mash: fast genome and metagenome distance estimation using MinHash , 2015, Genome Biology.

[66]  Eugene Kulesha,et al.  What’s in my pot? Real-time species identification on the MinION™ , 2015, bioRxiv.

[67]  Anna E. Sheppard,et al.  Plasmid Classification in an Era of Whole-Genome Sequencing: Application in Studies of Antibiotic Resistance Epidemiology , 2017, Front. Microbiol..

[68]  G. Meletis,et al.  Carbapenem resistance: overview of the problem and future perspectives , 2016, Therapeutic advances in infectious disease.

[69]  Nicholas Chia,et al.  Identification of Prosthetic Joint Infection Pathogens Using a Shotgun Metagenomics Approach , 2018, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[70]  N. Nagarajan,et al.  OPERA-LG: efficient and exact scaffolding of large, repeat-rich eukaryotic genomes with performance guarantees , 2016, Genome Biology.

[71]  Zev N. Kronenberg,et al.  Hi-C deconvolution of a human gut microbiome yields high-quality draft genomes and reveals plasmid-genome interactions , 2017, bioRxiv.

[72]  Mihai Pop,et al.  Finding Biologically Accurate Clusterings in Hierarchical Tree Decompositions Using the Variation of Information , 2009, J. Comput. Biol..

[73]  J. Ronholm,et al.  Metagenomics: The Next Culture-Independent Game Changer , 2017, Front. Microbiol..

[74]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[75]  Denis Bertrand,et al.  FinIS: Improved in silico Finishing Using an Exact Quadratic Programming Formulation , 2012, WABI.

[76]  Peer Bork,et al.  MOCAT: A Metagenomics Assembly and Gene Prediction Toolkit , 2012, PloS one.

[77]  K. Kaye,et al.  Treatment Options for Carbapenem-Resistant Enterobacteriaceae Infections , 2015, Open forum infectious diseases.

[78]  Alexey A. Gurevich,et al.  MetaQUAST: evaluation of metagenome assemblies , 2016, Bioinform..

[79]  Blake A. Simmons,et al.  MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets , 2016, Bioinform..