Using metagenomics to investigate human and environmental resistomes

Antibiotic resistance is a global health concern declared by the WHO as one of the largest threats to modern healthcare. In recent years, metagenomic DNA sequencing has started to be applied as a tool to study antibiotic resistance in different environments, including the human microbiota. However, a multitude of methods exist for metagenomic data analysis, and not all methods are suitable for the investigation of resistance genes, particularly if the desired outcome is an assessment of risks to human health. In this review, we outline the current state of methods for sequence handling, mapping to databases of resistance genes, statistical analysis and metagenomic assembly. In addition, we provide an overview of important considerations related to the analysis of resistance genes, and recommend some of the currently used tools and methods that are best equipped to inform research and clinical practice related to antibiotic resistance.

[1]  K. Pearson On the Criterion that a Given System of Deviations from the Probable in the Case of a Correlated System of Variables is Such that it Can be Reasonably Supposed to have Arisen from Random Sampling , 1900 .

[2]  O. J. Dunn Estimation of the Medians for Dependent Variables , 1959 .

[3]  O. J. Dunn Multiple Comparisons among Means , 1961 .

[4]  S. Hurlbert The Nonconcept of Species Diversity: A Critique and Alternative Parameters. , 1971, Ecology.

[5]  R. Staden A strategy of DNA sequencing employing computer programs. , 1979, Nucleic acids research.

[6]  A. Chao Nonparametric estimation of the number of classes in a population , 1984 .

[7]  A. Chao,et al.  Estimating the Number of Classes via Sample Coverage , 1992 .

[8]  Robert K. Colwell,et al.  Estimating terrestrial biodiversity through extrapolation. , 1994, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[9]  K. Schleifer,et al.  Phylogenetic identification and in situ detection of individual microbial cells without cultivation. , 1995, Microbiological reviews.

[10]  Michael S. Waterman,et al.  A New Algorithm for DNA Sequence Assembly , 1995, J. Comput. Biol..

[11]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[12]  Gapped BLAST and PSI-BLAST: A new , 1997 .

[13]  Thomas Wetter,et al.  Genome Sequence Assembly Using Trace Signals and Additional Sequence Information , 1999, German Conference on Bioinformatics.

[14]  Eugene W. Myers,et al.  A whole-genome assembly of Drosophila. , 2000, Science.

[15]  T. Schmidt,et al.  rRNA Operon Copy Number Reflects Ecological Strategies of Bacteria , 2000, Applied and Environmental Microbiology.

[16]  S. Kjelleberg,et al.  rpoB-Based Microbial Community Analysis Avoids Limitations Inherent in 16S rRNA Gene Intraspecies Heterogeneity , 2000, Applied and Environmental Microbiology.

[17]  J. Whisstock,et al.  For the record: A single amino acid substitution affects substrate specificity in cysteine proteinases from Fasciola hepatica , 2000, Protein science : a publication of the Protein Society.

[18]  P. Pevzner,et al.  An Eulerian path approach to DNA fragment assembly , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[19]  H. Cheong,et al.  Alteration of a single amino acid changes the substrate specificity of dihydroflavonol 4-reductase. , 2001, The Plant journal : for cell and molecular biology.

[20]  J. Hughes,et al.  Counting the Uncountable: Statistical Approaches to Estimating Microbial Diversity , 2001, Applied and Environmental Microbiology.

[21]  A. Magurran,et al.  Measuring Biological Diversity , 2004 .

[22]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[23]  J. Handelsman,et al.  Uncultured soil bacteria are a reservoir of new antibiotic resistance genes. , 2004, Environmental microbiology.

[24]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[25]  Ian A. Wilson,et al.  A Single Amino Acid Substitution in 1918 Influenza Virus Hemagglutinin Changes Receptor Binding Specificity , 2005, Journal of Virology.

[26]  Laura S. Frost,et al.  Mobile genetic elements: the agents of open source evolution , 2005, Nature Reviews Microbiology.

[27]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[28]  Forest Rohwer,et al.  An application of statistics to comparative metagenomics , 2006, BMC Bioinformatics.

[29]  Jessica J Hellmann,et al.  The application of rarefaction techniques to molecular inventories of microbial diversity. , 2005, Methods in enzymology.

[30]  Ian B. Jeffery,et al.  Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data , 2006, BMC Bioinformatics.

[31]  Mark B Gerstein,et al.  Assessment of whole genome amplification-induced bias through high-throughput, massively parallel whole genome sequencing , 2006, BMC Genomics.

[32]  René L. Warren,et al.  Assembling millions of short DNA sequences using SSAKE , 2006, Bioinform..

[33]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[34]  Juliane C. Dohm,et al.  Substantial biases in ultra-short read data sets from high-throughput DNA sequencing , 2008, Nucleic acids research.

[35]  Andreas Wilke,et al.  phylogenetic and functional analysis of metagenomes , 2022 .

[36]  C. Nusbaum,et al.  ALLPATHS: de novo assembly of whole-genome shotgun microreads. , 2008, Genome research.

[37]  S. Austin,et al.  Switching Protein-DNA Recognition Specificity by Single-Amino-Acid Substitutions in the P1 par Family of Plasmid Partition Elements , 2008, Journal of bacteriology.

[38]  Karl Pearson F.R.S. X. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling , 2009 .

[39]  Steven J. M. Jones,et al.  Abyss: a Parallel Assembler for Short Read Sequence Data Material Supplemental Open Access , 2022 .

[40]  Mihai Pop,et al.  Genome assembly reborn: recent computational challenges , 2009, Briefings Bioinform..

[41]  Erik Kristiansson,et al.  ShotgunFunctionalizeR: an R-package for functional comparison of metagenomes , 2009, Bioinform..

[42]  William Stafford Noble,et al.  How does multiple testing correction work? , 2009, Nature Biotechnology.

[43]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[44]  G. Church,et al.  Functional Characterization of the Antibiotic Resistance Reservoir in the Human Microflora , 2009, Science.

[45]  Heather K. Allen,et al.  Functional metagenomics reveals diverse β-lactamases in a remote Alaskan soil , 2009, The ISME Journal.

[46]  Mihai Pop,et al.  Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples , 2009, PLoS Comput. Biol..

[47]  Patricia C. Babbitt,et al.  An Atlas of the Thioredoxin Fold Class Reveals the Complexity of Function-Enabling Adaptations , 2009, PLoS Comput. Biol..

[48]  Mihai Pop,et al.  ARDB—Antibiotic Resistance Genes Database , 2008, Nucleic Acids Res..

[49]  Søren J. Sørensen,et al.  Conjugative plasmids: vessels of the communal gene pool , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[50]  A. Oshlack,et al.  Transcript length bias in RNA-seq data confounds systems biology , 2009, Biology Direct.

[51]  L. Bianchi,et al.  A Single Amino Acid Change Converts the Sugar Sensor SGLT3 into a Sugar Transporter , 2010, PloS one.

[52]  J. Handelsman,et al.  Novel Florfenicol and Chloramphenicol Resistance Gene Discovered in Alaskan Soil by Using Functional Metagenomics , 2010, Applied and Environmental Microbiology.

[53]  John C. Wooley,et al.  A Primer on Metagenomics , 2010, PLoS Comput. Biol..

[54]  B. Haas,et al.  A Catalog of Reference Genomes from the Human Microbiome , 2010, Science.

[55]  W. Huber,et al.  Differential expression analysis for sequence count data , 2010 .

[56]  Robert B. O'Hara,et al.  Do not log‐transform count data , 2010 .

[57]  Huanming Yang,et al.  De novo assembly of human genomes with massively parallel short read sequencing. , 2010, Genome research.

[58]  S. Koren,et al.  Assembly algorithms for next-generation sequencing data. , 2010, Genomics.

[59]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[60]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[61]  J. Sung,et al.  Analysis of human and animal fecal microbiota for microbial source tracking , 2011, The ISME Journal.

[62]  E. Kristiansson,et al.  Pyrosequencing of Antibiotic-Contaminated River Sediments Reveals High Levels of Resistance and Gene Transfer Elements , 2011, PloS one.

[63]  G. B. Golding,et al.  Antibiotic resistance is ancient , 2011, Nature.

[64]  S. Tringe,et al.  Metagenomic Discovery of Biomass-Degrading Genes and Genomes from Cow Rumen , 2011, Science.

[65]  G. Torres-Cortes,et al.  Characterization of novel antibiotic resistance genes identified by functional metagenomics on soil samples. , 2011, Environmental microbiology.

[66]  Robert A. Edwards,et al.  Quality control and preprocessing of metagenomic datasets , 2011, Bioinform..

[67]  T. Glenn Field guide to next‐generation DNA sequencers , 2011, Molecular ecology resources.

[68]  B. Mishra,et al.  Comparing De Novo Genome Assembly: The Long and Short of It , 2011, PloS one.

[69]  Siu-Ming Yiu,et al.  Meta-IDBA: a de Novo assembler for metagenomic data , 2011, Bioinform..

[70]  M. David,et al.  Metagenomic analysis of a permafrost microbial community reveals a rapid response to thaw , 2011, Nature.

[71]  Rick L. Stevens,et al.  Unlocking the potential of metagenomics through replicated experimental design , 2012, Nature Biotechnology.

[72]  Jesse A. Port,et al.  Metagenomic Profiling of Microbial Composition and Antibiotic Resistance Determinants in Puget Sound , 2012, PloS one.

[73]  Yasubumi Sakakibara,et al.  MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads , 2012, Nucleic acids research.

[74]  Anders F. Andersson,et al.  Which sequencing depth is sufficient to describe patterns in bacterial α- and β-diversity? , 2012, Environmental microbiology reports.

[75]  M. Schatz,et al.  Algorithms Gage: a Critical Evaluation of Genome Assemblies and Assembly Material Supplemental , 2008 .

[76]  S. Rasmussen,et al.  Identification of acquired antimicrobial resistance genes , 2012, The Journal of antimicrobial chemotherapy.

[77]  Nan Li,et al.  Comparison of the two major classes of assembly algorithms: overlap-layout-consensus and de-bruijn-graph. , 2012, Briefings in functional genomics.

[78]  F. Raymond,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Ray Meta: scalable de novo metagenome assembly and profiling , 2012 .

[79]  P. Chain,et al.  Next generation sequencing and bioinformatic bottlenecks: the current state of metagenomic data analysis. , 2012, Current opinion in biotechnology.

[80]  M. Toleman,et al.  blaNDM-1 Is a Chimera Likely Constructed in Acinetobacter baumannii , 2012, Antimicrobial Agents and Chemotherapy.

[81]  Arend Hintze,et al.  Scaling metagenome sequence assembly with probabilistic de Bruijn graphs , 2011, Proceedings of the National Academy of Sciences.

[82]  G. Dantas,et al.  The Shared Antibiotic Resistome of Soil Bacteria and Human Pathogens , 2012, Science.

[83]  P. Nordmann,et al.  Carbapenem resistance in Enterobacteriaceae: here is the storm! , 2012, Trends in molecular medicine.

[84]  J. Martínez,et al.  Bottlenecks in the Transferability of Antibiotic Resistance from Natural Ecosystems to Human Bacterial Pathogens , 2011, Front. Microbio..

[85]  Lisa M. Durso,et al.  Distribution and Quantification of Antibiotic Resistant Genes and Bacteria across Agricultural and Non-Agricultural Metagenomes , 2012, PloS one.

[86]  M. Sommer,et al.  Context matters - the complex interplay between resistome genotypes and resistance phenotypes. , 2012, Current opinion in microbiology.

[87]  P. Nordmann,et al.  Association of the Emerging Carbapenemase NDM-1 with a Bleomycin Resistance Protein in Enterobacteriaceae and Acinetobacter baumannii , 2012, Antimicrobial Agents and Chemotherapy.

[88]  Roger E Bumgarner Overview of DNA microarrays: types, applications, and their future. , 2013, Current protocols in molecular biology.

[89]  M. Pignatelli,et al.  Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut , 2014, BMC Genomics.

[90]  E. Kristiansson,et al.  Acquired Genetic Mechanisms of a Multiresistant Bacterium Isolated from a Treatment Plant Receiving Wastewater from Antibiotic Production , 2013, Applied and Environmental Microbiology.

[91]  Jian Wang,et al.  Metagenome-wide analysis of antibiotic resistance genes in a large cohort of human gut microbiota , 2013, Nature Communications.

[92]  Tong Zhang,et al.  Metagenomic insights into chlorination effects on microbial antibiotic resistance in drinking water. , 2013, Water Research.

[93]  Alexandros Stamatakis,et al.  Metagenomic species profiling using universal phylogenetic marker genes , 2013, Nature Methods.

[94]  Ole Lund,et al.  Rapid Whole-Genome Sequencing for Detection and Characterization of Microorganisms Directly from Clinical Samples , 2013, Journal of Clinical Microbiology.

[95]  J. Rolain,et al.  ARG-ANNOT, a New Bioinformatic Tool To Discover Antibiotic Resistance Genes in Bacterial Genomes , 2013, Antimicrobial Agents and Chemotherapy.

[96]  M. Pop,et al.  Robust methods for differential abundance analysis in marker gene surveys , 2013, Nature Methods.

[97]  Thomas Backhaus,et al.  Human Health Risk Assessment (HHRA) for Environmental Development and Transfer of Antibiotic Resistance , 2013, Environmental health perspectives.

[98]  C. Mason,et al.  Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data , 2013, Genome Biology.

[99]  Nicolas Servant,et al.  A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis , 2013, Briefings Bioinform..

[100]  R. Tippkötter,et al.  Comparison of commercial kits for the extraction of DNA from paddy soils , 2013, Letters in applied microbiology.

[101]  P. Baldrian,et al.  The Variability of the 16S rRNA Gene in Bacterial Genomes and Its Consequences for Bacterial Community Analyses , 2013, PloS one.

[102]  P. Collignon The importance of a One Health approach to preventing the development and spread of antibiotic resistance. , 2013, Current topics in microbiology and immunology.

[103]  Bing Li,et al.  Exploring variation of antibiotic resistance genes in activated sludge over a four-year period through a metagenomic approach. , 2013, Environmental science & technology.

[104]  Charity W. Law,et al.  voom: precision weights unlock linear model analysis tools for RNA-seq read counts , 2014, Genome Biology.

[105]  Peer Bork,et al.  Country-specific antibiotic use practices impact the human gut resistome , 2013, Genome research.

[106]  Robert A. Edwards,et al.  Multivariate Analysis of Functional Metagenomes , 2013, Front. Genet..

[107]  Steven Salzberg,et al.  GAGE-B: an evaluation of genome assemblers for bacterial organisms , 2013, Bioinform..

[108]  Bing Li,et al.  Fate of antibiotic resistance genes in sewage treatment plant revealed by metagenomic approach. , 2014, Water research.

[109]  Bing Li,et al.  Abundant rifampin resistance genes and significant correlations of antibiotic resistance genes and plasmids in various environments revealed by metagenomic analysis , 2014, Applied Microbiology and Biotechnology.

[110]  Yuan Zhang,et al.  A Scalable and Accurate Targeted Gene Assembly Tool (SAT-Assembler) for Next-Generation Sequencing Data , 2014, PLoS Comput. Biol..

[111]  Erik Kristiansson,et al.  Shotgun metagenomics reveals a wide array of antibiotic resistance genes and mobile elements in a polluted lake in India , 2014, Front. Microbiol..

[112]  Erik Kristiansson,et al.  BacMet: antibacterial biocide and metal resistance genes database , 2013, Nucleic Acids Res..

[113]  J. Handelsman,et al.  Diverse Antibiotic Resistance Genes in Dairy Cow Manure , 2014, mBio.

[114]  A. Blomberg,et al.  Metagenomics reveals that detoxification systems are underrepresented in marine bacterial communities , 2014, BMC Genomics.

[115]  Chien-Chi Lo,et al.  Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs , 2014, Scientific Reports.

[116]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[117]  Pascal Simonet,et al.  Large-Scale Metagenomic-Based Study of Antibiotic Resistance in the Environment , 2014, Current Biology.

[118]  S. Tringe,et al.  Tackling soil diversity with the assembly of large, complex metagenomes , 2014, Proceedings of the National Academy of Sciences.

[119]  Jukka Corander,et al.  Evolution and transmission of drug resistant tuberculosis in a Russian population , 2014, Nature Genetics.

[120]  V. Denef,et al.  RNA Preservation Agents and Nucleic Acid Extraction Method Bias Perceived Bacterial Community Composition , 2015, PloS one.

[121]  Johan Bengtsson-Palme,et al.  Antibiotic resistance genes in the environment: prioritizing risks , 2015, Nature Reviews Microbiology.

[122]  Johan Bengtsson-Palme,et al.  metaxa2: improved identification and taxonomic classification of small and large subunit rRNA in metagenomic data , 2015, Molecular ecology resources.

[123]  Jonathan Wilksch,et al.  Genomic analysis of diversity, population structure, virulence, and antimicrobial resistance in Klebsiella pneumoniae, an urgent threat to public health , 2015, Proceedings of the National Academy of Sciences.

[124]  Jay Shendure,et al.  Large-scale genomic sequencing of extraintestinal pathogenic Escherichia coli strains , 2015, Genome research.

[125]  Erik Kristiansson,et al.  The Human Gut Microbiome as a Transporter of Antibiotic Resistance Genes between Continents , 2015, Antimicrobial Agents and Chemotherapy.

[126]  F. Baquero,et al.  Tackling antibiotic resistance: the environmental framework , 2015, Nature Reviews Microbiology.

[127]  Lingling An,et al.  A robust approach for identifying differentially abundant features in metagenomic samples , 2015, Bioinform..

[128]  Scott Ferson,et al.  Accounting for uncertainty in DNA sequencing data. , 2015, Trends in genetics : TIG.

[129]  E. Kristiansson,et al.  Isolation of novel IncA/C and IncN fluoroquinolone resistance plasmids from an antibiotic-polluted lake. , 2015, The Journal of antimicrobial chemotherapy.

[130]  Teresa M. Coque,et al.  What is a resistance gene? Ranking risk in resistomes , 2014, Nature Reviews Microbiology.

[131]  Chao Xie,et al.  Fast and sensitive protein alignment using DIAMOND , 2014, Nature Methods.

[132]  M. Ellabaan,et al.  Limited dissemination of the wastewater treatment plant core resistome , 2015, Nature Communications.

[133]  J. Choo,et al.  Sample storage conditions significantly influence faecal microbiome profiles , 2015, Scientific Reports.

[134]  Elhanan Borenstein,et al.  MUSiCC: a marker genes based framework for metagenomic normalization and accurate profiling of gene abundances in the microbiome , 2014, bioRxiv.

[135]  Evelyn Schlenker Tips and Tricks for Successful Application of Statistical Methods to Biological Data. , 2016, Methods in molecular biology.

[136]  Amir Feizi,et al.  Strategies to improve usability and preserve accuracy in biological sequence databases , 2016, Proteomics.

[137]  Baoli Zhu,et al.  Dissemination of the mcr-1 colistin resistance gene. , 2016, The Lancet. Infectious diseases.

[138]  Bing Li,et al.  Metagenomic Assembly Reveals Hosts of Antibiotic Resistance Genes and the Shared Resistome in Pig, Chicken, and Human Feces. , 2016, Environmental science & technology.

[139]  E. Kristiansson,et al.  The structure and diversity of human, animal and environmental resistomes , 2016, Microbiome.

[140]  M. Tysklind,et al.  Elucidating selection processes for antibiotic resistance in sewage treatment plants using metagenomics. , 2016, The Science of the total environment.

[141]  Jianzhong Shen,et al.  Emergence of plasmid-mediated colistin resistance mechanism MCR-1 in animals and human beings in China: a microbiological and molecular biological study. , 2015, The Lancet. Infectious diseases.

[142]  F. Raymond,et al.  The initial state of the human gut microbiome determines its reshaping by antibiotics , 2015, The ISME Journal.

[143]  Molly K. Gibson,et al.  Developmental dynamics of the preterm infant gut microbiota and antibiotic resistome , 2016, Nature Microbiology.

[144]  Erik Kristiansson,et al.  Statistical evaluation of methods for identification of differentially abundant genes in comparative metagenomics , 2016, BMC Genomics.

[145]  R. Henrik Nilsson,et al.  Metaxa2 Diversity Tools: Easing microbial community analysis with Metaxa2 , 2016, Ecol. Informatics.

[146]  J. Rolain,et al.  Dissemination of the mcr-1 colistin resistance gene , 2016 .

[147]  A. von Haeseler,et al.  Next-generation sequencing diagnostics of bacteremia in septic patients , 2016, Genome Medicine.

[148]  Minh Duc Cao,et al.  Streaming algorithms for identification of pathogens and antibiotic resistance potential from real-time MinIONTM sequencing , 2015, bioRxiv.

[149]  Lisa C. Crossman,et al.  Identification of bacterial pathogens and antimicrobial resistance directly from clinical urines by nanopore-based metagenomic sequencing , 2017, The Journal of antimicrobial chemotherapy.

[150]  Johan Bengtsson-Palme,et al.  Antibiotic resistance in the food supply chain: where can sequencing and metagenomics aid risk assessment? , 2017 .

[151]  Erik Kristiansson,et al.  Variability in Metagenomic Count Data and Its Influence on the Identification of Differentially Abundant Genes , 2017, J. Comput. Biol..

[152]  F. Aarestrup,et al.  A sampling and metagenomic sequencing-based methodology for monitoring antimicrobial resistance in swine herds , 2017, The Journal of antimicrobial chemotherapy.

[153]  P. Pevzner,et al.  metaSPAdes: a new versatile metagenomic assembler. , 2017, Genome research.

[154]  Raymond Lo,et al.  CARD 2017: expansion and model-centric curation of the comprehensive antibiotic resistance database , 2016, Nucleic Acids Res..

[155]  Evan Bolton,et al.  Database resources of the National Center for Biotechnology Information , 2017, Nucleic Acids Res..