Whole genome sequencing options for bacterial strain typing and epidemiologic analysis based on single nucleotide polymorphism versus gene-by-gene-based approaches.

BACKGROUND Whole genome sequence (WGS)-based strain typing finds increasing use in the epidemiologic analysis of bacterial pathogens in both public health as well as more localized infection control settings. AIMS This minireview describes methodologic approaches that have been explored for WGS-based epidemiologic analysis and considers the challenges and pitfalls of data interpretation. SOURCES Personal collection of relevant publications. CONTENT When applying WGS to study the molecular epidemiology of bacterial pathogens, genomic variability between strains is translated into measures of distance by determining single nucleotide polymorphisms in core genome alignments or by indexing allelic variation in hundreds to thousands of core genes, assigning types to unique allelic profiles. Interpreting isolate relatedness from these distances is highly organism specific, and attempts to establish species-specific cutoffs are unlikely to be generally applicable. In cases where single nucleotide polymorphism or core gene typing do not provide the resolution necessary for accurate assessment of the epidemiology of bacterial pathogens, inclusion of accessory gene or plasmid sequences may provide the additional required discrimination. IMPLICATIONS As with all epidemiologic analysis, realizing the full potential of the revolutionary advances in WGS-based approaches requires understanding and dealing with issues related to the fundamental steps of data generation and interpretation.

[1]  I. Van Walle,et al.  PulseNet International: Vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance , 2017, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[2]  Julian Parkhill,et al.  Evaluation of an Optimal Epidemiological Typing Scheme for Legionella pneumophila with Whole-Genome Sequence Data Using Validation Guidelines , 2016, Journal of Clinical Microbiology.

[3]  Brian D. Ondov,et al.  The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes , 2014, Genome Biology.

[4]  Jacqueline A. Keane,et al.  Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins , 2014, Nucleic acids research.

[5]  M. Forsman,et al.  Phylogeographical pattern of Francisella tularensis in a nationwide outbreak of tularaemia in Norway, 2011. , 2015, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[6]  D. Falush,et al.  Progressive genomic convergence of two Helicobacter pylori strains during mixed infection of a patient with chronic gastritis , 2014, Gut.

[7]  Ö. Melefors,et al.  Plasmidome-Analysis of ESBL-Producing Escherichia coli Using Conventional Typing and High-Throughput Sequencing , 2013, PloS one.

[8]  Steen Ethelberg,et al.  Whole-genome Sequencing Used to Investigate a Nationwide Outbreak of Listeriosis Caused by Ready-to-eat Delicatessen Meat, Denmark, 2014. , 2016, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[9]  D H Persing,et al.  Interpreting chromosomal DNA restriction patterns produced by pulsed-field gel electrophoresis: criteria for bacterial strain typing , 1995, Journal of clinical microbiology.

[10]  Sebastian Höppner,et al.  Rapid High Resolution Genotyping of Francisella tularensis by Whole Genome Sequence Comparison of Annotated Genes (“MLST+”) , 2015, PloS one.

[11]  C. Deming,et al.  Plasmid Dynamics in KPC-Positive Klebsiella pneumoniae during Long-Term Patient Colonization , 2016, mBio.

[12]  Chen Liu,et al.  Defining and Evaluating a Core Genome Multilocus Sequence Typing Scheme for Whole-Genome Sequence-Based Typing of Klebsiella pneumoniae , 2017, Front. Microbiol..

[13]  K. Holt,et al.  Klebsiella pneumoniae Population Genomics and Antimicrobial-Resistant Clones. , 2016, Trends in microbiology.

[14]  Julian Parkhill,et al.  Inferring patient to patient transmission of Mycobacterium tuberculosis from whole genome sequencing data , 2013, BMC Infectious Diseases.

[15]  D. Leach,et al.  Bacterial Genome Instability , 2014, Microbiology and Molecular Reviews.

[16]  Travis M. Drucker,et al.  Comparison of Whole-Genome Sequencing Methods for Analysis of Three Methicillin-Resistant Staphylococcus aureus Outbreaks , 2017, Journal of Clinical Microbiology.

[17]  Jianming Liang,et al.  Detection of Multiple Parallel Transmission Outbreak of Streptococcus suis Human Infection by Use of Genome Epidemiology, China, 2005 , 2017, Emerging infectious diseases.

[18]  S. Brisse,et al.  Genomic Definition of Hypervirulent and Multidrug-Resistant Klebsiella pneumoniae Clonal Groups , 2014, Emerging infectious diseases.

[19]  Daniel J. Wilson,et al.  Whole-genome sequencing to determine transmission of Neisseria gonorrhoeae: an observational study. , 2016, The Lancet. Infectious diseases.

[20]  C. Beard,et al.  Whole genome multilocus sequence typing as an epidemiologic tool for Yersinia pestis. , 2016, Diagnostic microbiology and infectious disease.

[21]  Jaideep P. Sundaram,et al.  Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial "pan-genome". , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[22]  M. Pirmohamed,et al.  Genome-Based Infection Tracking Reveals Dynamics of Clostridium difficile Transmission and Disease Recurrence , 2015, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[23]  Eduardo N. Taboada,et al.  Whole-Genome Sequencing in Epidemiology of Campylobacter jejuni Infections , 2017, Journal of Clinical Microbiology.

[24]  S. Bentley,et al.  Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences. , 2012, Future microbiology.

[25]  J. Martinez-Urtaza,et al.  Defining a Core Genome Multilocus Sequence Typing Scheme for the Global Epidemiology of Vibrio parahaemolyticus , 2017, Journal of Clinical Microbiology.

[26]  Julian Parkhill,et al.  Whole Genome Sequence Analysis of a Large Isoniazid-Resistant Tuberculosis Outbreak in London: A Retrospective Observational Study , 2016, PLoS medicine.

[27]  A. Mellmann,et al.  Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infection Control in an Institutional Setting , 2016, Journal of Clinical Microbiology.

[28]  Julian Parkhill,et al.  Comparison of bacterial genome assembly software for MinION data and their applicability to medical microbiology , 2016, bioRxiv.

[29]  Anna E. Sheppard,et al.  Plasmid Classification in an Era of Whole-Genome Sequencing: Application in Studies of Antibiotic Resistance Epidemiology , 2017, Front. Microbiol..

[30]  Andrew J. Page,et al.  Roary: rapid large-scale prokaryote pan genome analysis , 2015, bioRxiv.

[31]  Maliha Aziz,et al.  NASP: an accurate, rapid method for the identification of SNPs in WGS datasets that supports flexible input and output formats. , 2016, Microbial genomics.

[32]  Gary Van Domselaar,et al.  A Comparative Analysis of the Lyve-SET Phylogenomics Pipeline for Genomic Epidemiology of Foodborne Pathogens , 2017, Front. Microbiol..

[33]  M. Achtman,et al.  Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[34]  C. Constantinidou,et al.  Genomic epidemiology of a protracted hospital outbreak caused by multidrug-resistant Acinetobacter baumannii in Birmingham, England , 2014, Genome Medicine.

[35]  Dag Harmsen,et al.  Development and evaluation of a core genome multilocus typing scheme for whole-genome sequence-based typing of Acinetobacter baumannii , 2017, PloS one.

[36]  Alan McNally,et al.  Why prokaryotes have pangenomes , 2017, Nature Microbiology.

[37]  Jonas Korlach,et al.  Single-molecule sequencing to track plasmid diversity of hospital-associated carbapenemase-producing Enterobacteriaceae , 2014, Science Translational Medicine.

[38]  J. Corander,et al.  Combined Analysis of Variation in Core, Accessory and Regulatory Genome Regions Provides a Super-Resolution View into the Evolution of Bacterial Populations , 2016, PLoS genetics.

[39]  D. Cirillo,et al.  Molecular typing of Mycobacterium Abscessus isolated from cystic fibrosis patients , 2017, International journal of mycobacteriology.

[40]  Lori A. S. Snyder,et al.  Epidemiological investigation of Pseudomonas aeruginosa isolates from a six-year-long hospital outbreak using high-throughput whole genome sequencing. , 2013, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[41]  D. Falush,et al.  Helicobacter pylori genome evolution during human infection , 2011, Proceedings of the National Academy of Sciences.

[42]  John P. Dekker,et al.  Commentary: Next-Generation Epidemiology: Using Real-Time Core Genome Multilocus Sequence Typing To Support Infection Control Policy , 2016, Journal of Clinical Microbiology.

[43]  H. Hasman,et al.  WGS-based surveillance of third-generation cephalosporin-resistant Escherichia coli from bloodstream infections in Denmark , 2017, The Journal of antimicrobial chemotherapy.

[44]  Bradd J. Haley,et al.  Genomic and Evolutionary Analysis of Two Salmonella enterica Serovar Kentucky Sequence Types Isolated from Bovine and Poultry Sources in North America , 2016, PloS one.

[45]  Jianzhong Shen,et al.  Emergence of plasmid-mediated colistin resistance mechanism MCR-1 in animals and human beings in China: a microbiological and molecular biological study. , 2015, The Lancet. Infectious diseases.

[46]  Dag Harmsen,et al.  Defining and Evaluating a Core Genome Multilocus Sequence Typing Scheme for Whole-Genome Sequence-Based Typing of Listeria monocytogenes , 2015, Journal of Clinical Microbiology.

[47]  D. Raoult,et al.  The bacterial pangenome as a new tool for analysing pathogenic bacteria , 2015, New microbes and new infections.

[48]  Carl T. Bergstrom,et al.  Bacteria are different: observations, interpretations, speculations, and opinions about the mechanisms of adaptive evolution in prokaryotes. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[49]  P. Worning,et al.  Monitoring meticillin resistant Staphylococcus aureus and its spread in Copenhagen, Denmark, 2013, through routine whole genome sequencing. , 2015, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[50]  V. Sintchenko,et al.  Whole genome sequencing of Salmonella Typhimurium illuminates distinct outbreaks caused by an endemic multi-locus variable number tandem repeat analysis type in Australia, 2014 , 2016, BMC Microbiology.

[51]  Yun-Xin Fu,et al.  Evidence for Recombination in Mycobacterium tuberculosis , 2006, Journal of bacteriology.

[52]  Robert G. Beiko,et al.  SNVPhyl: a single nucleotide variant phylogenomics pipeline for microbial genomic epidemiology , 2016, bioRxiv.

[53]  G. Domselaar,et al.  Usefulness of High-Quality Core Genome Single-Nucleotide Variant Analysis for Subtyping the Highly Clonal and the Most Prevalent Salmonella enterica Serovar Heidelberg Clone in the Context of Outbreak Investigations , 2015, Journal of Clinical Microbiology.

[54]  Evan S Snitkin,et al.  Tracking a Hospital Outbreak of Carbapenem-Resistant Klebsiella pneumoniae with Whole-Genome Sequencing , 2012, Science Translational Medicine.

[55]  G. Smith,et al.  Rapid bacterial whole-genome sequencing to enhance diagnostic and public health microbiology. , 2013, JAMA internal medicine.

[56]  Ruth E. Timme,et al.  Benchmark datasets for phylogenomic pipeline validation, applications for foodborne pathogen surveillance , 2017, PeerJ.

[57]  Anna E. Sheppard,et al.  Nested Russian Doll-Like Genetic Mobility Drives Rapid Dissemination of the Carbapenem Resistance Gene blaKPC , 2015, Antimicrobial Agents and Chemotherapy.

[58]  E. Holmes,et al.  Recombination within natural populations of pathogenic bacteria: short-term empirical estimates and long-term phylogenetic consequences. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[59]  F. de la Cruz,et al.  Plasmid Diversity and Adaptation Analyzed by Massive Sequencing of Escherichia coli Plasmids , 2014, Microbiology spectrum.

[60]  R. Köck,et al.  From theory to practice: molecular strain typing for the clinical and public health setting. , 2013, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[61]  J. Corander,et al.  Detection of recombination events in bacterial genomes from large population samples , 2011, Nucleic acids research.

[62]  B. Duim,et al.  Whole genome sequence analysis indicates recent diversification of mammal-associated Campylobacter fetus and implicates a genetic factor associated with H2S production , 2016, BMC Genomics.

[63]  Torsten Seemann,et al.  Prospective Whole-Genome Sequencing Enhances National Surveillance of Listeria monocytogenes , 2015, Journal of Clinical Microbiology.

[64]  K. Holt,et al.  Genetic diversity, mobilisation and spread of the yersiniabactin-encoding mobile element ICEKp in Klebsiella pneumoniae populations , 2018 .

[65]  Stefan Niemann,et al.  Whole-Genome-Based Mycobacterium tuberculosis Surveillance: a Standardized, Portable, and Expandable Approach , 2014, Journal of Clinical Microbiology.

[66]  Dag Harmsen,et al.  Bacterial Whole-Genome Sequencing Revisited: Portable, Scalable, and Standardized Analysis for Typing and Detection of Virulence and Antibiotic Resistance Genes , 2014, Journal of Clinical Microbiology.

[67]  J A Carriço,et al.  A primer on microbial bioinformatics for nonbioinformaticians. , 2018, Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases.

[68]  Malbert R. C. Rogers,et al.  Core Genome Multilocus Sequence Typing Scheme for High-Resolution Typing of Enterococcus faecium , 2015, Journal of Clinical Microbiology.

[69]  Ruth Timme,et al.  Practical Value of Food Pathogen Traceability through Building a Whole-Genome Sequencing Network and Database , 2016, Journal of Clinical Microbiology.

[70]  Keith A. Jolley,et al.  Real-Time Genomic Epidemiological Evaluation of Human Campylobacter Isolates by Use of Whole-Genome Multilocus Sequence Typing , 2013, Journal of Clinical Microbiology.

[71]  H. Wahlström,et al.  Comparison of whole genome sequencing typing results and epidemiological contact information from outbreaks of Salmonella Dublin in Swedish cattle herds , 2016, Infection ecology & epidemiology.