A genomic overview of the population structure of Salmonella

For many decades, Salmonella enterica has been subdivided by serological properties into serovars or further subdivided for epidemiological tracing by a variety of diagnostic tests with higher resolution. Recently, it has been proposed that so-called eBurst groups (eBGs) based on the alleles of seven housekeeping genes (legacy multilocus sequence typing [MLST]) corresponded to natural populations and could replace serotyping. However, this approach lacks the resolution needed for epidemiological tracing and the existence of natural populations had not been independently validated by independent criteria. Here, we describe EnteroBase, a web-based platform that assembles draft genomes from Illumina short reads in the public domain or that are uploaded by users. EnteroBase implements legacy MLST as well as ribosomal gene MLST (rMLST), core genome MLST (cgMLST), and whole genome MLST (wgMLST) and currently contains over 100,000 assembled genomes from Salmonella. It also provides graphical tools for visual interrogation of these genotypes and those based on core single nucleotide polymorphisms (SNPs). eBGs based on legacy MLST are largely consistent with eBGs based on rMLST, thus demonstrating that these correspond to natural populations. rMLST also facilitated the selection of representative genotypes for SNP analyses of the entire breadth of diversity within Salmonella. In contrast, cgMLST provides the resolution needed for epidemiological investigations. These observations show that genomic genotyping, with the assistance of EnteroBase, can be applied at all levels of diversity within the Salmonella genus.

[1]  Martin C. J. Maiden,et al.  BIGSdb: Scalable analysis of bacterial genome variation at the population level , 2010, BMC Bioinformatics.

[2]  Thibaut Jombart,et al.  Phylogenetic structure of European Salmonella Enteritidis outbreak correlates with national and international egg distribution network , 2016, Microbial genomics.

[3]  M. A. Suchard,et al.  Distinguishable Epidemics of Multidrug-Resistant Salmonella Typhimurium DT104 in Different Hosts , 2013, Science.

[4]  I. Van Walle,et al.  PulseNet International: Vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance , 2017, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[5]  Alexandre P. Francisco,et al.  GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens , 2017, bioRxiv.

[6]  Mark Achtman,et al.  Transforming Microbial Genotyping: A Robotic Pipeline for Genotyping Bacterial Strains , 2012, PloS one.

[7]  P. Ashton,et al.  Salmonella enterica serovar Typhimurium ST313 responsible for gastroenteritis in the UK are genetically distinct from isolates causing bloodstream infections in Africa , 2017, bioRxiv.

[8]  Gemma C. Langridge,et al.  Distinct Salmonella Enteritidis lineages associated with enterocolitis in high-income 1 settings and invasive disease in low-income settings , 2016 .

[9]  Gemma C. Langridge,et al.  Millennia of genomic stability within the invasive Para C Lineage of Salmonella enterica , 2017, bioRxiv.

[10]  F. Weill,et al.  WHO Collaborating Centre for Reference and Research on Salmonella ANTIGENIC FORMULAE OF THE SALMONELLA SEROVARS , 2007 .

[11]  Zhemin Zhou,et al.  Multilocus Sequence Typing as a Replacement for Serotyping in Salmonella enterica , 2012, PLoS pathogens.

[12]  João André Carriço,et al.  Adjusted Wallace Coefficient as a Measure of Congruence between Typing Methods , 2011, Journal of Clinical Microbiology.

[13]  Ruth Timme,et al.  Practical Value of Food Pathogen Traceability through Building a Whole-Genome Sequencing Network and Database , 2016, Journal of Clinical Microbiology.

[14]  Jacqueline A. Keane,et al.  An extended genotyping framework for Salmonella enterica serovar Typhi, the cause of human typhoid , 2016, Nature Communications.

[15]  Alexandre P. Francisco,et al.  GrapeTree: Visualization of core genomic relationships among 100,000 bacterial pathogens , 2017 .

[16]  Andrew Frost,et al.  Whole genome sequencing reveals an outbreak of Salmonella Enteritidis associated with reptile feeder mice in the United Kingdom, 2012-2015. , 2017, Food microbiology.

[17]  J. Rothberg,et al.  Prospective Genomic Characterization of the German Enterohemorrhagic Escherichia coli O104:H4 Outbreak by Rapid Next Generation Sequencing Technology , 2011, PloS one.

[18]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[19]  J. Wain,et al.  Intra-continental spread of human invasive Salmonella Typhimurium pathovariants in sub-Saharan Africa , 2012, Nature Genetics.

[20]  Keith A. Jolley,et al.  Ribosomal multilocus sequence typing: universal characterization of bacteria from domain to strain , 2012, Microbiology.

[21]  Paul Turner,et al.  Phylogeographical analysis of the dominant multidrug-resistant H58 clade of Salmonella Typhi identifies inter- and intracontinental transmission events , 2015, Nature Genetics.

[22]  Mark Achtman,et al.  Salmonella typhi, the causative agent of typhoid fever, is approximately 50,000 years old. , 2002, Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases.

[23]  M. Achtman,et al.  Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Claire Jenkins,et al.  Identification of Salmonella for public health surveillance using whole genome sequencing , 2016, PeerJ.

[25]  M. Achtman,et al.  Neutral Genomic Microevolution of a Recently Emerged Pathogen, Salmonella enterica Serovar Agona , 2013, PLoS genetics.

[26]  J. Bray,et al.  MLST revisited: the gene-by-gene approach to bacterial genomics , 2013, Nature Reviews Microbiology.

[27]  Eduardo N. Taboada,et al.  The Salmonella In Silico Typing Resource (SISTR): An Open Web-Accessible Tool for Rapidly Typing and Subtyping Draft Salmonella Genome Assemblies , 2016, PloS one.

[28]  S. Nair,et al.  Transient Darwinian selection in Salmonella enterica serovar Paratyphi A during 450 years of global spread of enteric fever , 2014, Proceedings of the National Academy of Sciences.

[29]  T Jombart,et al.  Prospective use of whole genome sequencing (WGS) detected a multi-country outbreak of Salmonella Enteritidis , 2016, Epidemiology and Infection.

[30]  Gemma C. Langridge,et al.  What’s in a Name? Species-Wide Whole-Genome Sequencing Resolves Invasive and Noninvasive Lineages of Salmonella enterica Serotype Paratyphi B , 2016, mBio.

[31]  Eduardo P C Rocha,et al.  Whole genome-based population biology and epidemiological surveillance of Listeria monocytogenes , 2016, Nature Microbiology.

[32]  Gemma C. Langridge,et al.  Patterns of genome evolution that have accompanied host adaptation in Salmonella , 2014, Proceedings of the National Academy of Sciences.

[33]  Camille Roth,et al.  Natural Scales in Geographical Patterns , 2017, Scientific Reports.

[34]  M. Achtman,et al.  Distinct Genealogies for Plasmids and Chromosome , 2014, PLoS genetics.

[35]  C. Warinner,et al.  Salmonella enterica genomes from victims of a major sixteenth-century epidemic in Mexico , 2018, Nature Ecology & Evolution.