The Validation and Implications of Using Whole Genome Sequencing as a Replacement for Traditional Serotyping for a National Salmonella Reference Laboratory

Salmonella serotyping remains the gold-standard tool for the classification of Salmonella isolates and forms the basis of Canada’s national surveillance program for this priority foodborne pathogen. Public health officials have been increasingly looking toward whole genome sequencing (WGS) to provide a large set of data from which all the relevant information about an isolate can be mined. However, rigorous validation and careful consideration of potential implications in the replacement of traditional surveillance methodologies with WGS data analysis tools is needed. Two in silico tools for Salmonella serotyping have been developed, the Salmonella in silico Typing Resource (SISTR) and SeqSero, while seven gene MLST for serovar prediction can be adapted for in silico analysis. All three analysis methods were assessed and compared to traditional serotyping techniques using a set of 813 verified clinical and laboratory isolates, including 492 Canadian clinical isolates and 321 isolates of human and non-human sources. Successful results were obtained for 94.8, 88.2, and 88.3% of the isolates tested using SISTR, SeqSero, and MLST, respectively, indicating all would be suitable for maintaining historical records, surveillance systems, and communication structures currently in place and the choice of the platform used will ultimately depend on the users need. Results also pointed to the need to reframe serotyping in the genomic era as a test to understand the genes that are carried by an isolate, one which is not necessarily congruent with what is antigenically expressed. The adoption of WGS for serotyping will provide the simultaneous collection of information that can be used by multiple programs within the current surveillance paradigm; however, this does not negate the importance of the various programs or the role of serotyping going forward.

[1]  E. Lingohr,et al.  Multi-laboratory evaluation of the rapid genoserotyping array (SGSA) for the identification of Salmonella serovars. , 2014, Diagnostic microbiology and infectious disease.

[2]  E. J. Parmley,et al.  A Canadian application of one health: integration of Salmonella data from various Canadian surveillance programs (2005-2010). , 2013, Foodborne pathogens and disease.

[3]  Andrea Nesbitt,et al.  Estimates of the burden of foodborne illness in Canada for 30 specified pathogens and unspecified agents, circa 2006. , 2013, Foodborne pathogens and disease.

[4]  Pierre Wattiau,et al.  Methodologies for Salmonella enterica subsp. enterica Subtyping: Gold Standards and Alternatives , 2011, Applied and Environmental Microbiology.

[5]  John H. E. Nash,et al.  Evaluation of Molecular Methods for Identification of Salmonella Serovars , 2016, Journal of Clinical Microbiology.

[6]  F. Aarestrup,et al.  WHO Global Salm-Surv External Quality Assurance System for Serotyping of Salmonella Isolates from 2000 to 2007 , 2009, Journal of Clinical Microbiology.

[7]  Henrik C Wegener,et al.  Global monitoring of Salmonella serovar distribution from the World Health Organization Global Foodborne Infections Network Country Data Bank: results of quality assured laboratories from 2001 to 2007. , 2011, Foodborne pathogens and disease.

[8]  Claire Jenkins,et al.  Identification of Salmonella for public health surveillance using whole genome sequencing , 2016, PeerJ.

[9]  B Rowe,et al.  A mechanised microtechnique for salmonella serotyping. , 1980, Journal of clinical pathology.

[10]  L. Hutwagner,et al.  Using laboratory-based surveillance data for prevention: an algorithm for detecting Salmonella outbreaks. , 1997, Emerging infectious diseases.

[11]  Seonghan Kim,et al.  A Multiplex PCR based Method for the Identification of common Clinical Serotypes of Salmonella enterica subspecies enterica , 2006 .

[12]  Zhemin Zhou,et al.  Multilocus Sequence Typing as a Replacement for Serotyping in Salmonella enterica , 2012, PLoS pathogens.

[13]  J. Ronholm,et al.  Navigating Microbiological Food Safety in the Era of Whole-Genome Sequencing , 2016, Clinical Microbiology Reviews.

[14]  Steven Salzberg,et al.  BIOINFORMATICS ORIGINAL PAPER , 2004 .

[15]  Martin Wiedmann,et al.  Omics approaches in food safety: fulfilling the promise? , 2014, Trends in microbiology.

[16]  Yanlong Yin,et al.  Salmonella Serotype Determination Utilizing High-Throughput Genome Sequencing Data , 2015, Journal of Clinical Microbiology.

[17]  Eduardo N. Taboada,et al.  The Salmonella In Silico Typing Resource (SISTR): An Open Web-Accessible Tool for Rapidly Typing and Subtyping Draft Salmonella Genome Assemblies , 2016, PloS one.

[18]  Qing Liu,et al.  Effect of Deletion of Genes Involved in Lipopolysaccharide Core and O-Antigen Synthesis on Virulence and Immunogenicity of Salmonella enterica Serovar Typhimurium , 2011, Infection and Immunity.

[19]  E. Tietze,et al.  Population Structure of Salmonella enterica Serovar 4,[5],12:b:− Strains and Likely Sources of Human Infection , 2013, Applied and Environmental Microbiology.

[20]  V. Jasson,et al.  Extensive Genetic Variability Linked to IS26 Insertions in the fljB Promoter Region of Atypical Monophasic Variants of Salmonella enterica Serovar Typhimurium , 2015, Applied and Environmental Microbiology.

[21]  A. Fazil,et al.  Estimates of Foodborne Illness–Related Hospitalizations and Deaths in Canada for 30 Specified Pathogens and Unspecified Agents , 2015, Foodborne pathogens and disease.

[22]  Brendan R. Jackson,et al.  Outbreak-associated Salmonella enterica Serotypes and Food Commodities, United States, 1998–2008 , 2013, Emerging infectious diseases.

[23]  R. Parreñas,et al.  Sequencing and Comparative Analysis of Flagellin Genes fliC, fljB, and flpA from Salmonella , 2004, Journal of Clinical Microbiology.

[24]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[25]  P. Ashton,et al.  Identification and typing of Salmonella for public health surveillance using whole genome sequencing , 2015 .

[26]  W. Ewing,et al.  Edwards and Ewing's Identification of Enterobacteriaceae , 1986 .

[27]  B. Swaminathan,et al.  Building PulseNet International: an interconnected system of laboratory networks to facilitate timely public health recognition and response to foodborne disease outbreaks and emerging foodborne diseases. , 2006, Foodborne pathogens and disease.

[28]  Andrew Mackinnon,et al.  A spreadsheet for the calculation of comprehensive statistics for the assessment of diagnostic tests and inter-rater agreement , 2000, Comput. Biol. Medicine.

[29]  Sergey I. Nikolenko,et al.  SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing , 2012, J. Comput. Biol..

[30]  Seonghan Kim,et al.  Multiplex PCR-Based Method for Identification of Common Clinical Serotypes of Salmonella enterica subsp. enterica , 2006, Journal of Clinical Microbiology.

[31]  M. Achtman,et al.  Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[32]  B. Liu,et al.  Genetics and Evolution of the Salmonella Galactose-Initiated Set of O Antigens , 2013, PloS one.

[33]  G. Grassl,et al.  Same species, different diseases: how and why typhoidal and non-typhoidal Salmonella enterica serovars differ , 2014, Front. Microbiol..

[34]  R. C. Jiménez,et al.  Mechanisms of O-Antigen Structural Variation of Bacterial Lipopolysaccharide (LPS) , 2012 .

[35]  G. Domselaar,et al.  Public Health Genomics and the New Molecular Epidemiology of Bacterial Pathogens , 2013, Public Health Genomics.

[36]  F. Weill,et al.  WHO Collaborating Centre for Reference and Research on Salmonella ANTIGENIC FORMULAE OF THE SALMONELLA SEROVARS , 2007 .