Rapid and Easy In Silico Serotyping of Escherichia coli Isolates by Use of Whole-Genome Sequencing Data

ABSTRACT Accurate and rapid typing of pathogens is essential for effective surveillance and outbreak detection. Conventional serotyping of Escherichia coli is a delicate, laborious, time-consuming, and expensive procedure. With whole-genome sequencing (WGS) becoming cheaper, it has vast potential in routine typing and surveillance. The aim of this study was to establish a valid and publicly available tool for WGS-based in silico serotyping of E. coli applicable for routine typing and surveillance. A FASTA database of specific O-antigen processing system genes for O typing and flagellin genes for H typing was created as a component of the publicly available Web tools hosted by the Center for Genomic Epidemiology (CGE) (www.genomicepidemiology.org). All E. coli isolates available with WGS data and conventional serotype information were subjected to WGS-based serotyping employing this specific SerotypeFinder CGE tool. SerotypeFinder was evaluated on 682 E. coli genomes, 108 of which were sequenced for this study, where both the whole genome and the serotype were available. In total, 601 and 509 isolates were included for O and H typing, respectively. The O-antigen genes wzx, wzy, wzm, and wzt and the flagellin genes fliC, flkA, fllA, flmA, and flnA were detected in 569 and 508 genome sequences, respectively. SerotypeFinder for WGS-based O and H typing predicted 560 of 569 O types and 504 of 508 H types, consistent with conventional serotyping. In combination with other available WGS typing tools, E. coli serotyping can be performed solely from WGS data, providing faster and cheaper typing than current routine procedures and making WGS typing a superior alternative to conventional typing strategies.

[1]  Tetsuya Hayashi,et al.  A complete view of the genetic diversity of the Escherichia coli O-antigen biosynthesis gene cluster , 2014, DNA research : an international journal for rapid publication of reports on genes and genomes.

[2]  Shankar S. Changayil,et al.  Genome Sequences of 228 Shiga Toxin-Producing Escherichia coli Isolates and 12 Isolates Representing Other Diarrheagenic E. coli Pathotypes , 2014, Genome Announcements.

[3]  Matthias Reumann,et al.  WGS Analysis and Interpretation in Clinical and Public Health Microbiology Laboratories: What Are the Requirements and How Do Existing Tools Compare? , 2014, Pathogens.

[4]  Ole Lund,et al.  Real-Time Whole-Genome Sequencing for Routine Typing, Surveillance, and Outbreak Detection of Verotoxigenic Escherichia coli , 2014, Journal of Clinical Microbiology.

[5]  David A Rasko,et al.  Refining the pathovar paradigm via phylogenomics of the attaching and effacing Escherichia coli , 2013, Proceedings of the National Academy of Sciences.

[6]  S. Rasmussen,et al.  Identification of acquired antimicrobial resistance genes , 2012, The Journal of antimicrobial chemotherapy.

[7]  Stefano Morabito,et al.  Multicenter Evaluation of a Sequence-Based Protocol for Subtyping Shiga Toxins and Standardizing Stx Nomenclature , 2012, Journal of Clinical Microbiology.

[8]  Ole Lund,et al.  Multilocus Sequence Typing of Total-Genome-Sequenced Bacteria , 2012, Journal of Clinical Microbiology.

[9]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[10]  P. Reeves,et al.  The variation of O antigens in gram-negative bacteria. , 2010, Sub-cellular biochemistry.

[11]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[12]  B. Liu,et al.  A Genomic Islet Mediates Flagellar Phase Variation in Escherichia coli Strains Carrying the Flagellin-Specifying Locus flk , 2008, Journal of bacteriology.

[13]  C. Médigue,et al.  A New O-Antigen Gene Cluster Has a Key Role in the Virulence of the Escherichia coli Meningitis Clone O45:K1:H7 , 2007, Journal of bacteriology.

[14]  L. Beutin,et al.  Genetical and functional investigation of fliC genes encoding flagellar serotype H4 in wildtype strains of Escherichia coli and in a laboratory E. coli K-12 strain expressing flagellar antigen type H48 , 2005, BMC Microbiology.

[15]  F. Scheutz,et al.  Designation of O174 and O175 to temporary O groups OX3 and OX7, and six new E. coli O groups that include Verocytotoxin‐producing E. coli (VTEC): O176, O177, O178, O179, O180 and O181 , 2004, APMIS : acta pathologica, microbiologica, et immunologica Scandinavica.

[16]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[17]  Harry L. T. Mobley,et al.  Pathogenic Escherichia coli , 2004, Nature Reviews Microbiology.

[18]  A. Tominaga Characterization of six flagellin genes in the H3, H53 and H54 standard strains of Escherichia coli. , 2004, Genes & Genetic Systems.

[19]  P. Reeves,et al.  Species-Wide Variation in the Escherichia coli Flagellin (H-Antigen) Gene , 2003, Journal of bacteriology.

[20]  T. Whittam,et al.  Enteropathogenic Escherichia coli O157 Strains from Brazil , 2003, Emerging infectious diseases.

[21]  A. Fruth,et al.  Subtyping of pathogenic Escherichia coli strains using flagellar (H)-antigens: serotyping versus fliC polymorphisms. , 2003, International journal of medical microbiology : IJMM.

[22]  F. Grimont,et al.  Identification of Escherichia coli flagellar types by restriction of the amplified fliC gene. , 2000, Research in microbiology.

[23]  I S Roberts,et al.  Structure, assembly and regulation of expression of capsules in Escherichia coli , 1999, Molecular microbiology.

[24]  Y. Ratiner New Flagellin-Specifying Genes in SomeEscherichia coli Strains , 1998, Journal of bacteriology.

[25]  B. Swaminathan,et al.  Molecular characterization of the gene encoding H antigen in Escherichia coli and development of a PCR-restriction fragment length polymorphism test for identification of E. coli O157:H7 and O157:NM , 1997, Journal of clinical microbiology.

[26]  T. Joys,et al.  The flagellar filament protein. , 1988, Canadian journal of microbiology.

[27]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[28]  F. Neidhardt,et al.  Escherichia Coli and Salmonella: Typhimurium Cellular and Molecular Biology , 1987 .

[29]  I. Ørskov,et al.  2 Serotyping of Escherichia coli , 1984 .

[30]  Y. Ratiner Presence of two structural genes determining antigenically different phase-specific flagellins in some Escherichia coli strains , 1983 .

[31]  I. Orskov,et al.  Serology, chemistry, and genetics of O and K antigens of Escherichia coli. , 1977, Bacteriological reviews.

[32]  I. Orskov,et al.  Two new Escherichia coli o antigens, o162 and o163, and one new h antigen, h56. withdrawal of h antigen h50. , 2009, Acta pathologica et microbiologica Scandinavica. Section B, Microbiology.

[33]  F. Kauffmann The serology of the coli group. , 1947, Journal of immunology.