Defining the Phylogenomics of Shigella Species: a Pathway to Diagnostics

ABSTRACT Shigellae cause significant diarrheal disease and mortality in humans, as there are approximately 163 million episodes of shigellosis and 1.1 million deaths annually. While significant strides have been made in the understanding of the pathogenesis, few studies on the genomic content of the Shigella species have been completed. The goal of this study was to characterize the genomic diversity of Shigella species through sequencing of 55 isolates representing members of each of the four Shigella species: S. flexneri, S. sonnei, S. boydii, and S. dysenteriae. Phylogeny inferred from 336 available Shigella and Escherichia coli genomes defined exclusive clades of Shigella; conserved genomic markers that can identify each clade were then identified. PCR assays were developed for each clade-specific marker, which was combined with an amplicon for the conserved Shigella invasion antigen, IpaH3, into a multiplex PCR assay. This assay demonstrated high specificity, correctly identifying 218 of 221 presumptive Shigella isolates, and sensitivity, by not identifying any of 151 diverse E. coli isolates incorrectly as Shigella. This new phylogenomics-based PCR assay represents a valuable tool for rapid typing of uncharacterized Shigella isolates and provides a framework that can be utilized for the identification of novel genomic markers from genomic data.

[1]  J. Gregory Caporaso,et al.  The large-scale blast score ratio (LS-BSR) pipeline: a method to rapidly compare genetic content between bacterial genomes , 2014, PeerJ.

[2]  Inacio Mandomando,et al.  Burden and aetiology of diarrhoeal disease in infants and young children in developing countries (the Global Enteric Multicenter Study, GEMS): a prospective, case-control study , 2013, The Lancet.

[3]  David A Rasko,et al.  Refining the pathovar paradigm via phylogenomics of the attaching and effacing Escherichia coli , 2013, Proceedings of the National Academy of Sciences.

[4]  T. Farag,et al.  Housefly Population Density Correlates with Shigellosis among Children in Mirzapur, Bangladesh: A Time Series Analysis , 2013, PLoS neglected tropical diseases.

[5]  M. Gelfand,et al.  Evolution of Pan-Genomes of Escherichia coli, Shigella spp., and Salmonella enterica , 2013, Journal of bacteriology.

[6]  M. Pop,et al.  Quantitative PCR for Detection of Shigella Improves Ascertainment of Shigella Burden in Children with Moderate-to-Severe Diarrhea in Low-Income Countries , 2013, Journal of Clinical Microbiology.

[7]  Dani Cohen,et al.  The Global Enteric Multicenter Study (GEMS) of Diarrheal Disease in Infants and Young Children in Developing Countries: Epidemiologic and Clinical Methods of the Case/Control Study , 2012, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[8]  J. Tate,et al.  Risk Factors for Death among Children Less than 5 Years Old Hospitalized with Diarrhea in Rural Western Kenya, 2005–2007: A Cohort Study , 2012, PLoS medicine.

[9]  D. Rasko,et al.  Phylomark, a Tool To Identify Conserved Phylogenetic Markers from Whole-Genome Alignments , 2012, Applied and Environmental Microbiology.

[10]  D. Rasko,et al.  Draft Genome Sequences of the Diarrheagenic Escherichia coli Collection , 2012, Journal of bacteriology.

[11]  R. Welch,et al.  Atypical Shigella boydii 13 encodes virulence factors seen in attaching and effacing Escherichia coli. , 2012, FEMS microbiology letters.

[12]  Sung-Hou Kim,et al.  Whole-genome phylogeny of Escherichia coli/Shigella group by feature frequency profiles (FFPs) , 2011, Proceedings of the National Academy of Sciences.

[13]  Steven Salzberg,et al.  Mugsy: fast multiple alignment of closely related whole genomes , 2010, Bioinform..

[14]  Jason W. Sahl,et al.  A Comparative Genomic Analysis of Diverse Clonal Types of Enterotoxigenic Escherichia coli Reveals Pathovar-Specific Conservation , 2010, Infection and Immunity.

[15]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[16]  Matthew Berriman,et al.  Iterative Correction of Reference Nucleotides (iCORN) using second generation sequencing technology , 2010, Bioinform..

[17]  M. Berriman,et al.  Improving draft assemblies by iterative mapping and assembly of short reads to eliminate gaps , 2010, Genome Biology.

[18]  Paramvir S. Dehal,et al.  FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments , 2010, PloS one.

[19]  Miriam L. Land,et al.  Trace: Tennessee Research and Creative Exchange Prodigal: Prokaryotic Gene Recognition and Translation Initiation Site Identification Recommended Citation Prodigal: Prokaryotic Gene Recognition and Translation Initiation Site Identification , 2022 .

[20]  Masahira Hattori,et al.  Comparative genomics reveal the mechanism of the parallel evolution of O157 and non-O157 enterohemorrhagic Escherichia coli , 2009, Proceedings of the National Academy of Sciences.

[21]  I. Filliol,et al.  A new multiplex PCR for differential identification of Shigella flexneri and Shigella sonnei and detection of Shigella virulence determinants , 2009, Epidemiology and Infection.

[22]  Thomas M. Keane,et al.  ABACAS: algorithm-based automatic contiguation of assembled sequences , 2009, Bioinform..

[23]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[24]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[25]  A. Danchin,et al.  Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths , 2009, PLoS genetics.

[26]  P. Gajer,et al.  The Pangenome Structure of Escherichia coli: Comparative Genomic Analysis of E. coli Commensal and Pathogenic Isolates , 2008, Journal of bacteriology.

[27]  P. Reeves,et al.  Structure and genetics of Shigella O antigens. , 2008, FEMS microbiology reviews.

[28]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[29]  V. Ramisse,et al.  Selection and Validation of a Multilocus Variable-Number Tandem-Repeat Analysis Panel for Typing Shigella spp , 2008, Journal of Clinical Microbiology.

[30]  Ruth Hershberg,et al.  Reduced selection leads to accelerated gene loss in Shigella , 2007, Genome Biology.

[31]  Daniel Falush,et al.  Sex and virulence in Escherichia coli: an evolutionary perspective , 2006, Molecular microbiology.

[32]  D. Bryant,et al.  A Simple and Robust Statistical Test for Detecting the Presence of Recombination , 2006, Genetics.

[33]  Jun Yu,et al.  Revisiting the Molecular Evolutionary History of Shigella spp. , 2006, Journal of Molecular Evolution.

[34]  E. Murphy,et al.  Iron and Pathogenesis of Shigella: Iron Acquisition in the Intracellular Environment , 2006, Biometals.

[35]  Rosanna Lagos,et al.  Surveillance for antimicrobial resistance profiles among Shigella species isolated from a semirural community in the northern administrative area of santiago, chile. , 2005, The American journal of tropical medicine and hygiene.

[36]  Jacques Ravel,et al.  Visualization of comparative genomic analyses by BLAST score ratio , 2005, BMC Bioinformatics.

[37]  Adam M. Phillippy,et al.  Comparative genome assembly , 2004, Briefings Bioinform..

[38]  Robert C. Edgar,et al.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity , 2004, BMC Bioinformatics.

[39]  J. Clemens,et al.  Detection of Shigella by a PCR Assay Targeting the ipaH Gene Suggests Increased Prevalence of Shigellosis in Nha Trang, Vietnam , 2004, Journal of Clinical Microbiology.

[40]  Ruiting Lan,et al.  Escherichia coli in disguise: molecular origins of Shigella. , 2002, Microbes and infection.

[41]  N. Moran,et al.  Microbial Minimalism Genome Reduction in Bacterial Pathogens , 2002, Cell.

[42]  A. Maurelli,et al.  Pathoadaptive Mutations That Enhance Virulence: Genetic Organization of the cadA Regions ofShigella spp , 2001, Infection and Immunity.

[43]  G. Pupo,et al.  Multiple independent origins of Shigella clones of Escherichia coli and convergent evolution of many of their characteristics. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Eugene W. Myers,et al.  A whole-genome assembly of Drosophila. , 2000, Science.

[45]  J. Johnson Shigella and Escherichia coli at the crossroads: machiavellian masqueraders or taxonomic treachery? , 2000, Journal of medical microbiology.

[46]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.

[47]  M. Levine,et al.  Population-based study of the incidence of Shigella diarrhea and causative serotypes in Santiago, Chile. , 1999, The Pediatric infectious disease journal.

[48]  D. Swerdlow,et al.  Global burden of Shigella infections: implications for vaccine development and implementation of control strategies. , 1999, Bulletin of the World Health Organization.

[49]  M. Achtman,et al.  Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[50]  H. Lior,et al.  Evaluation of commercial antisera for Shigella serogrouping , 1995, Journal of clinical microbiology.

[51]  Yu-Hui Lin,et al.  Analysis of clonal relationships among isolates of Shigella sonnei by different molecular typing methods , 1995, Journal of clinical microbiology.

[52]  S. Faruque,et al.  Differentiation of Shigella flexneri strains by rRNA gene restriction patterns , 1992, Journal of clinical microbiology.

[53]  D. Dykhuizen,et al.  Recombination in Escherichia coli and the definition of biological species , 1991, Journal of bacteriology.

[54]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[55]  G. Schoolnik,et al.  Detection of Shigella in feces using DNA amplification. , 1990, The Journal of infectious diseases.

[56]  H. Ochman,et al.  Standard reference strains of Escherichia coli from natural populations , 1984, Journal of bacteriology.

[57]  E. Wiley,et al.  The Evolutionary Species Concept Reconsidered , 1978 .

[58]  William H. Ewing SHIGELLA NOMENCLATURE , 1949, Journal of bacteriology.