Pan-genome analysis provides much higher strain typing resolution than multi-locus sequence typing.

The most widely used DNA-based method for bacterial strain typing, multi-locus sequence typing (MLST), lacks sufficient resolution to distinguish among many bacterial strains within a species. Here, we show that strain typing based on the presence or absence of distributed genes is able to resolve all completely sequenced genomes of six bacterial species. This was accomplished by the development of a clustering method, neighbour grouping, which is completely consistent with the lower-resolution MLST method, but provides far greater resolving power. Because the presence/absence of distributed genes can be determined by low-cost microarray analyses, it offers a practical, high-resolution alternative to MLST that could provide valuable diagnostic and prognostic information for pathogenic bacterial species.

[1]  Justin S. Hogg,et al.  Characterization and modeling of the Haemophilus influenzae core and supragenomes based on the complete genomic sequences of Rd and 12 clinical nontypeable strains , 2007, Genome Biology.

[2]  Miriam Barlow,et al.  Phylogenetic analysis as a tool in molecular epidemiology of infectious diseases. , 2006, Annals of epidemiology.

[3]  A. Hughes,et al.  Nucleotide Substitution and Recombination at Orthologous Loci in Staphylococcus aureus , 2005, Journal of bacteriology.

[4]  Evan Powell,et al.  Comparative Genomic Analyses of Seventeen Streptococcus pneumoniae Strains: Insights into the Pneumococcal Supragenome , 2007, Journal of bacteriology.

[5]  B. Spratt,et al.  Further evidence for the non-clonal population structure of Neisseria gonorrhoeae: extensive genetic diversity within isolates of the same electrophoretic type. , 1994, Microbiology.

[6]  B. Spratt,et al.  How Clonal Is Staphylococcus aureus? , 2003, Journal of bacteriology.

[7]  Jaideep P. Sundaram,et al.  Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial "pan-genome". , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[8]  M. Pérez‐Losada,et al.  Population genetics of microbial pathogens estimated from multilocus sequence typing (MLST) data. , 2006, Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases.

[9]  David W Ussery,et al.  Characterization of probiotic Escherichia coli isolates with a novel pan-genome microarray , 2007, Genome Biology.

[10]  D. Graham,et al.  Population genetic analysis of Helicobacter pylori by multilocus enzyme electrophoresis: extensive allelic diversity and recombinational population structure , 1996, Journal of bacteriology.

[11]  Daniel Falush,et al.  Sex and virulence in Escherichia coli: an evolutionary perspective , 2006, Molecular microbiology.

[12]  D. Dykhuizen,et al.  Clonal divergence in Escherichia coli as a result of recombination, not mutation. , 1994, Science.

[13]  B. Snel,et al.  Genome trees and the nature of genome evolution. , 2005, Annual review of microbiology.

[14]  G. Ehrlich,et al.  Bacterial plurality as a general mechanism driving persistence in chronic infections. , 2005, Clinical orthopaedics and related research.

[15]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[16]  J. M. Smith,et al.  Estimating recombinational parameters in Streptococcus pneumoniae from multilocus sequence typing data. , 2000, Genetics.

[17]  A. Witney,et al.  Microarrays Reveal that Each of the Ten Dominant Lineages of Staphylococcus aureus Has a Unique Combination of Surface-Associated and Regulatory Genes , 2006, Journal of bacteriology.

[18]  D. Michael Olive,et al.  Principles and Applications of Methods for DNA-Based Typing of Microbial Organisms , 1999, Journal of Clinical Microbiology.

[19]  M. Stanhope,et al.  Evolution of the core and pan-genome of Streptococcus: positive selection, recombination, and genome composition , 2007, Genome Biology.

[20]  E. Holmes,et al.  Recombination within natural populations of pathogenic bacteria: short-term empirical estimates and long-term phylogenetic consequences. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[21]  P. Andersen,et al.  Genetic and environmental influences on premature death in adult adoptees. , 1988, The New England journal of medicine.

[22]  T. Popović,et al.  Characterization of Encapsulated and Noncapsulated Haemophilus influenzae and Determination of Phylogenetic Relationships by Multilocus Sequence Typing , 2003, Journal of Clinical Microbiology.

[23]  Alex van Belkum,et al.  Role of Genomic Typing in Taxonomy, Evolutionary Genetics, and Microbial Epidemiology , 2001, Clinical Microbiology Reviews.

[24]  Pascal Lapierre,et al.  Estimating the size of the bacterial pan-genome. , 2009, Trends in genetics : TIG.

[25]  W. Hanage,et al.  eBURST: Inferring Patterns of Evolutionary Descent among Clusters of Related Bacterial Genotypes from Multilocus Sequence Typing Data , 2004, Journal of bacteriology.

[26]  D. Falush,et al.  Inference of Bacterial Microevolution Using Multilocus Sequence Data , 2007, Genetics.

[27]  B. Spratt,et al.  Recombination and the population structures of bacterial pathogens. , 2001, Annual review of microbiology.