Whole-genome sequence comparison as a method for improving bacterial species definition.

We compared pairs of 1,226 bacterial strains with whole genome sequences and calculated their average nucleotide identity (ANI) between genomes to determine whether whole genome comparison can be directly used for bacterial species definition. We found that genome comparisons of two bacterial strains from the same species (SGC) have a significantly higher ANI than those of two strains from different species (DGC), and that the ANI between the query and the reference genomes can be used to determine whether two genomes come from the same species. Bacterial species definition based on ANI with a cut-off value of 0.92 matched well (81.5%) with the current bacterial species definition. The ANI value was shown to be consistent with the standard for traditional bacterial species definition, and it could be used in bacterial taxonomy for species definition. A new bioinformatics program (ANItools) was also provided in this study for users to obtain the ANI value of any two bacterial genome pairs (http://genome.bioinfo-icdc.org/). This program can match a query strain to all bacterial genomes, and identify the highest ANI value of the strain at the species, genus and family levels respectively, providing valuable insights for species definition.

[1]  M. Achtman,et al.  Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[2]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[3]  W. Ludwig,et al.  Notes on the characterization of prokaryote strains for taxonomic purposes. , 2010, International journal of systematic and evolutionary microbiology.

[4]  P. Vandamme,et al.  DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. , 2007, International journal of systematic and evolutionary microbiology.

[5]  K. Konstantinidis,et al.  Genomic insights that advance the species definition for prokaryotes. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[6]  S. Giovannoni,et al.  The uncultured microbial majority. , 2003, Annual review of microbiology.

[7]  K. Konstantinidis,et al.  Toward a More Robust Assessment of Intraspecies Diversity, Using Fewer Genetic Markers , 2006, Applied and Environmental Microbiology.

[8]  Meriem El Karoui,et al.  A Genomic Distance Based on MUM Indicates Discontinuity between Most Bacterial Species and Genera , 2008, Journal of bacteriology.

[9]  S. Quake,et al.  Single-cell genomics , 2011, Nature Methods.

[10]  P. Vandamme,et al.  Microbial systematics and taxonomy: relevance for a microbial commons. , 2010, Research in microbiology.

[11]  R. Amann,et al.  The species concept for prokaryotes. , 2013, FEMS microbiology reviews.

[12]  R. Rosselló-Móra,et al.  Shifting the genomic gold standard for the prokaryotic species definition , 2009, Proceedings of the National Academy of Sciences.

[13]  George E. Fox,et al.  Comparative Cataloging of 16S Ribosomal Ribonucleic Acid: Molecular Approach to Procaryotic Systematics , 1977 .