Gene sequences useful for predicting relatedness of whole genomes in bacteria.

Thirty-two protein-encoding genes that are widely distributed among bacterial genomes were tested for the usefulness of their DNA sequences in assigning bacterial strains to species. Publicly available data allowed 49 pairwise comparisons of whole bacterial genomes related at the genus or subgenus level. For eight of the genes, DNA sequence identity scores correlated strongly with overall sequence identity scores for the genome pairs. Even single-gene alignments predicted overall genome relatedness with a high degree of precision and accuracy, and predictions could be refined further by including two or three genes in the analysis. These results strongly support the proposal that sequence analysis of a small set of protein-encoding genes could reliably assign novel strains or isolates to bacterial species.
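
The comparison described above, single-gene identity scores set against whole-genome identity scores, can be illustrated with a minimal sketch. It assumes that the sequences are already aligned, that identity is scored as the percentage of matching ungapped columns, and that the relationship is summarized with a Pearson correlation and a least-squares line. The abstract does not state the scoring or statistics actually used, and all function names here are hypothetical.

```python
from math import sqrt


def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: columns with a gap ('-') in either sequence are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def pearson_r(xs, ys):
    """Pearson correlation between gene-level and genome-level scores."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def fit_line(xs, ys):
    """Least-squares slope and intercept for genome score ~ gene score."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def predict_genome_identity(gene_identity, slope, intercept):
    """Predict whole-genome identity from one gene's identity score."""
    return slope * gene_identity + intercept


if __name__ == "__main__":
    # Toy aligned fragments for illustration only, not data from the study.
    print(percent_identity("ACGT-A", "ACGA-A"))
```

In practice, `pearson_r` and `fit_line` would be applied per gene across all genome pairs, and a multi-gene prediction could average the per-gene predictions. That averaging step is also an assumption, not a description of the published analysis.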
