Phylogenomics and protein signatures elucidating the evolutionary relationships among the Gammaproteobacteria.

The class Gammaproteobacteria, which forms one of the largest groups within bacteria, is currently distinguished from other bacteria solely on the basis of its branching in phylogenetic trees. No molecular or biochemical characteristic is known that is unique to the class Gammaproteobacteria or its different subgroups (orders). The relationship among different orders of gammaproteobacteria is also not clear. In this study, we present detailed phylogenomic and comparative genomic analyses on gammaproteobacteria that clarify some of these issues. Phylogenetic trees based on concatenated sequences for 13 and 36 universally distributed proteins were constructed for 45 members of the class Gammaproteobacteria covering 13 of its 14 orders. In these trees, species from a number of the subgroups formed distinct clades and their relative branching order was indicated as follows (from the most recent to the earliest diverging): Enterobacteriales >Pasteurellales >Vibrionales, Aeromonadales >Alteromonadales >Oceanospirillales, Pseudomonadales >Chromatiales, Legionellales, Methylococcales, Xanthomonadales, Cardiobacteriales, Thiotrichales. Four conserved indels in four widely distributed proteins that are specific for gammaproteobacteria are also described. A 2 aa deletion in 5'-phosphoribosyl-5-aminoimidazole-4-carboxamide transformylase (AICAR transformylase; PurH) was a distinctive characteristic of all gammaproteobacteria (except Francisella tularensis). Two other conserved indels (a 4 aa deletion in RNA polymerase beta-subunit and a 1 aa deletion in ribosomal protein L16) were found uniquely in various species of the orders Enterobacteriales, Pasteurellales, Vibrionales, Aeromonadales and Alteromonadales, but were not found in other gammaproteobacteria. Lastly, a 2 aa deletion in leucyl-tRNA synthetase was commonly present in the above orders of the class Gammaproteobacteria and also in some members of the order Oceanospirillales. The presence of the conserved indels in these gammaproteobacterial orders indicates that species from these orders shared a common ancestor that was separate from other bacteria, a suggestion that is supported by phylogenetic studies. Systematic blastp searches were also conducted on various open reading frames (ORFs) in the genome of Escherichia coli K-12. These analyses identified 75 proteins that were unique to most members of the class Gammaproteobacteria or were restricted to species from some of its main orders (Enterobacteriales; Enterobacteriales and Pasteurellales; Enterobacteriales, Pasteurellales, Vibrionales, Aeromonadales and Alteromonadales; and the Enterobacteriales, Pasteurellales, Vibrionales, Aeromonadales, Alteromonadales, Oceanospirillales and Pseudomonadales etc.). The genes for these proteins have evolved at various stages during the evolution of gammaproteobacteria and their species distribution pattern, in conjunction with other results presented here, provide valuable information regarding the evolutionary relationships among these bacteria.

[1]  Andrés Moya,et al.  Genome Rearrangement Distances and Gene Order Phylogeny in γ-Proteobacteria , 2005 .

[2]  M. Nei,et al.  MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. , 2007, Molecular biology and evolution.

[3]  S. Kanaya,et al.  Sequential binding of SeqA protein to nascent DNA segments at replication forks in synchronized cultures of Escherichia coli , 2004, Molecular microbiology.

[4]  J. Thompson,et al.  Multiple sequence alignment with Clustal X. , 1998, Trends in biochemical sciences.

[5]  T. Kunisawa Gene arrangements and phylogeny in the class Proteobacteria. , 2001, Journal of theoretical biology.

[6]  K. Holsinger The neutral theory of molecular evolution , 2004 .

[7]  Antoine Danchin,et al.  How essential are nonessential genes? , 2005, Molecular biology and evolution.

[8]  J. Côté,et al.  Phylogenetic analysis of gamma-proteobacteria inferred from nucleotide sequence comparisons of the house-keeping genes adk, aroE and gdh: comparisons with phylogeny inferred from 16S rRNA gene sequences. , 2006, The Journal of general and applied microbiology.

[9]  N. W. Davis,et al.  The complete genome sequence of Escherichia coli K-12. , 1997, Science.

[10]  R. Ghirlando,et al.  MukE and MukF Form Two Distinct High Affinity Complexes* , 2007, Journal of Biological Chemistry.

[11]  Doolittle Wf Phylogenetic Classification and the Universal Tree , 1999 .

[12]  Jeremy D. Glasner,et al.  Systematic Mutagenesis of the Escherichia coli Genome , 2004, Journal of bacteriology.

[13]  Alyssa C. Bumbaugh,et al.  Inferences from whole-genome sequences of bacterial pathogens. , 2002, Current opinion in genetics & development.

[14]  R. Overbeek,et al.  The winds of (evolutionary) change: breathing new life into microbiology. , 1996, Journal of bacteriology.

[15]  Jon R. Armstrong,et al.  Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli: a comparative genomics approach. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Michael Y. Galperin,et al.  Prokaryotic genomes: the emerging paradigm of genome-based microbiology. , 1997, Current opinion in genetics & development.

[17]  E. Stackebrandt,et al.  Proteobacteria classis nov., a Name for the Phylogenetic Taxon That Includes the “Purple Bacteria and Their Relatives” , 1988 .

[18]  J. Palmer,et al.  Animals and fungi are each other's closest relatives: congruent evidence from multiple proteins. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Hans-Peter Klenk,et al.  Overview: A Phylogenetic Backbone and Taxonomic Framework for Procaryotic Systematics , 2015 .

[20]  Yves Van de Peer,et al.  TREECON for Windows: a software package for the construction and drawing of evolutionary trees for the Microsoft Windows environment , 1994, Comput. Appl. Biosci..

[21]  Martin Vingron,et al.  TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing , 2002, Bioinform..

[22]  W. Doolittle,et al.  Prokaryotic evolution in light of gene transfer. , 2002, Molecular biology and evolution.

[23]  R. Gupta,et al.  The phylogeny of proteobacteria: relationships to other eubacterial phyla and eukaryotes. , 2000, FEMS microbiology reviews.

[24]  J. W. Campbell,et al.  Experimental Determination and System Level Analysis of Essential Genes in Escherichia coli MG1655 , 2003, Journal of bacteriology.

[25]  H. Ochman,et al.  Bacterial genomes as new gene homes: the genealogy of ORFans in E. coli. , 2004, Genome research.

[26]  J. Setubal,et al.  Comparative genomic analysis of plant-associated bacteria. , 2002, Annual review of phytopathology.

[27]  in chief George M. Garrity Bergey’s Manual® of Systematic Bacteriology , 1989, Springer New York.

[28]  S. Carroll,et al.  Genome-scale approaches to resolving incongruence in molecular phylogenies , 2003, Nature.

[29]  H. Philippe,et al.  Ancient phylogenetic relationships. , 2002, Theoretical population biology.

[30]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[31]  Radhey S. Gupta,et al.  Phylogenomics and signature proteins for the alpha Proteobacteria and its main groups , 2007, BMC Microbiology.

[32]  Guy Plunkett,et al.  Comparative Genomics of Salmonellaenterica Serovar Typhi Strains Ty2 and CT18 , 2003, Journal of bacteriology.

[33]  G. Olsen,et al.  Comparative genomics of closely related salmonellae. , 2002, Trends in microbiology.

[34]  Radhey S. Gupta,et al.  Signature proteins that are distinctive characteristics of Actinobacteria and their subgroups , 2006, Antonie van Leeuwenhoek.

[35]  A. Witney,et al.  Application of Comparative Phylogenomics To Study the Evolution of Yersinia enterocolitica and To Identify Genetic Differences Relating to Pathogenicity , 2006, Journal of bacteriology.

[36]  Radhey S. Gupta,et al.  Phylogeny and molecular signatures (conserved proteins and indels) that are specific for the Bacteroidetes and Chlorobi species , 2007, BMC Evolutionary Biology.

[37]  L. Orgel,et al.  Phylogenetic Classification and the Universal Tree , 1999 .

[38]  Wei Qian,et al.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. , 2000, Molecular biology and evolution.

[39]  Emma Griffiths,et al.  BLAST screening of chlamydial genomes to identify signature proteins that are unique for the Chlamydiales, Chlamydiaceae, Chlamydophila and Chlamydia groups of species , 2006, BMC Genomics.

[40]  Radhey S. Gupta Molecular signatures (unique proteins and conserved indels) that are specific for the epsilon proteobacteria (Campylobacterales) , 2006, BMC Genomics.

[41]  Peter H. A. Sneath,et al.  Application of the Character Compatibility Approach to Generalized Molecular Sequence Data: Branching Order of the Proteobacterial Subdivisions , 2006, Journal of Molecular Evolution.

[42]  Michael Y. Galperin,et al.  The COG database: a tool for genome-scale analysis of protein functions and evolution , 2000, Nucleic Acids Res..

[43]  Peter F. Hallin,et al.  Ten years of bacterial genome sequencing: comparative-genomics-based discoveries , 2006, Functional & Integrative Genomics.

[44]  J. Ley The Proteobacteria. Ribosomal RNA cistron similarities and bacterial taxonomy , 1992 .

[45]  James R Brown,et al.  Phylogeny of gamma-proteobacteria: resolution of one branch of the universal tree? , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.

[46]  C R Woese,et al.  The phylogeny of purple bacteria: the alpha subdivision. , 1984, Systematic and applied microbiology.

[47]  Thomas L. Madden,et al.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. , 2001, Nucleic acids research.

[48]  J A Lake,et al.  Evidence that eukaryotes and eocyte prokaryotes are immediate relatives. , 1992, Science.

[49]  P. Holland,et al.  Rare genomic changes as a tool for phylogenetics. , 2000, Trends in ecology & evolution.

[50]  S. Karlin,et al.  Genomic comparisons among γ‐proteobacteria , 2006 .

[51]  B. Snel,et al.  Toward Automatic Reconstruction of a Highly Resolved Tree of Life , 2006, Science.

[52]  Eugene Rosenberg,et al.  Introduction to the Proteobacteria , 2004 .

[53]  Radhey S. Gupta Protein Phylogenies and Signature Sequences: A Reappraisal of Evolutionary Relationships among Archaebacteria, Eubacteria, and Eukaryotes , 1998, Microbiology and Molecular Biology Reviews.

[54]  N. Moran,et al.  From Gene Trees to Organismal Phylogeny in Prokaryotes:The Case of the γ-Proteobacteria , 2003, PLoS biology.

[55]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.