Functional Classes in the Three Domains of Life

Abstract. The evolutionary divergence among the three major domains of life can now be addressed through the first set of complete genomes from representative species. These model species from the three domains of life, Haemophilus influenzae for Bacteria, Saccharomyces cerevisiae for Eukarya, and Methanococcus jannaschii for Archaea, provide the basis for a universal functional classification and analysis. We have chosen 13 functional classes and three superclasses (ENERGY, COMMUNICATION and INFORMATION) as global descriptors of protein function. Compositional comparison of the three complete genomes reveals that functional classes are ubiquitous yet diverse in the three domains of life. Proteins related with ENERGY processes are generally represented in all three domains, while those related with COMMUNICATION represent the most distinctive functional feature of each single domain. Finally, functions related with INFORMATION processing (translation, transcription, and replication) show a complex behaviour. In Archaea, proteins in this superclass are related with proteins in either Eukarya or Bacteria, as recognized previously. The distribution of functional classes in the three domains accurately reflects the principal characteristics of cellular life forms.

[1]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[2]  C. Sander,et al.  Database of homology‐derived protein structures and the structural meaning of sequence alignment , 1991, Proteins.

[3]  M. G. Kidwell Lateral transfer in natural populations of eukaryotes. , 1993, Annual review of genetics.

[4]  M. Riley,et al.  Functions of the gene products of Escherichia coli , 1993, Microbiological reviews.

[5]  Sequence of the Schizosaccharomyces pombe gtp1 gene and identification of a novel family of putative GTP-binding proteins. , 1993, Gene.

[6]  C. Sander,et al.  Yeast chromosome III: new gene functions. , 1994, The EMBO journal.

[7]  L. Kroos,et al.  Regulation of the transcription of a cluster of Bacillus subtilis spore coat genes. , 1994, Journal of molecular biology.

[8]  A. Haenni,et al.  A highly conserved eukaryotic protein family possessing properties of polypeptide chain release factor , 1994, Nature.

[9]  R. Fleischmann,et al.  The Minimal Gene Complement of Mycoplasma genitalium , 1995, Science.

[10]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[11]  C. Sander,et al.  Challenging times for bioinformatics , 1995, Nature.

[12]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[13]  C Sander,et al.  Bioinformatics and the discovery of gene function. , 1996, Trends in genetics : TIG.

[14]  S. Henikoff,et al.  Blocks database and its applications. , 1996, Methods in enzymology.

[15]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its new supplement TREMBL , 1996, Nucleic Acids Res..

[16]  C Ouzounis,et al.  The emergence of major cellular processes in evolution , 1996, FEBS letters.

[17]  P. Bork,et al.  Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli , 1996, Current Biology.

[18]  C. Sander,et al.  Genequiz II: Automatic Function Assignment For Genome Sequence Analysis , 1996 .

[19]  W. Pearson Effective protein sequence comparison. , 1996, Methods in enzymology.

[20]  C Ouzounis,et al.  Novelties from the complete genome of Mycoplasma genitalium , 1996, Molecular microbiology.

[21]  C Ouzounis,et al.  Genomes with distinct function composition , 1996, FEBS letters.

[22]  R. Fleischmann,et al.  Complete Genome Sequence of the Methanogenic Archaeon, Methanococcus jannaschii , 1996, Science.

[23]  E V Koonin,et al.  Complete genome sequences of cellular life forms: glimpses of theoretical evolutionary genomics. , 1996, Current opinion in genetics & development.

[24]  H. Hilbert,et al.  Comparative analysis of the genomes of the bacteria Mycoplasma pneumoniae and Mycoplasma genitalium. , 1997, Nucleic acids research.

[25]  H. Ochman,et al.  Amelioration of Bacterial Genomes: Rates of Change and Exchange , 1997, Journal of Molecular Evolution.

[26]  Miguel A. Andrade-Navarro,et al.  Sequence analysis of the Methanococcus jannaschii genome and the prediction of protein function , 1997, Comput. Appl. Biosci..

[27]  Amos Bairoch,et al.  The PROSITE database, its status in 1997 , 1997, Nucleic Acids Res..

[28]  H. Ochman,et al.  Molecular archaeology of the Escherichia coli genome. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[29]  T. Traut,et al.  A minimal gene set for cellular life derived by comparison of complete bacterial genomes , 1998 .

[30]  Chris Sander,et al.  EUCLID: automatic classification of proteins in functional classes by their database annotations , 1998, Bioinform..

[31]  Cathy H. Wu,et al.  The PIR-International Protein Sequence Database , 1999, Nucleic Acids Res..

[32]  Nikos Kyrpides,et al.  Universal Protein Families and the Functional Content of the Last Universal Common Ancestor , 1999, Journal of Molecular Evolution.

[33]  Rodrigo Lopez,et al.  The EMBL Nucleotide Sequence Database , 1999, Nucleic Acids Res..

[34]  Y. Ioannou Sequence Analysis , 2000, Science.