The phylogenetic extent of metabolic enzymes and pathways.

The evolution of metabolic enzymes and pathways has been a subject of intense study for more than half a century. Yet, so far, previous studies have focused on a small number of enzyme families or biochemical pathways. Here, we examine the phylogenetic distribution of the full-known metabolic complement of Escherichia coli, using sequence comparison against taxa-specific databases. Half of the metabolic enzymes have homologs in all domains of life, representing families involved in some of the most fundamental cellular processes. We thus show for the first time and in a comprehensive way that metabolism is conserved at the enzyme level. In addition, our analysis suggests that despite the sequence conservation and the extensive phylogenetic distribution of metabolic enzymes, their groupings into biochemical pathways are much more variable than previously thought.

[1]  J A Eisen,et al.  Microbial Genes in the Human Genome: Lateral Transfer or Gene Loss? , 2001, Science.

[2]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Peter D. Karp,et al.  HinCyc: A Knowledge Base of the Complete Genome and Metabolic Pathways of H. influenzae , 1996, ISMB.

[4]  C. Ouzounis,et al.  Recent developments and future directions in computational genomics , 2000, FEBS letters.

[5]  Michael J. Stanhope,et al.  Universal trees based on large combined protein sequence data sets , 2001, Nature Genetics.

[6]  Michael J. Stanhope,et al.  Phylogenetic analyses do not support horizontal gene transfers from bacteria to vertebrates , 2001, Nature.

[7]  P D Karp,et al.  Global properties of the metabolic map of Escherichia coli. , 2000, Genome research.

[8]  Thure Etzold,et al.  SRS - an indexing and retrieval tool for flat file data libraries , 1993, Comput. Appl. Biosci..

[9]  R. Doolittle,et al.  Determining Divergence Times of the Major Kingdoms of Living Organisms with a Protein Clock , 1996, Science.

[10]  C. Chothia,et al.  The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. , 2001, Journal of molecular biology.

[11]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[12]  P D Karp,et al.  Pathway Databases: A Case Study in Computational Symbolic Theories , 2001, Science.

[13]  C. Woese,et al.  Phylogenetic structure of the prokaryotic domain: The primary kingdoms , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[14]  K. Struhl Fundamentally Different Logic of Gene Regulation in Eukaryotes and Prokaryotes , 1999, Cell.

[15]  Peter D. Karp,et al.  The EcoCyc and MetaCyc databases , 2000, Nucleic Acids Res..

[16]  C. Woese The universal ancestor. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[18]  Sophia Tsoka,et al.  Prediction of protein interactions: metabolic enzymes are frequently involved in gene fusion , 2000, Nature Genetics.

[19]  R F Doolittle,et al.  Determining divergence times with a protein clock: update and reevaluation. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[20]  K. Schulten,et al.  Phylogenetic Analysis of Metabolic Pathways , 2001, Journal of Molecular Evolution.

[21]  Jonathan P. Bollback,et al.  Bayesian Inference of Phylogeny and Its Impact on Evolutionary Biology , 2001, Science.

[22]  S Tsoka,et al.  Functional versatility and molecular diversity of the metabolic map of Escherichia coli. , 2001, Genome research.

[23]  P. Bork,et al.  Variation and evolution of the citric-acid cycle: a genomic perspective. , 1999, Trends in microbiology.

[24]  Francis Crick,et al.  The Statue Within , 1992 .

[25]  J. V. Van Etten,et al.  Chlorella virus PBCV-1 encodes a functional homospermidine synthase. , 1999, Virology.

[26]  I. Crawford,et al.  Evolution of a biosynthetic pathway: the tryptophan paradigm. , 1989, Annual review of microbiology.

[27]  Chris Sander,et al.  CAST: an iterative algorithm for the complexity analysis of sequence tracts , 2000, Bioinform..

[28]  R. Jensen Enzyme recruitment in evolution of new function. , 1976, Annual review of microbiology.

[29]  B. Snel,et al.  Pathway alignment: application to the comparative analysis of glycolytic enzymes. , 1999, The Biochemical journal.

[30]  Nikos Kyrpides,et al.  Universal Protein Families and the Functional Content of the Last Universal Common Ancestor , 1999, Journal of Molecular Evolution.