Functional genomics and enzyme evolution

Computational analysis of complete genomes, followed by experimental testing of emerging hypotheses--the area of research often referred to as 'functional genomics'--aims at deciphering the wealth of information contained in genome sequences and at using it to improve our understanding of the mechanisms of cell function. This review centers on the recent progress in the genome analysis with special emphasis on the new insights in enzyme evolution. Standard methods of predicting functions for new proteins are listed and the common errors in their application are discussed. A new method of improving the functional predictions is introduced, based on a phylogenetic approach to functional prediction, as implemented in the recently constructed Clusters of Orthologous Groups (COG) database (available at http:@www.ncbi.nlm.nih.gov/COG). This approach provides a convenient way to characterize the protein families (and metabolic pathways) that are present or absent in any given organism. Comparative analysis of microbial genomes based on this approach shows that metabolic diversity generally correlates with the genome size-parasitic bacteria code for fewer enzymes and lesser number of metabolic pathways than their free-living relatives. Comparison of different genomes reveals another evolutionary trend, the non-orthologous gene displacement of some enzymes by unrelated proteins with the same cellular function. An examination of the phylogenetic distribution of such cases provides new clues to the problems of biochemical evolution, including evolution of glycolysis and the TCA cycle.

[1]  Amos Bairoch,et al.  The PROSITE database, its status in 1999 , 1999, Nucleic Acids Res..

[2]  V. K. Eugene Beyond the complete genomes : from sequences to structure and function , 1998 .

[3]  Michael Y. Galperin,et al.  Using metabolic pathway databases for functional annotation. , 1998, Trends in genetics : TIG.

[4]  Michael Y. Galperin,et al.  A superfamily of metalloenzymes unifies phosphopentomutase and cofactor‐independent phosphoglycerate mutase with alkaline phosphatases and sulfatases , 1998, Protein science : a publication of the Protein Society.

[5]  Michael Y. Galperin,et al.  Analogous enzymes: independent inventions in enzyme evolution. , 1998, Genome research.

[6]  C. Clayton,et al.  Helicobacter pylori porCDAB and oorDABCGenes Encode Distinct Pyruvate:Flavodoxin and 2-Oxoglutarate:Acceptor Oxidoreductases Which Mediate Electron Transport to NADP , 1998, Journal of bacteriology.

[7]  Michael Y. Galperin,et al.  Sources of systematic error in functional annotation of genomes: domain rearrangement, non-orthologous gene displacement, and operon disruption , 1998, Silico Biol..

[8]  Sean R. Eddy,et al.  Pfam: multiple sequence alignments and HMM-profiles of protein domains , 1998, Nucleic Acids Res..

[9]  Terri K. Attwood,et al.  The PRINTS protein fingerprint database in its fifth year , 1998, Nucleic Acids Res..

[10]  Shmuel Pietrokovski,et al.  Superior performance in protein homology detection with the Blocks Database servers , 1998, Nucleic Acids Res..

[11]  Jérôme Gouzy,et al.  The ProDom database of protein domain families , 1998, Nucleic Acids Res..

[12]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[13]  N. W. Davis,et al.  The complete genome sequence of Escherichia coli K-12. , 1997, Science.

[14]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[15]  Michael Y. Galperin,et al.  Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea , 1997, Molecular microbiology.

[16]  R. Overbeek,et al.  Representation of function: the next step. , 1997, Gene.

[17]  H. Nakashita,et al.  Studies on the biosynthesis of bialaphos. Biochemical mechanism of C-P bond formation: discovery of phosphonopyruvate decarboxylase which catalyzes the formation of phosphonoacetaldehyde from phosphonopyruvate. , 1997, The Journal of antibiotics.

[18]  A. Bairoch,et al.  The PROSITE database, its status in 1997 , 1997, Nucleic Acids Res..

[19]  W. Hunter,et al.  The crystal structure of a class II fructose-1,6-bisphosphate aldolase shows a novel binuclear metal-binding active site embedded in a familiar fold. , 1996, Structure.

[20]  E. Koonin,et al.  A minimal gene set for cellular life derived by comparison of complete bacterial genomes. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[21]  P. Bork,et al.  Non-orthologous gene displacement. , 1996, Trends in genetics : TIG.

[22]  J W Fickett,et al.  Finding genes by computer: the state of the art. , 1996, Trends in genetics : TIG.

[23]  T. Conway,et al.  Evolution of carbohydrate metabolic pathways. , 1996, Research in microbiology.

[24]  C Ouzounis,et al.  Novelties from the complete genome of Mycoplasma genitalium , 1996, Molecular microbiology.

[25]  P. Labbé,et al.  Cloning and Characterization of the Yeast HEM14 Gene Coding for Protoporphyrinogen Oxidase, the Molecular Target of Diphenyl Ether-type Herbicides (*) , 1996, The Journal of Biological Chemistry.

[26]  P. Bork,et al.  Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli , 1996, Current Biology.

[27]  C R Woese,et al.  Methanococcus jannaschii genome: revisited. , 1996, Microbial & comparative genomics.

[28]  W. D. de Vos,et al.  Purification and Characterization of a Novel ADP-dependent Glucokinase from the Hyperthermophilic Archaeon Pyrococcus furiosus(*) , 1995, The Journal of Biological Chemistry.

[29]  G. Weinstock,et al.  Generation of auxotrophic mutants of Enterococcus faecalis , 1995, Journal of bacteriology.

[30]  R. Fleischmann,et al.  The Minimal Gene Complement of Mycoplasma genitalium , 1995, Science.

[31]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[32]  Y. Takada,et al.  Differential expression in Escherichia coli of the Vibrio sp. strain ABE-1 icdI and icdII genes encoding structurally different isocitrate dehydrogenase isozymes , 1995, Journal of bacteriology.

[33]  H. Nakashita,et al.  The carboxyphosphonoenolpyruvate synthase-encoding gene from the bialaphos-producing organism Streptomyces hygroscopicus. , 1995, Gene.

[34]  M. Brendel,et al.  Cloning, sequencing, and characterizing the Lactobacillus leichmannii pyrC gene encoding dihydroorotase. , 1995, Biochimie.

[35]  John C. Wootton,et al.  Non-globular Domains in Protein Sequences: Automated Segmentation Using Complexity Measures , 1994, Comput. Chem..

[36]  M. Adams,et al.  Pyruvate ferredoxin oxidoreductases of the hyperthermophilic archaeon, Pyrococcus furiosus, and the hyperthermophilic bacterium, Thermotoga maritima, have different catalytic mechanisms. , 1994, Biochemistry.

[37]  C. Sander,et al.  Convergent evolution of similar enzymatic function on different protein folds: The hexokinase, ribokinase, and galactokinase families of sugar kinases , 1993, Protein science : a publication of the Protein Society.

[38]  P. Michels,et al.  Evolution of glycolysis. , 1993, Progress in biophysics and molecular biology.

[39]  S. Ehrlich,et al.  Branched-chain amino acid biosynthesis genes in Lactococcus lactis subsp. lactis , 1992, Journal of bacteriology.

[40]  J. Marsh,et al.  Fructose-bisphosphate aldolases: an evolutionary history. , 1992, TIBS -Trends in Biochemical Sciences. Regular ed.

[41]  R. Switzer,et al.  Functional organization and nucleotide sequence of the Bacillus subtilis pyrimidine biosynthetic operon. , 1991, The Journal of biological chemistry.

[42]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[43]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[44]  L. Fothergill-Gilmore Evolution in glycolysis. , 1987, Biochemical Society transactions.

[45]  H. Gest Evolutionary roots of the citric acid cycle in prokaryotes. , 1987, Biochemical Society symposium.

[46]  T. B. Powers,et al.  Iron superoxide dismutase from Escherichia coli at 3.1-A resolution: a structure unlike that of copper/zinc protein at both monomer and dimer levels. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[47]  G. Pons,et al.  Phylogeny and ontogeny of the phosphoglycerate mutases--IV. Distribution of glycerate-2,3-P2 dependent and independent phosphoglycerate mutases in algae, fungi, plants and animals. , 1982, Comparative biochemistry and physiology. B, Comparative biochemistry.

[48]  F. Daldal,et al.  Tn10 insertions in the pfkB region of Escherichia coli , 1981, Journal of bacteriology.

[49]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[50]  P. Weitzman Unity and diversity in some bacterial citric acid-cycle enzymes. , 1981, Advances in microbial physiology.

[51]  W. Rutter,et al.  EVOLUTION OF ALDOLASE. , 1964, Federation proceedings.