The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli.

The 106 small molecule metabolic (SMM) pathways in Escherichia coli are formed by the protein products of 581 genes. We can define 722 domains, nearly all of which are homologous to proteins of known structure, that form all or part of 510 of these proteins. This information allows us to answer general questions on the structural anatomy of the SMM pathway proteins and to trace family relationships and recruitment events within and across pathways. Half the gene products contain a single domain and half are formed by combinations of between two and six domains. The 722 domains belong to one of 213 families that have between one and 51 members. Family members usually conserve their catalytic or cofactor binding properties; substrate recognition is rarely conserved. Of the 213 families, members of only a quarter occur in isolation, i.e. they form single-domain proteins. Most members of the other families combine with domains from just one or two other families and a few more versatile families can combine with several different partners. Excluding isoenzymes, more than twice as many homologues are distributed across pathways as within pathways. However, serial recruitment, with two consecutive enzymes both being recruited to another pathway, is rare and recruitment of three consecutive enzymes is not observed. Only eight of the 106 pathways have a high number of homologues. Homology between consecutive pairs of enzymes with conservation of the main substrate-binding site but change in catalytic mechanism (which would support a simple model of retrograde pathway evolution) occurs only six times in the whole set of enzymes. Most of the domains that form SMM pathways have homologues in non-SMM pathways. Taken together, these results imply a pervasive "mosaic" model for the formation of protein repertoires and pathways.

[1]  Calvin B. Bridges,et al.  SALIVARY CHROMOSOME MAPSWith a Key to the Banding of the Chromosomes of Drosophila Melanogaster , 1935 .

[2]  N H Horowitz,et al.  On the Evolution of Biochemical Syntheses. , 1945, Proceedings of the National Academy of Sciences of the United States of America.

[3]  E. Lewis Pseudoallelism and gene evolution. , 1951, Cold Spring Harbor symposia on quantitative biology.

[4]  N. Horowitz,et al.  The Evolution of Biochemical Syntheses — Retrospect and Prospect , 1965 .

[5]  B. Hartley Homologies in serine proteinases. , 1970, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[6]  Michael G. Rossmann,et al.  Chemical and biological evolution of a nucleotide-binding protein , 1974, Nature.

[7]  R. Jensen Enzyme recruitment in evolution of new function. , 1976, Annual review of microbiology.

[8]  Gregory A. Petsko,et al.  Mandelate racemase and muconate lactonizing enzyme are mechanistically distinct and structurally homologous , 1990, Nature.

[9]  M Wilmanns,et al.  Structural conservation in parallel beta/alpha-barrel enzymes that catalyze three sequential reactions in the pathway of tryptophan biosynthesis. , 1991, Biochemistry.

[10]  G. Petsko,et al.  On the origin of enzymatic species. , 1993, Trends in biochemical sciences.

[11]  A. Murzin Can homologous proteins evolve different enzymatic activities? , 1993, Trends in biochemical sciences.

[12]  Tipton Kf,et al.  Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB). Enzyme nomenclature. Recommendations 1992. Supplement: corrections and additions. , 1994 .

[13]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[14]  Patricia C. Babbitt,et al.  Understanding Enzyme Superfamilies , 1997, The Journal of Biological Chemistry.

[15]  Monica Riley,et al.  Genes and proteins of Escherichia coli K-12 (GenProtEC) , 1997, Nucleic Acids Res..

[16]  Sarah A. Teichmann,et al.  DIVCLUS: an automatic method in the GEANFAMMER package that finds homologous domains in single- and multi-domain proteins , 1998, Bioinform..

[17]  Monica Riley,et al.  Genes and proteins of Escherichia coli K-12 , 1998, Nucleic Acids Res..

[18]  C. Chothia,et al.  Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[19]  D. Haussler,et al.  Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. , 1998, Journal of molecular biology.

[20]  H. Ochman,et al.  Molecular archaeology of the Escherichia coli genome. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Richard Hughey,et al.  Hidden Markov models for detecting remote protein homologies , 1998, Bioinform..

[22]  C. Orengo,et al.  Evolution of protein function, from a structural perspective. , 1999, Current opinion in chemical biology.

[23]  P. Bork,et al.  Variation and evolution of the citric-acid cycle: a genomic perspective. , 1999, Trends in microbiology.

[24]  Michael Y. Galperin,et al.  Comparative genomics of the Archaea (Euryarchaeota): evolution of conserved protein families, the stable core, and the variable shell. , 1999, Genome research.

[25]  B. Snel,et al.  Pathway alignment: application to the comparative analysis of glycolytic enzymes. , 1999, The Biochemical journal.

[26]  P. Bork,et al.  Homology among (betaalpha)(8) barrels: implications for the evolution of metabolic pathways. , 2000, Journal of molecular biology.

[27]  Cyrus Chothia,et al.  Optimal Hidden Markov Models for All Sequences of Known Structure , 2000 .

[28]  P. Bork,et al.  Homology among (βα) 8 barrels: implications for the evolution of metabolic pathways 1 1Edited by G. Von Heijne , 2000 .

[29]  S. Copley,et al.  Evolution of a metabolic pathway for degradation of a toxic xenobiotic: the patchwork approach. , 2000, Trends in biochemical sciences.

[30]  M Wilmanns,et al.  Structural evidence for evolution of the beta/alpha barrel scaffold by gene duplication and fusion. , 2000, Science.

[31]  B. Snel,et al.  Gene and context: integrative approaches to genome analysis. , 2000, Advances in protein chemistry.

[32]  Annabel E. Todd,et al.  Evolution of function in protein superfamilies, from a structural perspective. , 2001, Journal of molecular biology.

[33]  M. Riley,et al.  Divergence of function in sequence-related groups of Escherichia coli proteins. , 2001, Genome research.

[34]  Matthias Wilmanns,et al.  Structural Evidence for Evolution of the b / a Barrel Scaffold by Gene Duplication and Fusion , 2022 .