Superfamily active site templates

We show that three‐dimensional signatures consisting of only a few functionally important residues can be diagnostic of membership in superfamilies of enzymes. Using the enolase superfamily as a model system, we demonstrate that such a signature, or template, can identify superfamily members in structural databases with high sensitivity and specificity. This is remarkable because superfamilies can be highly diverse, with members catalyzing many different overall reactions; the unifying principle can be a conserved partial reaction or chemical capability. Our definition of a superfamily thus hinges on the disposition of residues involved in a conserved function, rather than on fold similarity alone. A clear advantage of basing structure searches on such active site templates rather than on fold similarity is the specificity with which superfamilies with distinct functional characteristics can be identified within a large set of proteins with the same fold, such as the (β/α)8 barrels. Preliminary results are presented for an additional group of enzymes with a different fold, the haloacid dehalogenase superfamily, suggesting that this approach may be generally useful for assigning reading frames of unknown function to specific superfamilies and thereby allowing inference of some of their functional properties. Proteins 2004;9999:000–000. © 2004 Wiley‐Liss, Inc.

[1]  R. Jensen Enzyme recruitment in evolution of new function. , 1976, Annual review of microbiology.

[2]  L. Lebioda,et al.  Crystal Structure of Holoenzyme Refined at 1.9 Angstroms Resolution: Trigonal-Bipyramidal Geometry of the Cation Binding Site , 1989 .

[3]  L. Lebioda,et al.  Refined structure of yeast apo-enolase at 2.25 A resolution. , 1990, Journal of molecular biology.

[4]  J. Brewer,et al.  Inhibition of enolase: the crystal structures of enolase-Ca2(+)- 2-phosphoglycerate and enolase-Zn2(+)-phosphoglycolate complexes at 2.2-A resolution. , 1992, Biochemistry.

[5]  G. Petsko,et al.  Mechanism of the reaction catalyzed by mandelate racemase. 2. Crystal structure of mandelate racemase at 2.5-A resolution: identification of the active site and possible catalytic residues. , 1991, Biochemistry.

[6]  L. Lebioda,et al.  Mechanism of enolase: the crystal structure of enolase-Mg2(+)-2-phosphoglycerate/phosphoenolpyruvate complex at 2.2-A resolution. , 1991, Biochemistry.

[7]  E. Webb Enzyme nomenclature 1992. Recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology on the Nomenclature and Classification of Enzymes. , 1992 .

[8]  E. Zhang,et al.  Fluoride inhibition of yeast enolase: Crystal structure of the enolase–Mg2+–F−–Pi complex at 2.6 Å resolution , 1993, Proteins.

[9]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[10]  G. Petsko,et al.  On the origin of enzymatic species. , 1993, Trends in biochemical sciences.

[11]  E V Koonin,et al.  Computer analysis of bacterial haloacid dehalogenases defines a large superfamily of hydrolases with diverse specificity. Application of an iterative approach to database search. , 1994, Journal of molecular biology.

[12]  G. H. Reed,et al.  Chelation of serine 39 to Mg2+ latches a gate at the active site of enolase: structure of the bis(Mg2+) complex of yeast enolase and the intermediate analog phosphonoacetohydroxamate at 2.1-A resolution. , 1994, Biochemistry.

[13]  P. Willett,et al.  A graph-theoretic approach to the identification of three-dimensional patterns of amino acid side-chains in protein structures. , 1994, Journal of molecular biology.

[14]  G L Kenyon,et al.  The role of lysine 166 in the mechanism of mandelate racemase from Pseudomonas putida: mechanistic and crystallographic evidence for stereospecific alkylation by (R)-alpha-phenylglycidate. , 1994, Biochemistry.

[15]  E. Zhang,et al.  Catalytic metal ion binding in enolase: the crystal structure of an enolase-Mn2+-phosphonoacetohydroxamate complex at 2.4-A resolution. , 1994 .

[16]  E. Zhang,et al.  Catalytic metal ion binding in enolase: the crystal structure of an enolase-Mn2+-phosphonoacetohydroxamate complex at 2.4-A resolution. , 1994, Biochemistry.

[17]  R. Nussinov,et al.  Three‐dimensional, sequence order‐independent structural comparison of a serine protease against the crystallographic database reveals active site similarities: Potential implications to evolution and to protein folding , 1994, Protein science : a publication of the Protein Society.

[18]  Crystal structure of chloromuconate cycloisomerase from Alcaligenes eutrophus JMP134 (pJP4) at 3 A resolution. , 1993, Acta crystallographica. Section D, Biological crystallography.

[19]  J. Janin,et al.  X-ray structure and catalytic mechanism of lobster enolase. , 1995, Biochemistry.

[20]  S. Bryant,et al.  Threading a database of protein cores , 1995, Proteins.

[21]  G. Petsko,et al.  Mechanism of the reaction catalyzed by mandelate racemase: importance of electrophilic catalysis by glutamic acid 317. , 1995, Biochemistry.

[22]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[23]  Mechanism of the reaction catalyzed by mandelate racemase: structure and mechanistic properties of the K166R mutant. , 1995, Biochemistry.

[24]  G. H. Reed,et al.  Octahedral coordination at the high-affinity metal site in enolase: crystallographic analysis of the MgII--enzyme complex from yeast at 1.9 A resolution. , 1995, Biochemistry.

[25]  A. Goldman,et al.  The refined X-ray structure of muconate lactonizing enzyme from Pseudomonas putida PRS2000 at 1.85 A resolution. , 1995, Journal of molecular biology.

[26]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[27]  G L Kenyon,et al.  Mechanism of the reaction catalyzed by mandelate racemase: structure and mechanistic properties of the D270N mutant. , 1995, Biochemistry.

[28]  G. H. Reed,et al.  A carboxylate oxygen of the substrate bridges the magnesium ions at the active site of enolase: structure of the yeast enzyme complexed with the equilibrium mixture of 2-phosphoglycerate and phosphoenolpyruvate at 1.8 A resolution. , 1996, Biochemistry.

[29]  W R Taylor,et al.  SSAP: sequential structure alignment program for protein structure comparison. , 1996, Methods in enzymology.

[30]  G. H. Reed,et al.  The enolase superfamily: a general strategy for enzyme-catalyzed abstraction of the alpha-protons of carboxylic acids. , 1996, Biochemistry.

[31]  G J Kleywegt,et al.  A re-evaluation of the crystal structure of chloromuconate cycloisomerase. , 1996, Acta crystallographica. Section D, Biological crystallography.

[32]  Y. Hata,et al.  Crystal structure of L-2-haloacid dehalogenase from Pseudomonas sp. YL. An alpha/beta hydrolase structure that is different from the alpha/beta hydrolase fold. , 1996, The Journal of biological chemistry.

[33]  J M Thornton,et al.  Derivation of 3D coordinate templates for searching structural databases: Application to ser‐His‐Asp catalytic triads in the serine proteinases and lipases , 1996, Protein science : a publication of the Protein Society.

[34]  I. S. Ridder,et al.  Three-dimensional Structure of l-2-Haloacid Dehalogenase from Xanthobacter autotrophicus GJ10 Complexed with the Substrate-analogue Formate* , 1997, The Journal of Biological Chemistry.

[35]  L. A. Carreira,et al.  Mechanism of enolase: the crystal structure of asymmetric dimer enolase-2-phospho-D-glycerate/enolase-phosphoenolpyruvate at 2.0 A resolution. , 1997, Biochemistry.

[36]  J. Thornton,et al.  Tess: A geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites , 1997, Protein science : a publication of the Protein Society.

[37]  Y. Hata,et al.  Crystal Structures of Reaction Intermediates ofl-2-Haloacid Dehalogenase and Implications for the Reaction Mechanism* , 1998, The Journal of Biological Chemistry.

[38]  P C Babbitt,et al.  Insights into the mechanism of catalysis by the P-C bond-cleaving enzyme phosphonoacetaldehyde hydrolase derived from gene sequence analysis and mutagenesis. , 1998, Biochemistry.

[39]  P C Babbitt,et al.  Evolution of an enzyme active site: the structure of a new crystal form of muconate lactonizing enzyme compared with mandelate racemase and enolase. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[40]  R. Russell,et al.  Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution. , 1998, Journal of molecular biology.

[41]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[42]  Michael Y. Galperin,et al.  The catalytic domain of the P-type ATPase has the haloacid dehalogenase fold. , 1998, Trends in biochemical sciences.

[43]  P C Babbitt,et al.  Mechanistically diverse enzyme superfamilies: the importance of chemistry in the evolution of catalysis. , 1998, Current opinion in chemical biology.

[44]  J. Skolnick,et al.  Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases. , 1998, Journal of molecular biology.

[45]  D L Brutlag,et al.  Modeling and superposition of multiple protein structures using affine transformations: analysis of the globins. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[46]  P C Babbitt,et al.  Evolution of enzymatic activities in the enolase superfamily: crystal structure of (D)-glucarate dehydratase from Pseudomonas putida. , 1998, Biochemistry.

[47]  M. Riley Systems for categorizing functions of gene products. , 1998, Current Opinion in Structural Biology.

[48]  A. Goldman,et al.  Structural basis for the activity of two muconate cycloisomerase variants toward substituted muconates , 1999, Proteins.

[49]  I. S. Ridder,et al.  Identification of the Mg2+-binding site in the P-type ATPase and phosphatase members of the HAD (haloacid dehalogenase) superfamily by structural similarity to the response regulator protein CheY. , 1999, The Biochemical journal.

[50]  G. Petsko,et al.  Evolution of enzymatic activities in the enolase superfamily: Identification of a 'new' general acid catalyst in the active site of D- galactonate dehydratase from Escherichia coli , 1999 .

[51]  S. Brenner Errors in genome annotation. , 1999, Trends in genetics : TIG.

[52]  G J Kleywegt,et al.  Recognition of spatial motifs in protein structures. , 1999, Journal of molecular biology.

[53]  I. S. Ridder,et al.  Crystal Structures of Intermediates in the Dehalogenation of Haloalkanoates by l-2-Haloacid Dehalogenase* , 1999, The Journal of Biological Chemistry.

[54]  D. Christianson,et al.  Detoxification of environmental mutagens and carcinogens: structure, mechanism, and evolution of liver epoxide hydrolase. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[55]  Karen N. Allen,et al.  The crystal structure of bacillus cereus phosphonoacetaldehyde hydrolase: insight into catalysis of phosphorus bond cleavage and catalytic diversification within the HAD enzyme superfamily. , 2000, Biochemistry.

[56]  M. Gerstein,et al.  Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores. , 2000, Journal of molecular biology.

[57]  J. Szustakowski,et al.  Protein structure alignment using a genetic algorithm , 2000, Proteins.

[58]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[59]  A. Goldman,et al.  Buried charged surface in proteins. , 2000, Structure.

[60]  B D Hammock,et al.  Binding of Alkylurea Inhibitors to Epoxide Hydrolase Implicates Active Site Tyrosines in Substrate Activation* , 2000, The Journal of Biological Chemistry.

[61]  P C Babbitt,et al.  New functions from old scaffolds: how nature reengineers enzymes for new functions. , 2000, Advances in protein chemistry.

[62]  I. Rayment,et al.  Evolution of enzymatic activity in the enolase superfamily: structure of o-succinylbenzoate synthase from Escherichia coli in complex with Mg2+ and o-succinylbenzoate. , 2000, Biochemistry.

[63]  I. Rayment,et al.  Evolution of enzymatic activities in the enolase superfamily: crystallographic and mutagenesis studies of the reaction catalyzed by D-glucarate dehydratase from Escherichia coli. , 2000, Biochemistry.

[64]  M. Nakasako,et al.  Crystal structure of the calcium pump of sarcoplasmic reticulum at 2.6 Å resolution , 2000, Nature.

[65]  A. Valencia,et al.  Intrinsic errors in genome annotation. , 2001, Trends in genetics : TIG.

[66]  S. Kim,et al.  BeF(3)(-) acts as a phosphate analog in proteins phosphorylated on aspartate: structure of a BeF(3)(-) complex with phosphoserine phosphatase. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[67]  I. Rayment,et al.  Evolution of enzymatic activities in the enolase superfamily: crystal structures of the L-Ala-D/L-Glu epimerases from Escherichia coli and Bacillus subtilis. , 2001, Biochemistry.

[68]  Annabel E. Todd,et al.  Evolution of function in protein superfamilies, from a structural perspective. , 2001, Journal of molecular biology.

[69]  J. Skolnick,et al.  Enhanced functional annotation of protein sequences via the use of structural descriptors. , 2001, Journal of structural biology.

[70]  B. Luisi,et al.  Crystal structure of the Escherichia coli RNA degradosome component enolase. , 2001, Journal of molecular biology.

[71]  C. Chothia,et al.  The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. , 2001, Journal of molecular biology.

[72]  I. Rayment,et al.  Evolution of enzymatic activities in the enolase superfamily: identification of the general acid catalyst in the active site of D-glucarate dehydratase from Escherichia coli. , 2001, Biochemistry.

[73]  P. Babbitt,et al.  Divergent evolution of enzymatic function: mechanistically diverse superfamilies and functionally distinct suprafamilies. , 2001, Annual review of biochemistry.

[74]  James H Naismith,et al.  The Structure of 3-Methylaspartase from Clostridium tetanomorphum Functions via the Common Enolase Chemical Step* , 2002, Journal of Biological Chemistry.

[75]  C. Orengo,et al.  Plasticity of enzyme active sites. , 2002, Trends in biochemical sciences.

[76]  M. Vidal,et al.  Structural genomics: A pipeline for providing structures for the biologist , 2002, Protein science : a publication of the Protein Society.

[77]  E. Eisenstein,et al.  From structure to function: YrbI from Haemophilus influenzae (HI1679) is a phosphatase , 2002, Proteins.

[78]  C. Orengo,et al.  One fold with many functions: the evolutionary relationships between TIM barrel families based on their sequences, structures and functions. , 2002, Journal of molecular biology.

[79]  Henry H Nguyen,et al.  Structural characterization of the reaction pathway in phosphoserine phosphatase: crystallographic "snapshots" of intermediate states. , 2002, Journal of molecular biology.

[80]  B. Stoddard,et al.  Structure of a tRNA repair enzyme and molecular biology workhorse: T4 polynucleotide kinase. , 2002, Structure.

[81]  G. Kleywegt,et al.  Interactive motif and fold recognition in protein structures , 2002 .

[82]  Guofeng Zhang,et al.  Caught in the Act : The Structure of Phosphorylated â-Phosphoglucomutase from Lactococcus lactis , 2002 .

[83]  D. Rice,et al.  Insights into enzyme evolution revealed by the structure of methylaspartate ammonia lyase. , 2002, Structure.

[84]  P. Babbitt Definitions of enzyme function for the structural genomics era. , 2003, Current opinion in chemical biology.

[85]  J. Gerlt,et al.  Evolution of function in (beta/alpha)8-barrel enzymes. , 2003, Current opinion in chemical biology.

[86]  Gail J. Bartlett,et al.  Catalysing new reactions during evolution: economy of residues and mechanism. , 2003, Journal of molecular biology.

[87]  John Alan Gerlt,et al.  Evolution of function in (β/α)8-barrel enzymes , 2003 .