Large-Scale Analysis Exploring Evolution of Catalytic Machineries and Mechanisms in Enzyme Superfamilies

Enzymes, as biological catalysts, form the basis of all forms of life. How these proteins have evolved their functions remains a fundamental question in biology. Over 100 years of detailed biochemistry studies, combined with the large volumes of sequence and protein structural data now available, means that we are able to perform large-scale analyses to address this question. Using a range of computational tools and resources, we have compiled information on all experimentally annotated changes in enzyme function within 379 structurally defined protein domain superfamilies, linking the changes observed in functions during evolution to changes in reaction chemistry. Many superfamilies show changes in function at some level, although one function often dominates one superfamily. We use quantitative measures of changes in reaction chemistry to reveal the various types of chemical changes occurring during evolution and to exemplify these by detailed examples. Additionally, we use structural information of the enzymes active site to examine how different superfamilies have changed their catalytic machinery during evolution. Some superfamilies have changed the reactions they perform without changing catalytic machinery. In others, large changes of enzyme function, in terms of both overall chemistry and substrate specificity, have been brought about by significant changes in catalytic machinery. Interestingly, in some superfamilies, relatives perform similar functions but with different catalytic machineries. This analysis highlights characteristics of functional evolution across a wide range of superfamilies, providing insights that will be useful in predicting the function of uncharacterised sequences and the design of new synthetic enzymes.

[1]  M. Harms,et al.  Evolutionary biochemistry: revealing the historical and physical causes of protein properties , 2013, Nature Reviews Genetics.

[2]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[3]  Todd Ae,et al.  Evolution of function in protein superfamilies. , 2001 .

[4]  Richard N. Armstrong,et al.  Large-Scale Determination of Sequence, Structure, and Function Relationships in Cytosolic Glutathione Transferases across the Biosphere , 2014, PLoS biology.

[5]  Gregory A. Petsko,et al.  Mandelate racemase and muconate lactonizing enzyme are mechanistically distinct and structurally homologous , 1990, Nature.

[6]  Michael Y. Galperin,et al.  Divergence and Convergence in Enzyme Evolution , 2011, The Journal of Biological Chemistry.

[7]  Rainer Schrader,et al.  Small Molecule Subgraph Detector (SMSD) toolkit , 2009, J. Cheminformatics.

[8]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..

[9]  Patricia C. Babbitt,et al.  Understanding Enzyme Superfamilies , 1997, The Journal of Biological Chemistry.

[10]  Benoit H. Dessailly,et al.  Functional site plasticity in domain superfamilies☆ , 2013, Biochimica et biophysica acta.

[11]  Janet M. Thornton,et al.  The Catalytic Site Atlas 2.0: cataloging catalytic sites and residues identified in enzymes , 2013, Nucleic Acids Res..

[12]  Tao Liu,et al.  TreeFam: 2008 Update , 2007, Nucleic Acids Res..

[13]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[14]  Fanny Sunden,et al.  Differential catalytic promiscuity of the alkaline phosphatase superfamily bimetallo core reveals mechanistic features underlying enzyme evolution , 2017, The Journal of Biological Chemistry.

[15]  C. Orengo,et al.  Plasticity of enzyme active sites. , 2002, Trends in biochemical sciences.

[16]  Gemma L. Holliday,et al.  MACiE: exploring the diversity of biochemical reactions , 2011, Nucleic Acids Res..

[17]  Y. Lindqvist,et al.  Refined structure of spinach glycolate oxidase at 2 A resolution. , 1989, Journal of molecular biology.

[18]  Stanley L. Miller,et al.  On the Origin of Metabolic Pathways , 1999, Journal of Molecular Evolution.

[19]  Alfonso Valencia,et al.  firestar—prediction of functionally important residues using structural templates and alignment reliability , 2007, Nucleic Acids Res..

[20]  A. Mclachlan,et al.  Repeating sequences and gene duplication in proteins. , 1972, Journal of molecular biology.

[21]  Ian Sillitoe,et al.  FunTree: a resource for exploring the functional evolution of structurally defined enzyme superfamilies , 2011, Nucleic Acids Res..

[22]  M. Swindells,et al.  Protein clefts in molecular recognition and function. , 1996, Protein science : a publication of the Protein Society.

[23]  David A. Lee,et al.  CATH: comprehensive structural and functional annotations for genome sequences , 2014, Nucleic Acids Res..

[24]  Janet M Thornton,et al.  Pathway evolution, structurally speaking. , 2002, Current opinion in structural biology.

[25]  Janet M. Thornton,et al.  The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data , 2004, Nucleic Acids Res..

[26]  J. Thornton,et al.  Missing in action: enzyme functional annotations in biological databases. , 2009, Nature chemical biology.

[27]  Hua Huang,et al.  Panoramic view of a superfamily of phosphatases through substrate profiling , 2015, Proceedings of the National Academy of Sciences.

[28]  N. Tokuriki,et al.  Connectivity between catalytic landscapes of the metallo-β-lactamase superfamily. , 2014, Journal of molecular biology.

[29]  V Massey,et al.  L-lactate oxidase and L-lactate monooxygenase: mechanistic variations on a common structural theme. , 1995, Biochimie.

[30]  W R Taylor,et al.  Protein structure alignment. , 1989, Journal of molecular biology.

[31]  Ian Sillitoe,et al.  Exploring the Evolution of Novel Enzyme Functions within Structurally Defined Protein Superfamilies , 2012, PLoS Comput. Biol..

[32]  C. Chothia,et al.  The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. , 2001, Journal of molecular biology.

[33]  P. Babbitt,et al.  Divergent Evolution in Enolase Superfamily: Strategies for Assigning Functions* , 2011, The Journal of Biological Chemistry.

[34]  Dan S. Tawfik,et al.  Enzyme promiscuity: a mechanistic and evolutionary perspective. , 2010, Annual review of biochemistry.

[35]  Benjamin A. Shoemaker,et al.  Inferred Biomolecular Interaction Server—a web server to analyze and predict protein interacting partners and binding sites , 2009, Nucleic Acids Res..

[36]  F. S. Mathews,et al.  Three-dimensional structure of flavocytochrome b2 from baker's yeast at 3.0-A resolution. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Annabel E. Todd,et al.  Evolution of function in protein superfamilies, from a structural perspective. , 2001, Journal of molecular biology.

[38]  Pasch,et al.  References and Notes Supporting Online Material Evolution of Hormone-receptor Complexity by Molecular Exploitation , 2022 .

[39]  Gemma L. Holliday,et al.  EC-BLAST: A Tool to Automatically Search and Compare Enzyme Reactions , 2014, Nature Methods.

[40]  David A. Lee,et al.  Functional classification of CATH superfamilies: a domain-based approach for protein function annotation , 2015, Bioinform..

[41]  E. Webb Enzyme nomenclature 1992. Recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology on the Nomenclature and Classification of Enzymes. , 1992 .

[42]  Dan S. Tawfik,et al.  What makes a protein fold amenable to functional innovation? Fold polarity and stability trade-offs. , 2013, Journal of molecular biology.

[43]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[44]  N. Tokuriki,et al.  Dynamics and constraints of enzyme evolution. , 2014, Journal of experimental zoology. Part B, Molecular and developmental evolution.

[45]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[46]  J. Krug,et al.  Empirical fitness landscapes and the predictability of evolution , 2014, Nature Reviews Genetics.

[47]  Y. Lindqvist,et al.  Role of tyrosine 129 in the active site of spinach glycolate oxidase. , 1993, European journal of biochemistry.

[48]  Christine A. Orengo,et al.  Protein function prediction using domain families , 2013, BMC Bioinformatics.