A global analysis of function and conservation of catalytic residues in enzymes

The catalytic residues of an enzyme comprise the amino acids located in the active center responsible for accelerating the enzyme-catalyzed reaction. These residues lower the activation energy of reactions by performing several catalytic functions. Decades of enzymology research has established general themes regarding the roles of specific residues in these catalytic reactions, but it has been more difficult to explore these roles in a more systematic way. Here, we review the data on the catalytic residues of 648 enzymes, as annotated in the Mechanism and Catalytic Site Atlas (M-CSA), and compare our results with those in previous studies. We structured this analysis around three key properties of the catalytic residues: amino acid type, catalytic function, and sequence conservation in homologous proteins. As expected, we observed that catalysis is mostly accomplished by a small set of residues performing a limited number of catalytic functions. Catalytic residues are typically highly conserved, but to a smaller degree in homologues that perform different reactions or are nonenzymes (pseudoenzymes). Cross-analysis yielded further insights revealing which residues perform particular functions and how often. We obtained more detailed specificity rules for certain functions by identifying the chemical group upon which the residue acts. Finally, we show the mutation tolerance of the catalytic residues based on their roles. The characterization of the catalytic residues, their functions, and conservation, as presented here, is key to understanding the impact of mutations in evolution, disease, and enzyme design. The tools developed for this analysis are available at the M-CSA website and allow for user specific analysis of the same data.

[1]  M J Sternberg,et al.  Analysis and prediction of the location of catalytic residues in enzymes. , 1988, Protein engineering.

[2]  Gail J. Bartlett,et al.  Analysis of catalytic residues in enzyme active sites. , 2002, Journal of molecular biology.

[3]  Gail J. Bartlett,et al.  Catalysing new reactions during evolution: economy of residues and mechanism. , 2003, Journal of molecular biology.

[4]  Janet M. Thornton,et al.  The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data , 2004, Nucleic Acids Res..

[5]  Peter Murray-Rust,et al.  MACiE: a database of enzyme reaction mechanisms , 2005, Bioinform..

[6]  Mona Singh,et al.  Predicting functionally important residues from sequence conservation , 2007, Bioinform..

[7]  Patricia C. Babbitt,et al.  Annotation Error in Public Databases: Misannotation of Molecular Function in Enzyme Superfamilies , 2009, PLoS Comput. Biol..

[8]  Yang Shi,et al.  The conserved NAD(H)-dependent corepressor CTBP-1 regulates Caenorhabditis elegans life span , 2009, Proceedings of the National Academy of Sciences.

[9]  Gemma L. Holliday,et al.  Understanding the functional roles of amino acid residues in enzyme catalysis. , 2009, Journal of molecular biology.

[10]  Dan S. Tawfik,et al.  Enzyme promiscuity: a mechanistic and evolutionary perspective. , 2010, Annual review of biochemistry.

[11]  Gemma L. Holliday,et al.  The structures and physicochemical properties of organic cofactors in biocatalysis. , 2010, Journal of molecular biology.

[12]  Gemma L. Holliday,et al.  Characterizing the complexity of enzymes on the basis of their mechanisms and structures with a bio-computational analysis , 2011, The FEBS journal.

[13]  Ian Sillitoe,et al.  Exploring the Evolution of Novel Enzyme Functions within Structurally Defined Protein Superfamilies , 2012, PLoS Comput. Biol..

[14]  Gemma L. Holliday,et al.  MACiE: exploring the diversity of biochemical reactions , 2011, Nucleic Acids Res..

[15]  Daniel W. A. Buchan,et al.  A large-scale evaluation of computational protein function prediction , 2013, Nature Methods.

[16]  Andrew G McDonald,et al.  Fifty‐five years of enzyme classification: advances and difficulties , 2014, The FEBS journal.

[17]  Janet M. Thornton,et al.  The Catalytic Site Atlas 2.0: cataloging catalytic sites and residues identified in enzymes , 2013, Nucleic Acids Res..

[18]  Gemma L. Holliday,et al.  Exploring the Biological and Chemical Complexity of the Ligases , 2014, Journal of molecular biology.

[19]  Gemma L. Holliday,et al.  EC-BLAST: A Tool to Automatically Search and Compare Enzyme Reactions , 2014, Nature Methods.

[20]  Michael A. Hicks,et al.  The Structure–Function Linkage Database , 2006, Nucleic Acids Res..

[21]  John B. O. Mitchell,et al.  The Natural History of Biocatalytic Mechanisms , 2014, PLoS Comput. Biol..

[22]  S. Copley An evolutionary biochemist's perspective on promiscuity. , 2015, Trends in biochemical sciences.

[23]  Tsuyoshi Kato,et al.  EzCatDB: the enzyme reaction database, 2015 update , 2014, Nucleic Acids Res..

[24]  Frances H Arnold,et al.  Expanding the enzyme universe: accessing non-natural reactions by mechanism-guided directed evolution. , 2015, Angewandte Chemie.

[25]  Michael J E Sternberg,et al.  The Phyre2 web portal for protein modeling, prediction and analysis , 2015, Nature Protocols.

[26]  D. Baker,et al.  The coming of age of de novo protein design , 2016, Nature.

[27]  N. Jura,et al.  Structural Basis for the Non-catalytic Functions of Protein Kinases. , 2016, Structure.

[28]  P. Eyers,et al.  Live and let die: insights into pseudoenzyme mechanisms from structure. , 2017, Current opinion in structural biology.

[29]  Georg K. A. Hochberg,et al.  Reconstructing Ancient Proteins to Understand the Causes of Structure and Function. , 2017, Annual review of biophysics.

[30]  Maximilian Ccjc Ebert,et al.  Computational tools for enzyme improvement: why everyone can - and should - use them. , 2017, Current opinion in chemical biology.

[31]  Janet M. Thornton,et al.  Mechanism and Catalytic Site Atlas (M-CSA): a database of enzyme reaction mechanisms and active sites , 2017, Nucleic Acids Res..

[32]  Gholamreza Haffari,et al.  PREvaIL, an integrative approach for inferring catalytic residues using sequence, structural, and network features in a machine-learning framework. , 2018, Journal of theoretical biology.

[33]  Cathleen Zeymer,et al.  Directed Evolution of Protein Catalysts. , 2018, Annual review of biochemistry.

[34]  The UniProt Consortium,et al.  UniProt: a worldwide hub of protein knowledge , 2018, Nucleic Acids Res..

[35]  B. Rost,et al.  funtrp: identifying protein positions for variation driven functional tuning , 2019, bioRxiv.

[36]  Silvio C. E. Tosatto,et al.  The Pfam protein families database in 2019 , 2018, Nucleic Acids Res..

[37]  António J. M. Ribeiro,et al.  Emerging concepts in pseudoenzyme classification, evolution, and signaling , 2019, Science Signaling.

[38]  Ian Sillitoe,et al.  CATH: expanding the horizons of structure-based functional annotations for genome sequences , 2018, Nucleic Acids Res..