Graph transformation for enzymatic mechanisms

Abstract Motivation The design of enzymes is as challenging as it is consequential for making chemical synthesis in medical and industrial applications more efficient, cost-effective and environmentally friendly. While several aspects of this complex problem are computationally assisted, the drafting of catalytic mechanisms, i.e. the specification of the chemical steps—and hence intermediate states—that the enzyme is meant to implement, is largely left to human expertise. The ability to capture specific chemistries of multistep catalysis in a fashion that enables its computational construction and design is therefore highly desirable and would equally impact the elucidation of existing enzymatic reactions whose mechanisms are unknown. Results We use the mathematical framework of graph transformation to express the distinction between rules and reactions in chemistry. We derive about 1000 rules for amino acid side chain chemistry from the M-CSA database, a curated repository of enzymatic mechanisms. Using graph transformation, we are able to propose hundreds of hypothetical catalytic mechanisms for a large number of unrelated reactions in the Rhea database. We analyze these mechanisms to find that they combine in chemically sound fashion individual steps from a variety of known multistep mechanisms, showing that plausible novel mechanisms for catalysis can be constructed computationally. Availability and implementation The source code of the initial prototype of our approach is available at https://github.com/Nojgaard/mechsearch Supplementary information Supplementary data are available at Bioinformatics online.

[1]  C. Chapple,et al.  Related Arabidopsis Serine Carboxypeptidase-Like Sinapoylglucose Acyltransferases Display Distinct But Overlapping Substrate Specificities1[OA] , 2007, Plant Physiology.

[2]  Manfred Nagl,et al.  Graph-Grammars and Their Application to Computer Science , 1982, Lecture Notes in Computer Science.

[3]  P. Anastas,et al.  Designing for a green chemistry future , 2020, Science.

[4]  Marwin H. S. Segler,et al.  Modelling Chemical Reasoning to Predict Reactions , 2016, Chemistry.

[5]  D. Pleissner,et al.  Green Chemistry and Its Contribution to Industrial Biotechnology. , 2018, Advances in biochemical engineering/biotechnology.

[6]  Daniel Merkle,et al.  Inferring chemical reaction patterns using rule composition in graph grammars , 2012, ArXiv.

[7]  Teresa Head-Gordon,et al.  Computational Design of Synthetic Enzymes. , 2018, Chemical reviews.

[8]  Hartmut Ehrig,et al.  Graph-Grammars and Their Application to Computer Science and Biology , 1978, Lecture Notes in Computer Science.

[9]  R. Wolfenden,et al.  The Rate of Spontaneous Decarboxylation of Amino Acids , 2000 .

[10]  Daniel Merkle,et al.  Generic Strategies for Chemical Space Exploration , 2013, Int. J. Comput. Biol. Drug Des..

[11]  Daniel Merkle,et al.  Chemical Graph Transformation with Stereo-Information , 2017, ICGT.

[12]  Daniel Merkle,et al.  A Software Package for Chemically Inspired Graph Transformation , 2016, ICGT.

[13]  Hartmut Ehrig,et al.  Introduction to the Algebraic Theory of Graph Grammars (A Survey) , 1978, Graph-Grammars and Their Application to Computer Science and Biology.

[14]  Joerg H. Schrittwieser,et al.  Artificial Biocatalytic Linear Cascades for Preparation of Organic Molecules. , 2018, Chemical reviews.

[15]  Anne Morgat,et al.  Updates in Rhea: SPARQLing biochemical reaction data , 2018, Nucleic Acids Res..

[16]  Hartmut Ehrig,et al.  Fundamentals of Algebraic Graph Transformation , 2006, Monographs in Theoretical Computer Science. An EATCS Series.

[17]  Anthony P. F. Cook,et al.  Computer‐aided synthesis design: 40 years on , 2012 .

[18]  Matthew H Todd,et al.  Computer-aided organic synthesis. , 2005, Chemical Society reviews.

[19]  Gaël Varoquaux,et al.  Proceedings of the 20th Python in Science Conference 2021 (SciPy 2021), Virtual Conference, July 12 - July 18, 2021 , 2008, SciPy.

[20]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[21]  D. Strack,et al.  Structure determinants and substrate recognition of serine carboxypeptidase‐like acyltransferases from plant secondary metabolism , 2006, FEBS letters.

[22]  Hartmut Ehrig,et al.  Graph-Grammars: An Algebraic Approach , 1973, SWAT.

[23]  Annegret Habel,et al.  Double-pushout graph transformation revisited , 2001, Mathematical Structures in Computer Science.

[24]  Janet M. Thornton,et al.  Mechanism and Catalytic Site Atlas (M-CSA): a database of enzyme reaction mechanisms and active sites , 2017, Nucleic Acids Res..