Automatic assignment of reaction operators to enzymatic reactions

BACKGROUND Enzymes are classified in a numerical classification scheme introduced by the Nomenclature Committee of the IUBMB based on the overall reaction chemistry. Due to the manifold of enzymatic reactions the system has become highly complex. Assignment of enzymes to the enzyme classes requires a detailed knowledge of the system and manual analysis. Frequently rearrangements and deletions of enzymes and sub-subclasses are necessary. RESULTS We use the Dugundji-Ugi model for coding of biochemical reactions which is based on electron shift patterns occurring during reactions. Changes of the bonds or of non-bonded valence electrons are expressed by reaction matrices. Our program calculates reaction matrices automatically on the sole basis of substrate and product chemical structures based on a new strategy for maximal common substructure determination, which allows an accurate atom mapping of the substrate and product atoms. The system has been tested for a large set of enzymatic reactions including all sub-subclasses of the EC classification system. Altogether 147 different representative reaction operators were found in the classified enzymes, 121 of which are unique with respect to an EC sub-subclass. The other 26 comprise groups of enzymes with very similar reactions, being identical with respect to the bonds formed and broken. CONCLUSION The analysis and comparison of enzymatic reactions according to their electron shift patterns is defining enzyme groups characterised by unique reaction cores. Our results demonstrate the applicability of the Dugundji-Ugi model as a reasonable pre-classification system allowing an objective and rational view on biochemical reactions. AVAILABILITY The program to generate reaction matrix descriptors is available upon request.

[1]  Keith F. Tipton,et al.  History of the enzyme nomenclature system , 2000, Bioinform..

[2]  G. Levi A note on the derivation of maximal common subgraphs of two directed or undirected graphs , 1973 .

[3]  M. Kanehisa,et al.  Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. , 2003, Journal of the American Chemical Society.

[4]  Andreas Dietz,et al.  Models, concepts, theories, and formal languages in chemistry and their use as a basis for computer assistance in chemistry , 1994, Journal of chemical information and computer sciences.

[5]  Ina Koch,et al.  Enumerating all connected maximal common subgraphs in two graphs , 2001, Theor. Comput. Sci..

[6]  Peter Willett,et al.  Maximum common subgraph isomorphism algorithms for the matching of chemical structures , 2002, J. Comput. Aided Mol. Des..

[7]  Chunhui Li,et al.  Exploring the diversity of complex metabolic networks , 2005, Bioinform..

[8]  Joannis Apostolakis,et al.  Automatic Determination of Reaction Mappings and Reaction Center Information. 1. The Imaginary Transition State Energy Approach , 2008, J. Chem. Inf. Model..

[9]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[10]  Johannes M. Bauer IGOR2: a PC-program for generating new reactions and molecular structures , 1989 .

[11]  E. Webb,et al.  Enzyme Nomenclature. Recommendations 1978. Supplement 4: corrections and additions. , 1983, European journal of biochemistry.

[12]  Eric Fontain,et al.  The generation of reaction networks with RAIN. 1. The reaction generator , 1991, J. Chem. Inf. Comput. Sci..

[13]  Antje Chang,et al.  BRENDA , the enzyme database : updates and major new developments , 2003 .

[14]  James Dugundji,et al.  An algebraic model of constitutional chemistry as a basis for chemical computer programs , 1973 .

[15]  J. Gasteiger,et al.  Enabling the exploration of biochemical pathways. , 2004, Organic & biomolecular chemistry.

[16]  J. J. McGregor,et al.  Backtrack search algorithms and the maximal common subgraph problem , 1982, Softw. Pract. Exp..

[17]  Johannes M. Bauer IGOR2: A PC‐Program for Generating New Reactions and Molecular Structures , 1991 .

[18]  Joannis Apostolakis,et al.  Graph-Based Molecular Alignment (GMA) , 2007, J. Chem. Inf. Model..

[19]  Susumu Goto,et al.  Systematic Analysis of Enzyme-Catalyzed Reaction Patterns and Prediction of Microbial Biodegradation Pathways , 2007, J. Chem. Inf. Model..

[20]  Josef Brandt,et al.  An efficient algorithm for the computation of the canonical numbering of reaction matrices , 1983, Comput. Chem..

[21]  Johann Gasteiger,et al.  Automatic Determination of Reaction Mappings and Reaction Center Information. 2. Validation on a Biochemical Reaction Database , 2008, J. Chem. Inf. Model..

[22]  M. Kanehisa,et al.  Computational assignment of the EC numbers for genomic-scale analysis of enzymatic reactions. , 2004, Journal of the American Chemical Society.