Accurate Atom-Mapping Computation for Biochemical Reactions

The complete atom mapping of a chemical reaction is a bijection of the reactant atoms to the product atoms that specifies the terminus of each reactant atom. Atom mapping of biochemical reactions is useful for many applications of systems biology, in particular for metabolic engineering where synthesizing new biochemical pathways has to take into account for the number of carbon atoms from a source compound that are conserved in the synthesis of a target compound. Rapid, accurate computation of the atom mapping(s) of a biochemical reaction remains elusive despite significant work on this topic. In particular, past researchers did not validate the accuracy of mapping algorithms. We introduce a new method for computing atom mappings called the minimum weighted edit-distance (MWED) metric. The metric is based on bond propensity to react and computes biochemically valid atom mappings for a large percentage of biochemical reactions. MWED models can be formulated efficiently as Mixed-Integer Linear Programs (MILPs). We have demonstrated this approach on 7501 reactions of the MetaCyc database for which 87% of the models could be solved in less than 10 s. For 2.1% of the reactions, we found multiple optimal atom mappings. We show that the error rate is 0.9% (22 reactions) by comparing these atom mappings to 2446 atom mappings of the manually curated Kyoto Encyclopedia of Genes and Genomes (KEGG) RPAIR database. To our knowledge, our computational atom-mapping approach is the most accurate and among the fastest published to date. The atom-mapping data will be available in the MetaCyc database later in 2012; the atom-mapping software will be available within the Pathway Tools software later in 2012.

[1]  Peter Willett,et al.  Use of a maximum common subgraph algorithm in the automatic identification of ostensible bond changes occurring in chemical reactions , 1981, J. Chem. Inf. Comput. Sci..

[2]  Matthias Rarey,et al.  Maximum common subgraph isomorphism algorithms and their applications in molecular science: a review , 2011 .

[3]  Johann Gasteiger,et al.  Automatic Determination of Reaction Mappings and Reaction Center Information. 2. Validation on a Biochemical Reaction Database , 2008, J. Chem. Inf. Model..

[4]  K. Anderson,et al.  Insights into the Mechanism of 3-Deoxy-D-arabino-heptulosonate 7-Phosphate Synthase (Phe) from Escherichia coli Using a Transient Kinetic Analysis* , 2004, Journal of Biological Chemistry.

[5]  D J Weber,et al.  NMR and isotopic exchange studies of the site of bond cleavage in the MutT reaction. , 1992, The Journal of biological chemistry.

[6]  Masanori Arita In silico atomic tracing by substrate-product relationships in Escherichia coli intermediary metabolism. , 2003, Genome research.

[7]  Patrick F Suthers,et al.  Construction of an E. Coli genome‐scale atom mapping model for MFA calculations , 2011, Biotechnology and bioengineering.

[8]  L. Hedstrom,et al.  3-Deoxy-D-manno-octulosonate-8-phosphate synthase catalyzes the C-O bond cleavage of phosphoenolpyruvate. , 1988, Biochemical and biophysical research communications.

[9]  Masanori Arita The metabolic world of Escherichia coli is not small. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[10]  M. Kanehisa,et al.  Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. , 2003, Journal of the American Chemical Society.

[11]  Joannis Apostolakis,et al.  Automatic Determination of Reaction Mappings and Reaction Center Information. 1. The Imaginary Transition State Energy Approach , 2008, J. Chem. Inf. Model..

[12]  Tobias Achterberg,et al.  SCIP: solving constraint integer programs , 2009, Math. Program. Comput..

[13]  M. Cohn,et al.  Mechanisms of enzymic cleavage of some organic phosphates. , 1959, Journal of cellular and comparative physiology.

[14]  A. Deleo,et al.  Mechanism of 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthetase. , 1968, Biochemical and biophysical research communications.

[15]  Christodoulos A. Floudas,et al.  Stereochemically Consistent Reaction Mapping and Identification of Multiple Reaction Mechanisms through Integer Linear Optimization , 2012, J. Chem. Inf. Model..