Discretized Kinetic Models for Abductive Reasoning in Systems Biology

The study of systems biology through inductive logic programming (ILP) aims at improving the understanding of the physiological state of the cell by reasoning with rules and relations instead of ordinary differential equations. This paper presents a method for enabling the ILP framework to deal with quantitative information from some experimental data in systems biology. The method consist in both discretizing the evolution of concentrations of metabolites during experiments and transcribing enzymatic kinetics (for instance Michaelis-Menten kinetics) into logic rules. Kinetic rules are added to background knowledge, along with the topology of the metabolic pathway, whereas discretized concentrations are observations. Applying ILP allows for abduction and induction in such a system. A logical model of the glycolysis and pentose phosphate pathways of E. Coli is proposed to support our method description. Logical formulae on concentrations of some metabolites, which could not be measured during the dynamic state, are produced through logical abduction. Finally, as this results in a large number of hypotheses, they are ranked with an expectation maximization algorithm working on binary decision diagrams.

[1]  Blaz Zupan,et al.  GenePath: from mutations to genetic networks and back , 2005, Nucleic Acids Res..

[2]  Stephen Muggleton,et al.  Inverse entailment and progol , 1995, New Generation Computing.

[3]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[4]  Chitta Baral,et al.  A knowledge based approach for representing and reasoning about signaling networks , 2004, ISMB/ECCB.

[5]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[6]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[7]  Oliver Ray,et al.  A Nonmonotonic Logical Approach for Modelling and Revising Metabolic Networks , 2009, 2009 International Conference on Complex, Intelligent and Software Intensive Systems.

[8]  Ashish Tiwari,et al.  Analyzing Pathways Using SAT-Based Approaches , 2007, AB.

[9]  H. Sahm,et al.  Pyruvate carboxylase is a major bottleneck for glutamate and lysine production by Corynebacterium glutamicum. , 2001, Journal of molecular microbiology and biotechnology.

[10]  Katsumi Inoue,et al.  SOLAR: A Consequence Finding System for Advanced Reasoning , 2003, TABLEAUX.

[11]  Andrei Doncescu,et al.  A Bayesian Hybrid Approach to Unsupervised Time Series Discretization , 2010, 2010 International Conference on Technologies and Applications of Artificial Intelligence.

[12]  Raymond J. Mooney,et al.  Integrating Abduction and Induction in Machine Learning , 2000 .

[13]  Katsumi Inoue,et al.  Biological Systems Analysis Using Inductive Logic Programming , 2007, 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07).

[14]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[15]  Peter C. Cheeseman,et al.  Bayesian Classification (AutoClass): Theory and Results , 1996, Advances in Knowledge Discovery and Data Mining.

[16]  Luc De Raedt Logical and Relational Learning , 2008, SBIA.

[17]  Lawrence Carin,et al.  Variational Bayes for continuous hidden Markov models and its application to active learning , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Katsumi Inoue,et al.  Linear Resolution for Consequence Finding , 1992, Artif. Intell..

[19]  Laurence T. Yang,et al.  Differential Evolutionary Algorithms for In Vivo Dynamic Analysis of Glycolysis and Pentose Phosphate Pathway in Escherichia coli , 2005 .

[20]  Katsumi Inoue,et al.  Induction as Consequence Finding , 2004, Machine Learning.

[21]  Frédéric Benhamou,et al.  Interval Constraint Logic Programming , 1994, Constraint Programming.

[22]  Matthew J. Beal Variational algorithms for approximate Bayesian inference , 2003 .

[23]  Christopher H. Bryant,et al.  Functional genomic hypothesis generation and experimentation by a robot scientist , 2004, Nature.

[24]  H. Kitano Systems Biology: A Brief Overview , 2002, Science.

[25]  Chin-Hui Lee,et al.  Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..

[26]  Ross D. King,et al.  On the use of qualitative reasoning to simulate and identify metabolic pathway , 2005, Bioinform..

[27]  Pierre Geurts,et al.  Pattern Extraction for Time Series Classification , 2001, PKDD.

[28]  Stephen Muggleton,et al.  Application of abductive ILP to learning metabolic network inhibition from temporal data , 2006, Machine Learning.

[29]  François Fages,et al.  Model Revision from Temporal Logic Properties in Computational Systems Biology , 2008, Probabilistic Inductive Logic Programming.

[30]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[31]  Torsten Schaub,et al.  Modeling Biological Networks by Action Languages via Answer Set Programming , 2008, Constraints.