Kinetic Models and Qualitative Abstraction for Relational Learning in Systems Biology

This paper presents a method for enabling the relational learning or inductive logic programming (ILP) frame- work to deal with quantitative information from experimental data in systems biology. The study of systems biology through ILP aims at improving the understanding of the physiological state of the cell and the interpre- tation of the interactions between metabolites and signaling networks. A logical model of the glycolysis and pentose phosphate pathways of E. Coli is proposed to support our method description. We explain our original approach to building a symbolic model applied to kinetics based on Michaelis-Menten equation, starting with the discretization of the changes in concentration of some of the metabolites over time into relevant levels. We can then use them in our ILP-based model. Logical formulae on concentrations of some metabolites, which could not be measured during the dynamic state, are produced through logical abduction. Finally, as this re- sults in a large number of hypotheses, they are ranked with an expectation maximization algorithm working on binary decision diagrams.

[1]  Katsumi Inoue,et al.  SOLAR: A Consequence Finding System for Advanced Reasoning , 2003, TABLEAUX.

[2]  H. Sahm,et al.  Pyruvate carboxylase is a major bottleneck for glutamate and lysine production by Corynebacterium glutamicum. , 2001, Journal of molecular microbiology and biotechnology.

[3]  Blaz Zupan,et al.  GenePath: from mutations to genetic networks and back , 2005, Nucleic Acids Res..

[4]  Ross D. King,et al.  On the use of qualitative reasoning to simulate and identify metabolic pathway , 2005, Bioinform..

[5]  Pierre Geurts,et al.  Pattern Extraction for Time Series Classification , 2001, PKDD.

[6]  Stephen Muggleton,et al.  Inverse entailment and progol , 1995, New Generation Computing.

[7]  Chin-Hui Lee,et al.  Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..

[8]  Sylvain Soliman,et al.  Model Revision from Temporal Logic Properties in Systems Biology , 2008 .

[9]  Lawrence Carin,et al.  Variational Bayes for continuous hidden Markov models and its application to active learning , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Katsumi Inoue,et al.  Linear Resolution for Consequence Finding , 1992, Artif. Intell..

[11]  Katsumi Inoue,et al.  In defense of PDDL axioms , 2003, Artif. Intell..

[12]  Stephen Muggleton,et al.  Application of abductive ILP to learning metabolic network inhibition from temporal data , 2006, Machine Learning.

[13]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[14]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[15]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[16]  E I Canela,et al.  Computer simulation of purine metabolism. , 1984, European journal of biochemistry.

[17]  Luc De Raedt,et al.  Logical and relational learning , 2008, Cognitive Technologies.

[18]  Oliver Ray,et al.  A Nonmonotonic Logical Approach for Modelling and Revising Metabolic Networks , 2009, 2009 International Conference on Complex, Intelligent and Software Intensive Systems.

[19]  Peter A. Flach,et al.  Abduction and induction: essays on their relation and integration , 2000 .

[20]  Christopher H. Bryant,et al.  Functional genomic hypothesis generation and experimentation by a robot scientist , 2004, Nature.

[21]  Katsumi Inoue,et al.  Biological Systems Analysis Using Inductive Logic Programming , 2007, 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07).

[22]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[23]  Peter C. Cheeseman,et al.  Bayesian Classification (AutoClass): Theory and Results , 1996, Advances in Knowledge Discovery and Data Mining.

[24]  Chitta Baral,et al.  A knowledge based approach for representing and reasoning about signaling networks , 2004, ISMB/ECCB.

[25]  Ashish Tiwari,et al.  Analyzing Pathways Using SAT-Based Approaches , 2007, AB.

[26]  Torsten Schaub,et al.  Modeling Biological Networks by Action Languages via Answer Set Programming , 2008, Constraints.

[27]  Katsumi Inoue,et al.  Induction as Consequence Finding , 2004, Machine Learning.

[28]  Katsumi Inoue,et al.  Knowledge Based Discovery in Systems Biology Using CF-Induction , 2007, IEA/AIE.

[29]  Laurence T. Yang,et al.  Differential Evolutionary Algorithms for In Vivo Dynamic Analysis of Glycolysis and Pentose Phosphate Pathway in Escherichia coli , 2005 .

[30]  Frédéric Benhamou,et al.  Interval Constraint Logic Programming , 1994, Constraint Programming.

[31]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[32]  Matthew J. Beal Variational algorithms for approximate Bayesian inference , 2003 .

[33]  Raymond J. Mooney,et al.  Integrating Abduction and Induction in Machine Learning , 2000 .