Using Machine Learning Techniques for Modelling and Simulation of Metabolic Networks

Metabolomics is increasingly becoming an important field. The fundamental task in this area is to measure and interpret complex time and condition dependent parameters such as the activity or flux of metabolites in cells, their concentration, tissues elements and other biosamples. The careful study of all these elements has led to important insights in the functioning of metabolism. Recently, however, there is a growing interest towards an intagrated approach to studying biological systems. This is the main goal in Systems Biology where a combined investigation of several components of a biological system is thought to produce a thorough understanding of such systems. Metabolic networks are not only structurally complex but behave also in a stochastic fashion. Therefore, it is necessary to express structure and handle uncertainty to construct complete dynamics of these networks. In this paper we describe how stochastic modeling and simulation can be performed in a symbolic-statistical machine learning (ML) framework. We show that symbolic ML deals with structural and relational complexity while statistical ML provides principled approaches to uncertainty modeling. Learning is used to analyze traces of biochemical reactions and model the dynamicity through parameter learning, while inference is used to produce stochastic simulation of the network.

[1]  Joseph Y. Halpern An Analysis of First-Order Logics of Probability , 1989, IJCAI.

[2]  Nicolas Le Novère,et al.  STOCHSIM: modelling of stochastic biomolecular processes , 2001, Bioinform..

[3]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[4]  Nils J. Nilsson,et al.  Probabilistic Logic * , 2022 .

[5]  W. Weckwerth Metabolomics in systems biology. , 2003, Annual review of plant biology.

[6]  Thomas G. Dietterich,et al.  Learning first-order probabilistic models with combining rules , 2005, Annals of Mathematics and Artificial Intelligence.

[7]  Jennifer Neville,et al.  Dependency networks for relational data , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[8]  Steffen Klamt,et al.  FluxAnalyzer: exploring structure, pathways, and flux distributions in metabolic networks on interactive flux maps , 2003, Bioinform..

[9]  R. Goodacre,et al.  Metabolic Profiling: Its Role in Biomarker Discovery and Gene Function Analysis , 2003, Springer US.

[10]  Oliver Fiehn,et al.  Combining Genomics, Metabolome Analysis, and Biochemical Modelling to Understand Metabolic Networks , 2001, Comparative and functional genomics.

[11]  Lawrence B. Holder,et al.  Application of Graph-based Data Mining to Metabolic Pathways , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[12]  Hiroaki Kitano,et al.  Foundations of systems biology , 2001 .

[13]  Robert P. Goldman,et al.  From knowledge bases to decision models , 1992, The Knowledge Engineering Review.

[14]  F Baganz,et al.  Systematic functional analysis of the yeast genome. , 1998, Trends in biotechnology.

[15]  Manfred Jaeger Parameter learning for relational Bayesian networks , 2007, ICML '07.

[16]  Yoshitaka Kameya,et al.  Parameter Learning of Logic Programs for Symbolic-Statistical Modeling , 2001, J. Artif. Intell. Res..

[17]  Stephen Muggleton,et al.  Abduction and induction for learning models of inhibition in metabolic networks , 2005, Fourth International Conference on Machine Learning and Applications (ICMLA'05).

[18]  Fahiem Bacchus,et al.  Representing and reasoning with probabilistic knowledge , 1988 .

[19]  Christian Blaschke,et al.  Text Mining for Metabolic Pathways, Signaling Cascades, and Protein Networks , 2005, Science's STKE.

[20]  Wojciech Szpankowski,et al.  An efficient algorithm for detecting frequent subgraphs in biological networks , 2004, ISMB/ECCB.

[21]  Ben Taskar,et al.  Learning Probabilistic Models of Link Structure , 2003, J. Mach. Learn. Res..

[22]  Ben Taskar,et al.  Introduction to statistical relational learning , 2007 .

[23]  S. Muggleton Stochastic Logic Programs , 1996 .

[24]  Xingming Zhao,et al.  Computational Systems Biology , 2013, TheScientificWorldJournal.

[25]  Stephen Muggleton,et al.  Learning Structure and Parameters of Stochastic Logic Programs , 2002, ILP.

[26]  Taisuke Sato,et al.  A Statistical Learning Method for Logic Programs with Distribution Semantics , 1995, ICLP.

[27]  Stephen Muggleton,et al.  Machine learning metabolic pathway descriptions using a probabilistic relational representation , 2002 .