Two-stage Bayesian networks for metabolic network prediction

Metabolism is a set of chemical reactions, used by living organisms to process chemical compounds in order to take energy and eliminate toxic compounds, for example. Its processes are referred as metabolic pathways. Understanding metabolism is imperative to biology, toxicology and medicine, but the number and complexity of metabolic pathways makes this a difficult task. In our paper, we investigate the use of causal Bayesian networks to model the pathways of yeast saccharomyces cerevisiae metabolism: such a network can be used to draw predictions about the levels of metabolites and enzymes in a particular specimen. We, propose a two-stage methodology for causal networks, as follows. First construct a causal network from the network of metabolic pathways. The viability of this causal network depends on the validity of the causal Markov condition. If this condition fails, however, the principle of the common cause motivates the addition of a new causal arrow or a new`hidden' common cause to the network (stage 2 of the model formation process). Algorithms for adding arrows or hidden nodes have been developed separately in a number of papers, and in this paper we combine them, showing how the resulting procedure can be applied to the metabolic pathway problem. Our general approach was tested on neural cell morphology data and demonstrated noticeable improvements in both prediction and network accuracy .

[1]  J. Pearl,et al.  A statistical semantics for causation , 1992 .

[2]  Jon Williamson Approximating discrete probability distributions with Bayesian networks , 2003 .

[3]  Kevin P. Murphy,et al.  Learning the Structure of Dynamic Probabilistic Networks , 1998, UAI.

[4]  M Kanehisa,et al.  Organizing and computing metabolic pathway data in terms of binary relations. , 1997, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[5]  Duncan Fyfe Gillies,et al.  Interpretation of Hidden Node Methodology with Network Accuracy , 2003 .

[6]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[7]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[8]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[9]  Sholl Da Dendritic organization in the neurons of the visual and motor cortices of the cat. , 1953 .

[10]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[11]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[12]  Duncan Fyfe Gillies,et al.  Using Bayesian Networks with Hidden Nodes to Recognise Neural Cell Morphology , 2002, PRICAI.

[13]  Satoru Miyano,et al.  Estimation of Genetic Networks and Functional Structures Between Genes by Using Bayesian Networks and Nonparametric Regression , 2001, Pacific Symposium on Biocomputing.

[14]  Jon Williamson,et al.  Foundations for Bayesian Networks , 2001 .

[15]  A. G. Flook The use of dilation logic on the quantimet to achieve fractal dimension characterisation of textured and structured profiles , 1978 .

[16]  Jon Williamson,et al.  Learning Causal Relationships , 2002 .