Modelling non-stationary dynamic gene regulatory processes with the BGM model

Recently, a Bayesian network model for inferring non-stationary regulatory processes from gene expression time series has been proposed. The Bayesian Gaussian Mixture (BGM) Bayesian network model divides the data into disjunct compartments (data subsets) by a free allocation model, and infers network structures, which are kept fixed for all compartments. Fixing the network structure allows for some information sharing among compartments, and each compartment is modelled separately and independently with the Gaussian BGe scoring metric for Bayesian networks. The BGM model can equally be applied to both static (steady-state) and dynamic (time series) gene expression data. However, it is this flexibility that renders its application to time series data suboptimal. To improve the performance of the BGM model on time series data we propose a revised approach in which the free allocation of data points is replaced by a changepoint process so as to take the temporal structure into account. The practical inference follows the Bayesian paradigm and approximately samples the network, the number of compartments and the changepoint locations from the posterior distribution with Markov chain Monte Carlo (MCMC). Our empirical results show that the proposed modification leads to a more efficient inference tool for analysing gene expression time series.

[1]  Marco Grzegorczyk,et al.  Modelling non-stationary gene regulatory processes with a non-homogeneous Bayesian network and the allocation sampler , 2008, Bioinform..

[2]  Anton J. Enright,et al.  A logic-based diagram of signalling pathways central to macrophage activation , 2008, BMC Systems Biology.

[3]  Simon Rogers,et al.  A Bayesian regression approach to the inference of regulatory networks from gene expression data , 2005, Bioinform..

[4]  Agostino Nobile,et al.  Bayesian finite mixtures with an unknown number of components: The allocation sampler , 2007, Stat. Comput..

[5]  Dirk Husmeier,et al.  Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks , 2003, Bioinform..

[6]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[7]  K. Honda,et al.  Type I Inteferon Gene Induction by the Interferon Regulatory Factor Family of Transcription Factors , 2006 .

[8]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[9]  ChengXiang Zhai,et al.  Inference of Gene Pathways Using Gaussian Mixture Models , 2007, 2007 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2007).

[10]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[11]  Nir Friedman,et al.  Being Bayesian about Network Structure , 2000, UAI.

[12]  Zoubin Ghahramani,et al.  A Bayesian approach to reconstructing genetic regulatory networks with hidden factors , 2005, Bioinform..

[13]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[14]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[15]  David Heckerman,et al.  Learning Gaussian Networks , 1994, UAI.

[16]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[17]  Marco Grzegorczyk,et al.  Improving the structure MCMC sampler for Bayesian networks by introducing a new edge reversal move , 2008, Machine Learning.

[18]  J. Darnell,et al.  Jak-STAT pathways and transcriptional activation in response to IFNs and other extracellular signaling proteins. , 1994, Science.