Non-homogeneous dynamic Bayesian networks with Bayesian regularization for inferring gene regulatory networks with gradually time-varying structure

The proper functioning of any living cell relies on complex networks of gene regulation. These regulatory interactions are not static but respond to changes in the environment and evolve during the life cycle of an organism. A challenging objective in computational systems biology is to infer these time-varying gene regulatory networks from typically short time series of transcriptional profiles. While homogeneous models, like conventional dynamic Bayesian networks, lack the flexibility to succeed in this task, fully flexible models suffer from inflated inference uncertainty due to the limited amount of available data. In the present paper we explore a semi-flexible model based on a piecewise homogeneous dynamic Bayesian network regularized by gene-specific inter-segment information sharing. We explore different choices of prior distribution and information coupling and evaluate their performance on synthetic data. We apply our method to gene expression time series obtained during the life cycle of Drosophila melanogaster, and compare the predicted segmentation with other state-of-the-art techniques. We conclude our evaluation with an application to synthetic biology, where the objective is to predict an in vivo regulatory network of five genes in Saccharomyces cerevisiae subjected to a changing environment.

[1]  N. Hengartner,et al.  Structural learning with time‐varying components: tracking the cross‐section of financial time series , 2005 .

[2]  Jérôme Euzenat,et al.  Grasping at molecular interactions and genetic networks in Drosophila melanogaster using FlyNets, an Internet database , 1999, Nucleic Acids Res..

[3]  J. Littleton,et al.  Characterization of a hypercontraction-induced myopathy in Drosophila caused by mutations in Mhc , 2004, The Journal of cell biology.

[4]  S. Parkhurst,et al.  wimp, a dominant maternal-effect mutation, reduces transcription of a specific subset of segmentation genes in Drosophila. , 1991, Genes & development.

[5]  Thomas S. Huang,et al.  Time Varying Dynamic Bayesian Network for Nonstationary Events Modeling and Online Inference , 2011, IEEE Transactions on Signal Processing.

[6]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[7]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[8]  Edward R. Dougherty,et al.  Inferring gene regulatory networks from time series data using the minimum description length principle , 2006, Bioinform..

[9]  E. Andrianantoandro,et al.  Synthetic biology: new engineering rules for an emerging discipline , 2006, Molecular systems biology.

[10]  David Sims,et al.  FLIGHT: database and tools for the integration and cross-correlation of large-scale RNAi phenotypic datasets , 2005, Nucleic Acids Res..

[11]  Anthony Hall,et al.  Disruption of Hepatic Leptin Signaling Protects Mice From Age- and Diet-Related Glucose Intolerance , 2010, Diabetes.

[12]  Dirk Husmeier,et al.  Heterogeneous Continuous Dynamic Bayesian Networks with Flexible Structure and Inter-Time Segment Information Sharing , 2010, ICML.

[13]  C. Emerson,et al.  Functional interactions between unlinked muscle genes within haploinsufficient regions of the Drosophila genome. , 1988, Genetics.

[14]  Amr Ahmed,et al.  Recovering time-varying networks of dependencies in social and biological studies , 2009, Proceedings of the National Academy of Sciences.

[15]  Andrew Zisserman,et al.  Advances in Neural Information Processing Systems (NIPS) , 2007 .

[16]  Marco Grzegorczyk,et al.  Non-homogeneous dynamic Bayesian networks for continuous data , 2011, Machine Learning.

[17]  Michael P. H. Stumpf,et al.  Statistical inference of the time-varying structure of gene-regulation networks , 2010, BMC Systems Biology.

[18]  Marco Grzegorczyk,et al.  Non-stationary continuous dynamic Bayesian networks , 2009, NIPS.

[19]  B. Larget,et al.  Markov Chain Monte Carlo Algorithms for the Bayesian Analysis of Phylogenetic Trees , 2000 .

[20]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[21]  Takeshi Mizuno,et al.  Data assimilation constrains new connections and components in a complex, eukaryotic circadian clock model , 2010, Molecular Systems Biology.

[22]  Christophe Andrieu,et al.  Joint Bayesian model selection and estimation of noisy sinusoids via reversible jump MCMC , 1999, IEEE Trans. Signal Process..

[23]  Le Song,et al.  Sparsistent Learning of Varying-coefficient Models with Structural Changes , 2009, NIPS.

[24]  Alexander J. Hartemink,et al.  Learning Non-Stationary Dynamic Bayesian Networks , 2010, J. Mach. Learn. Res..

[25]  J. Vigoreaux,et al.  Suppression of muscle hypercontraction by mutations in the myosin heavy chain gene of Drosophila melanogaster. , 2003, Genetics.

[26]  R. Chanet,et al.  Protein interaction mapping: a Drosophila case study. , 2005, Genome research.

[27]  Wenjie Fu,et al.  Recovering temporally rewiring networks: a model-based approach , 2007, ICML '07.

[28]  Frank Dondelinger Machine learning approach to reconstructing signalling pathways and interaction networks in biology , 2013 .

[29]  D. Husmeier,et al.  Detecting recombination in 4-taxa DNA sequence alignments with Bayesian hidden Markov models and Markov chain Monte Carlo. , 2003, Molecular biology and evolution.

[30]  Alexander J. Hartemink,et al.  Non-stationary dynamic Bayesian networks , 2008, NIPS.

[31]  Dirk Husmeier,et al.  Gene Regulatory Network Reconstruction by Bayesian Integration of Prior Knowledge and/or Different Experimental Conditions , 2008, J. Bioinform. Comput. Biol..

[32]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[33]  Kevin P. Murphy,et al.  Modeling changing dependency structure in multivariate time series , 2007, ICML '07.

[34]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.

[35]  D. Bernardo,et al.  A Yeast Synthetic Network for In Vivo Assessment of Reverse-Engineering and Modeling Approaches , 2009, Cell.

[36]  Sophie Lèbre Stochastic process analysis for Genomics and Dynamic Bayesian Networks inference. , 2007 .

[37]  Andrew McCallum,et al.  Piecewise pseudolikelihood for efficient training of conditional random fields , 2007, ICML '07.

[38]  Manfred Jaeger,et al.  Proceedings of the 24th Annual International Conference on Machine Learning (ICML 2007) , 2007, ICML 2007.

[39]  E. Salmon Gene Expression During the Life Cycle of Drosophila melanogaster , 2002 .

[40]  Dirk Husmeier,et al.  Inter-time segment information sharing for non-homogeneous dynamic Bayesian networks , 2010, NIPS.

[41]  Christophe Andrieu,et al.  Bayesian curve fitting using MCMC with applications to signal segmentation , 2002, IEEE Trans. Signal Process..