Tracking of time-varying genomic regulatory networks with a LASSO-Kalman smoother

It is widely accepted that cellular requirements and environmental conditions dictate the architecture of genetic regulatory networks. Nonetheless, the status quo in regulatory network modeling and analysis assumes an invariant network topology over time. In this paper, we refocus on a dynamic perspective of genetic networks, one that can uncover substantial topological changes in network structure during biological processes such as developmental growth. We propose a novel outlook on the inference of time-varying genetic networks, from a limited number of noisy observations, by formulating the network estimation as a target tracking problem. We overcome the limited number of observations (small n large p problem) by performing tracking in a compressed domain. Assuming linear dynamics, we derive the LASSO-Kalman smoother, which recursively computes the minimum mean-square sparse estimate of the network connectivity at each time point. The LASSO operator, motivated by the sparsity of the genetic regulatory networks, allows simultaneous signal recovery and compression, thereby reducing the amount of required observations. The smoothing improves the estimation by incorporating all observations. We track the time-varying networks during the life cycle of the Drosophila melanogaster. The recovered networks show that few genes are permanent, whereas most are transient, acting only during specific developmental phases of the organism.

[1]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[2]  Ting Sun,et al.  Single-pixel imaging via compressive sampling , 2008, IEEE Signal Process. Mag..

[3]  Jean-Luc Starck,et al.  Compressed Sensing in Astronomy , 2008, IEEE Journal of Selected Topics in Signal Processing.

[4]  Takashi Matsumoto,et al.  Nonparametric Bayes-based Heterogeneous “Drosophila Melanogaster” Gene Regulatory Network Inference: T-Process Regression , 2013 .

[5]  Hendrik Van Brussel,et al.  A Smoothly Constrained Kalman Filter , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Otmar Scherzer,et al.  Handbook of Mathematical Methods in Imaging , 2015, Handbook of Mathematical Methods in Imaging.

[7]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[8]  L. Hood,et al.  A Genomic Regulatory Network for Development , 2002, Science.

[9]  Michael I. Jordan,et al.  Bayesian Nonparametric Inference of Switching Dynamic Linear Models , 2010, IEEE Transactions on Signal Processing.

[10]  Thomas S. Huang,et al.  Time Varying Dynamic Bayesian Network for Nonstationary Events Modeling and Online Inference , 2011, IEEE Transactions on Signal Processing.

[11]  Zidong Wang,et al.  An Extended Kalman Filtering Approach to Modeling Nonlinear Dynamic Gene Regulatory Networks via Short Gene Expression Time Series , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[12]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[13]  Massimo Fornasier,et al.  Compressive Sensing , 2015, Handbook of Mathematical Methods in Imaging.

[14]  Thomas D. Nielsen,et al.  Adapting Bayes Network Structures to Non-stationary Domains , 2006, Probabilistic Graphical Models.

[15]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[16]  Amr Ahmed,et al.  Recovering time-varying networks of dependencies in social and biological studies , 2009, Proceedings of the National Academy of Sciences.

[17]  Joseph J. LaViola,et al.  On Kalman Filtering With Nonlinear Equality Constraints , 2007, IEEE Transactions on Signal Processing.

[18]  Alexander J. Hartemink,et al.  Learning Non-Stationary Dynamic Bayesian Networks , 2010, J. Mach. Learn. Res..

[19]  John B. Moore,et al.  Optimal State Estimation , 2006 .

[20]  I. Simon,et al.  Reconstructing dynamic regulatory maps , 2007, Molecular systems biology.

[21]  Nir Friedman,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004, Science.

[22]  M. Gerstein,et al.  Genomic analysis of regulatory network dynamics reveals large topological changes , 2004, Nature.

[23]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[24]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[25]  James M. Rehg,et al.  Learning and Inferring Motion Patterns using Parametric Segmental Switching Linear Dynamic Systems , 2008, International Journal of Computer Vision.

[26]  Chaoyang Zhang,et al.  A novel gene network inference algorithm using predictive minimum description length approach , 2010, BMC Systems Biology.

[27]  Alexander J. Hartemink,et al.  Non-stationary dynamic Bayesian networks , 2008, NIPS.

[28]  E. Dougherty,et al.  Inferring Connectivity of Genetic Regulatory Networks Using Information-Theoretic Criteria , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[29]  Richard G. Baraniuk,et al.  Compressive Sensing , 2008, Computer Vision, A Reference Guide.

[30]  Allan Tucker,et al.  A Bayesian network approach to explaining time series with changing structure , 2004, Intell. Data Anal..

[31]  Hugh F. Durrant-Whyte,et al.  Model-based multi-sensor data fusion , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[32]  Holger Rauhut,et al.  Compressive Estimation of Doubly Selective Channels in Multicarrier Systems: Leakage Effects and Sparsity-Enhancing Processing , 2009, IEEE Journal of Selected Topics in Signal Processing.

[33]  Wenjie Fu,et al.  Recovering temporally rewiring networks: a model-based approach , 2007, ICML '07.

[34]  Sebastian Kadener,et al.  Mathematical model of the Drosophila circadian clock: loop regulation and transcriptional integration. , 2009, Biophysical journal.

[35]  Edward R. Dougherty,et al.  Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks , 2002, Bioinform..

[36]  LiuXiaohui,et al.  An Extended Kalman Filtering Approach to Modeling Nonlinear Dynamic Gene Regulatory Networks via Short Gene Expression Time Series , 2009 .

[37]  Jiguo Cao,et al.  Modeling gene regulation networks using ordinary differential equations. , 2012, Methods in molecular biology.

[38]  B. S. Baker,et al.  Gene Expression During the Life Cycle of Drosophila melanogaster , 2002, Science.

[39]  Marco Grzegorczyk,et al.  Non-stationary continuous dynamic Bayesian networks , 2009, NIPS.

[40]  Martin Kuiper,et al.  BiNGO: a Cytoscape plugin to assess overrepresentation of Gene Ontology categories in Biological Networks , 2005, Bioinform..

[41]  Lingfeng Shi,et al.  Adaptive learning of dynamic Bayesian networks with changing structures by detecting geometric structures of time series , 2008, Knowledge and Information Systems.

[42]  Eric P. Xing,et al.  Time-Varying Networks: Recovering Temporally Rewiring Genetic Networks During the Life Cycle of Drosophila melanogaster , 2008, 0901.0138.

[43]  Matthias E. Futschik,et al.  OLIN: optimized normalization, visualization and quality testing of two-channel microarray data , 2005, Bioinform..

[44]  Ralf Herwig,et al.  Reverse Engineering of Gene Regulatory Networks: A Comparative Study , 2009, EURASIP J. Bioinform. Syst. Biol..

[45]  Dan Schonfeld,et al.  Inference of genetic regulatory networks using regularized likelihood with covariance estimation , 2012, 2012 IEEE Statistical Signal Processing Workshop (SSP).

[46]  S. Dongen Performance criteria for graph clustering and Markov cluster experiments , 2000 .

[47]  Alfred O. Hero,et al.  Inferring Time-Varying Network Topologies from Gene Expression Data , 2007, EURASIP J. Bioinform. Syst. Biol..

[48]  S. Kauffman Metabolic stability and epigenesis in randomly constructed genetic nets. , 1969, Journal of theoretical biology.

[49]  Le Song,et al.  Time-Varying Dynamic Bayesian Networks , 2009, NIPS.

[50]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[51]  O. Scherzer Handbook of mathematical methods in imaging , 2011 .

[52]  Michele Ceccarelli,et al.  articleTimeDelay-ARACNE : Reverse engineering of gene networks from time-course data by an information theoretic approach , 2010 .

[53]  Hazem N. Nounou,et al.  Inferring Gene Regulatory Networks via Nonlinear State-Space Models and Exploiting Sparsity , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[54]  Paul Fearnhead,et al.  Exact and efficient Bayesian inference for multiple changepoint problems , 2006, Stat. Comput..

[55]  Dirk Husmeier,et al.  Non-homogeneous dynamic Bayesian networks with Bayesian regularization for inferring gene regulatory networks with gradually time-varying structure , 2012, Machine Learning.

[56]  Kevin Kontos,et al.  Information-Theoretic Inference of Large Transcriptional Regulatory Networks , 2007, EURASIP J. Bioinform. Syst. Biol..

[57]  Pini Gurfil,et al.  Methods for Sparse Signal Recovery Using Kalman Filtering With Embedded Pseudo-Measurement Norms and Quasi-Norms , 2010, IEEE Transactions on Signal Processing.

[58]  Michael A. West,et al.  Bayesian time-varying autoregressions: Theory, methods and applications , 2000 .