Structure identification for gene regulatory networks via linearization and robust state estimation

Abstract Inferring causal relationships among cellular components is one of the fundamental problems in understanding biological behaviours. The well known extended Kalman filter (EKF) has been proved to be a useful tool in simultaneously estimating both structure and actual gene expression levels of a gene regulatory network (GRN). First-order approximations, however, unavoidably result in modelling errors, but the EKF based method does not take either unmodelled dynamics or parametric uncertainties into account, which makes its estimation performances not very satisfactory. To overcome these problems, a sensitivity penalization based robust state estimator is adopted in this paper for revealing the structure of a GRN. Based on the specific structure of the estimation problem, it has been proved that under some weak conditions, both the EKF based method and the method suggested in this paper provide a consistent estimate, but the suggested method has a faster convergence speed. Compared with both the EKF and the unscented Kalman filter (UKF) based methods, simulation results and real data based estimations consistently show that both convergence speed and parametric estimation accuracy can be appreciably improved. These lead to significant reductions in both false positive errors and false negative errors, and may imply helpfulness of the suggested method in better understanding the structure and dynamics of actual GRNs.

[1]  Zidong Wang,et al.  An Extended Kalman Filtering Approach to Modeling Nonlinear Dynamic Gene Regulatory Networks via Short Gene Expression Time Series , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  Konrad Reif,et al.  Stochastic stability of the discrete-time extended Kalman filter , 1999, IEEE Trans. Autom. Control..

[3]  Edward R. Dougherty,et al.  Inference of Noisy Nonlinear Differential Equation Models for Gene Regulatory Networks Using Genetic Programming and Kalman Filtering , 2008, IEEE Transactions on Signal Processing.

[4]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[5]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[6]  Lennart Ljung,et al.  Theory and Practice of Recursive Identification , 1983 .

[7]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[8]  G. Grimmett,et al.  Probability and random processes , 2002 .

[9]  Adam A. Margolin,et al.  Reverse engineering cellular networks , 2006, Nature Protocols.

[10]  P. Woolf,et al.  A fuzzy logic approach to analyzing gene expression data. , 2000, Physiological genomics.

[11]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[12]  Isabel M. Tienda-Luna,et al.  Reverse engineering gene regulatory networks , 2009, IEEE Signal Processing Magazine.

[13]  Diego di Bernardo,et al.  Inference of gene regulatory networks and compound mode of action from time course gene expression profiles , 2006, Bioinform..

[14]  P. Geurts,et al.  Inferring Regulatory Networks from Expression Data Using Tree-Based Methods , 2010, PloS one.

[15]  Fuzhen Zhang Matrix Theory: Basic Results and Techniques , 1999 .

[16]  T. Westerlund,et al.  Remarks on "Asymptotic behavior of the extended Kalman filter as a parameter estimator for linear systems" , 1980 .

[17]  Tong Zhou,et al.  Gene Regulatory Network Inference from Multifactorial Perturbation Data Using both Regression and Correlation Analyses , 2012, Proceedings of the 31st Chinese Control Conference.

[18]  Ankush Mittal,et al.  Model gene network by semi-fixed Bayesian network , 2006, Expert Syst. Appl..

[19]  Satoru Miyano,et al.  Inferring gene networks from time series microarray data using dynamic Bayesian networks , 2003, Briefings Bioinform..

[20]  Edward R. Dougherty,et al.  Inference of Gene Regulatory Networks using S-System: A Unified Approach , 2007, 2007 IEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology.

[21]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[22]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[23]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[24]  P. Lancaster,et al.  The theory of matrices : with applications , 1985 .

[25]  Kevin Murphy,et al.  Modelling Gene Expression Data using Dynamic Bayesian Networks , 2006 .

[26]  Tianhai Tian,et al.  Stochastic neural network models for gene regulatory networks , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[27]  Aurélien Mazurie,et al.  Gene networks inference using dynamic Bayesian networks , 2003, ECCB.

[28]  Tong Zhou,et al.  Sensitivity Penalization Based Robust State Estimation for Uncertain Linear Systems , 2010, IEEE Transactions on Automatic Control.

[29]  Stuart A. Kauffman,et al.  The origins of order , 1993 .

[30]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.

[31]  Hazem N. Nounou,et al.  Inferring Gene Regulatory Networks via Nonlinear State-Space Models and Exploiting Sparsity , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[32]  Jean-Loup Faulon,et al.  Boolean dynamics of genetic regulatory networks inferred from microarray time series data , 2007, Bioinform..

[33]  L. Guarente,et al.  Regulation of the yeast CYT1 gene encoding cytochrome c1 by HAP1 and HAP2/3/4. , 1991, Molecular and cellular biology.

[34]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[35]  M. Xiong,et al.  Extended Kalman Filter for Estimation of Parameters in Nonlinear State-Space Models of Biochemical Networks , 2008, PloS one.

[36]  Kevin Kontos,et al.  Information-Theoretic Inference of Large Transcriptional Regulatory Networks , 2007, EURASIP J. Bioinform. Syst. Biol..

[37]  P. Kumar,et al.  Theory and practice of recursive identification , 1985, IEEE Transactions on Automatic Control.

[38]  Satoru Miyano,et al.  Identification of Genetic Networks from a Small Number of Gene Expression Patterns Under the Boolean Network Model , 1998, Pacific Symposium on Biocomputing.