An Extended Kalman Filtering Approach to Modeling Nonlinear Dynamic Gene Regulatory Networks via Short Gene Expression Time Series

In this paper, the extended Kalman filter (EKF) algorithm is applied to model gene regulatory networks from gene expression time series data. The gene regulatory network is treated as a nonlinear dynamic stochastic model consisting of a gene measurement equation and a gene regulation equation. After specifying the model structure, we apply the EKF algorithm to identify both the model parameters and the actual values of the gene expression levels. The EKF is shown to be an online estimation algorithm that can identify a large number of parameters (including the parameters of nonlinear functions) through an iterative procedure using only a small number of observations. Four real-world gene expression data sets are employed to demonstrate the effectiveness of the EKF algorithm, and the obtained models are evaluated from the viewpoint of bioinformatics.
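The idea of identifying parameters and expression levels together can be illustrated with a minimal sketch of joint state and parameter estimation by an EKF. Everything specific here is an assumption made for illustration and is not taken from the paper: a single hypothetical gene with regulation equation x_{k+1} = a * sigmoid(x_k) + w_k, measurement equation y_k = x_k + v_k, the unknown weight a appended to the state vector, and the noise variances `q` and `r`. The function names (`simulate`, `ekf_step`, `run_ekf`) are likewise hypothetical.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def simulate(a_true, x0, n_steps, noise_std=0.0, seed=0):
    """Toy data: y_k = x_k + v_k, with x_{k+1} = a_true * sigmoid(x_k)."""
    rng = random.Random(seed)
    x, ys = x0, []
    for _ in range(n_steps):
        ys.append(x + rng.gauss(0.0, noise_std))
        x = a_true * sigmoid(x)
    return ys


def ekf_step(z, P, y, q, r):
    """One EKF predict/update step on the augmented state z = [x, a].

    P is the 2x2 covariance, q = (q_x, q_a) the assumed process noise
    variances, r the assumed measurement noise variance.
    """
    x, a = z
    s = sigmoid(x)
    # Predict: propagate the state; the parameter is modeled as constant.
    z_pred = [a * s, a]
    # Jacobian of the augmented transition with respect to [x, a].
    F = [[a * s * (1.0 - s), s], [0.0, 1.0]]
    FP = [[sum(F[i][k] * P[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    P_pred = [[sum(FP[i][k] * F[j][k] for k in range(2))
               + (q[i] if i == j else 0.0) for j in range(2)]
              for i in range(2)]
    # Update: only x is measured, so H = [1, 0].
    innovation = y - z_pred[0]
    S = P_pred[0][0] + r
    K = [P_pred[0][0] / S, P_pred[1][0] / S]
    z_new = [z_pred[i] + K[i] * innovation for i in range(2)]
    P_new = [[P_pred[i][j] - K[i] * P_pred[0][j] for j in range(2)]
             for i in range(2)]
    return z_new, P_new


def run_ekf(ys, a_init=1.0, q=(0.01, 1e-4), r=0.01):
    """Filter a short series; returns the final [x, a] estimate and covariance."""
    z = [ys[0], a_init]
    P = [[1.0, 0.0], [0.0, 1.0]]
    for y in ys[1:]:
        z, P = ekf_step(z, P, y, q, r)
    return z, P


if __name__ == "__main__":
    observations = simulate(a_true=2.0, x0=0.5, n_steps=11, noise_std=0.05)
    (x_est, a_est), _ = run_ekf(observations)
    print("estimated expression level:", x_est)
    print("estimated regulation weight:", a_est)
```

Because each observation is processed once and then discarded, the estimate is refined online. A full network would replace the scalar state with a vector of genes and stack all regulation weights into the augmented state in the same way.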
