An Extended Kalman Filtering Approach to Modeling Nonlinear Dynamic Gene Regulatory Networks via Short Gene Expression Time Series

In this paper, the extended Kalman filter (EKF) algorithm is applied to model gene regulatory networks from gene expression time-series data. The gene regulatory network is treated as a nonlinear dynamic stochastic model consisting of a gene measurement equation and a gene regulation equation. After specifying the model structure, we apply the EKF algorithm to identify both the model parameters and the actual values of the gene expression levels. We show that the EKF, as an online estimation algorithm, can identify a large number of parameters (including the parameters of nonlinear functions) through an iterative procedure using only a small number of observations. Four real-world gene expression data sets are employed to demonstrate the effectiveness of the EKF algorithm, and the obtained models are evaluated from a bioinformatics viewpoint.
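
The joint estimation of parameters and expression levels described above can be illustrated with a minimal sketch. The abstract does not give the model equations, so everything concrete here is an assumption made for illustration: a single gene, a regulation equation of the form x[k+1] = a*x[k] + b*sigmoid(x[k]) + noise, a direct noisy measurement y[k] = x[k] + noise, and the hypothetical names `f`, `jacobian_f`, `ekf_step`, `run_ekf`, and `simulate`. The unknown parameters a and b are appended to the state and treated as constants, which is one common way to let an EKF estimate them alongside the state.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def f(z):
    """Augmented transition (assumed form): z = [x, a, b].
    x follows the regulation equation; a and b are modeled as constants."""
    x, a, b = z
    return np.array([a * x + b * sigmoid(x), a, b])

def jacobian_f(z):
    """Jacobian of f with respect to the augmented state [x, a, b]."""
    x, a, b = z
    s = sigmoid(x)
    return np.array([
        [a + b * s * (1.0 - s), x, s],
        [0.0, 1.0, 0.0],
        [0.0, 0.0, 1.0],
    ])

# Measurement equation (assumed): only the expression level x is observed.
H = np.array([[1.0, 0.0, 0.0]])

def ekf_step(z, P, y, Q, R):
    """One EKF iteration: predict with the linearized model, then update with y."""
    F = jacobian_f(z)
    z_pred = f(z)
    P_pred = F @ P @ F.T + Q
    S = H @ P_pred @ H.T + R                 # innovation covariance, shape (1, 1)
    K = P_pred @ H.T @ np.linalg.inv(S)      # Kalman gain, shape (3, 1)
    innovation = np.atleast_1d(y) - H @ z_pred
    z_new = z_pred + (K @ innovation).ravel()
    P_new = (np.eye(3) - K @ H) @ P_pred
    return z_new, P_new

def run_ekf(ys, z0, P0, Q, R):
    """Process the observations one at a time (online) and return all estimates."""
    z, P = np.asarray(z0, dtype=float), np.asarray(P0, dtype=float)
    history = []
    for y in ys:
        z, P = ekf_step(z, P, y, Q, R)
        history.append(z.copy())
    return np.array(history), P

def simulate(n, a, b, x0, q_std, r_std, seed=0):
    """Generate a short synthetic series from the assumed model (not real data)."""
    rng = np.random.default_rng(seed)
    x, ys = x0, []
    for _ in range(n):
        x = a * x + b * sigmoid(x) + q_std * rng.standard_normal()
        ys.append(x + r_std * rng.standard_normal())
    return np.array(ys)

# Usage on a short synthetic series of 20 points.
ys = simulate(n=20, a=0.6, b=0.8, x0=0.1, q_std=0.05, r_std=0.1)
Q = np.diag([0.05**2, 1e-6, 1e-6])   # small process noise on the parameters
R = np.array([[0.1**2]])
history, P_final = run_ekf(ys, z0=[0.0, 0.5, 0.5], P0=np.eye(3), Q=Q, R=R)
print("final estimate [x, a, b]:", history[-1])
```

With so few observations the parameter estimates depend strongly on the initial guess `z0`, the initial covariance `P0`, and the noise levels `Q` and `R`, so these values should be read as tuning choices for the sketch rather than recommendations.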
