Predicting state transitions in the transcriptome and metabolome using a linear dynamical system model

BackgroundModelling of time series data should not be an approximation of input data profiles, but rather be able to detect and evaluate dynamical changes in the time series data. Objective criteria that can be used to evaluate dynamical changes in data are therefore important to filter experimental noise and to enable extraction of unexpected, biologically important information.ResultsHere we demonstrate the effectiveness of a Markov model, named the Linear Dynamical System, to simulate the dynamics of a transcript or metabolite time series, and propose a probabilistic index that enables detection of time-sensitive changes. This method was applied to time series datasets from Bacillus subtilis and Arabidopsis thaliana grown under stress conditions; in the former, only gene expression was studied, whereas in the latter, both gene expression and metabolite accumulation. Our method not only identified well-known changes in gene expression and metabolite accumulation, but also detected novel changes that are likely to be responsible for each stress response condition.ConclusionThis general approach can be applied to any time-series data profile from which one wishes to identify elements responsible for state transitions, such as rapid environmental adaptation by an organism.

[1]  M. Hirai,et al.  Elucidation of Gene-to-Gene and Metabolite-to-Gene Networks in Arabidopsis by Integration of Metabolomics and Transcriptomics* , 2005, Journal of Biological Chemistry.

[2]  Yasuo Kobayashi,et al.  Characterization of a New Sigma-K-Dependent Peptidoglycan Hydrolase Gene That Plays a Role in Bacillus subtilis Mother Cell Lysis , 1999, Journal of bacteriology.

[3]  Taesung Park,et al.  Boolean networks using the chi-square test for inferring large-scale gene regulatory networks , 2006, BMC Bioinformatics.

[4]  R. E. Kalman,et al.  New Results in Linear Filtering and Prediction Theory , 1961 .

[5]  Satoru Miyano,et al.  Algorithms for identifying Boolean networks and related biological networks based on matrix multiplication and fingerprint function , 2000, RECOMB '00.

[6]  S. Kanaya,et al.  Analysis of codon usage diversity of bacterial genes with a self-organizing map (SOM): characterization of horizontally transferred genes with emphasis on the E. coli O157 genome. , 2001, Gene.

[7]  L. W. Nichols Optical Filtering , 1959, Proceedings of the IRE.

[8]  David J. Galas,et al.  Dynamic models of gene expression and classification , 2001, Functional & Integrative Genomics.

[9]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[10]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[11]  Hiroyuki Toh,et al.  Inference of a genetic network by a combined approach of cluster analysis and graphical Gaussian modeling , 2002, Bioinform..

[12]  Satoru Miyano,et al.  Estimating gene networks from gene expression data by combining Bayesian network model with promoter element detection , 2003, ECCB.

[13]  D. Goodenowe,et al.  Nontargeted metabolome analysis by use of Fourier Transform Ion Cyclotron Mass Spectrometry. , 2002, Omics : a journal of integrative biology.

[14]  Oliver Fiehn,et al.  Systems Rebalancing of Metabolism in Response to Sulfur Deprivation, as Revealed by Metabolome Analysis of Arabidopsis Plants1[w] , 2005, Plant Physiology.