Using a state-space model with hidden variables to infer transcription factor activities

MOTIVATION In a gene regulatory network, genes are typically regulated by transcription factors (TFs). Transcription factor activity (TFA) is more difficult to measure than gene expression levels are. Other models have extracted information about TFA from gene expression data, but without explicitly modeling feedback from the genes. We present a state-space model (SSM) with hidden variables. The hidden variables include regulatory motifs in the gene network, such as feedback loops and auto-regulation, making SSM a useful complement to existing models. RESULTS A gene regulatory network incorporating, for example, feed-forward loops, auto-regulation and multiple-inputs was constructed with an SSM model. First, the gene expression data were simulated by SSM and used to infer the TFAs. The ability of SSM to infer TFAs was evaluated by comparing the profiles of the inferred and simulated TFAs. Second, SSM was applied to gene expression data obtained from Escherichia coli K12 undergoing a carbon source transition and from the Saccharomyces cerevisiae cell cycle. The inferred activity profile for each TF was validated either by measurement or by activity information from the literature. The SSM model provides a probabilistic framework to simulate gene regulatory networks and to infer activity profiles of hidden variables. AVAILABILITY Supplementary data and Matlab code will be made available at the URL below. SUPPLEMENTARY INFORMATION http://www.chems.msu.edu/groups/chan/ssm.zip.

[1]  Zoubin Ghahramani,et al.  A Bayesian approach to reconstructing genetic regulatory networks with hidden factors , 2005, Bioinform..

[2]  Aurélien Mazurie,et al.  Gene networks inference using dynamic Bayesian networks , 2003, ECCB.

[3]  Kevin Murphy,et al.  Bayes net toolbox for Matlab , 1999 .

[4]  Nir Friedman,et al.  Inferring quantitative models of regulatory networks from expression data , 2004, ISMB/ECCB.

[5]  F. Cross,et al.  Ste12 and Mcm1 regulate cell cycle-dependent transcription of FAR1 , 1996, Molecular and cellular biology.

[6]  K Nasmyth,et al.  EGT2 gene transcription is induced predominantly by Swi5 in early G1 , 1996, Molecular and cellular biology.

[7]  Julio Collado-Vides,et al.  RegulonDB (version 3.2): transcriptional regulation and operon organization in Escherichia coli K-12 , 2001, Nucleic Acids Res..

[8]  Michel Kerszberg,et al.  Noise, delays, robustness, canalization and all that. , 2004, Current opinion in genetics & development.

[9]  B. Andrews,et al.  Regulation of Cell Cycle Transcription Factor Swi4 through Auto-Inhibition of DNA Binding , 1999, Molecular and Cellular Biology.

[10]  James C Liao,et al.  A Global Regulatory Role of Gluconeogenic Genes in Escherichia coli Revealed by Transcriptome Network Analysis* , 2005, Journal of Biological Chemistry.

[11]  L. Breeden,et al.  A novel Mcm1-dependent element in the SWI4, CLN3, CDC6, and CDC47 promoters activates M/G1-specific transcription. , 1997, Genes & development.

[12]  Katy C. Kao,et al.  Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[14]  Kevin Murphy,et al.  Modelling Gene Expression Data using Dynamic Bayesian Networks , 2006 .

[15]  John D. Storey,et al.  Precision and functional specificity in mRNA decay , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[16]  A. Arkin,et al.  Stochastic mechanisms in gene expression. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[17]  J. Collins,et al.  Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks , 2005, Nature Biotechnology.

[18]  Zoubin Ghahramani,et al.  Modeling T-cell activation using gene expression profiling and state-space models , 2004, Bioinform..

[19]  T Nyström,et al.  Bacterial defense against aging: role of the Escherichia coli ArcA regulator in gene expression, readjusted energy flux and survival during stasis. , 1996, The EMBO journal.

[20]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[21]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[22]  Katy C. Kao,et al.  gNCA: a framework for determining transcription factor activity based on transcriptome: identifiability and numerical implementation. , 2005, Metabolic engineering.

[23]  Ambuj K. Singh,et al.  Deriving phylogenetic trees from the similarity analysis of metabolic pathways , 2003, ISMB.

[24]  Julio Collado-Vides,et al.  RegulonDB (version 4.0): transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12 , 2004, Nucleic Acids Res..

[25]  Tommi S. Jaakkola,et al.  Combining Location and Expression Data for Principled Discovery of Genetic Regulatory Network Models , 2001, Pacific Symposium on Biocomputing.

[26]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[27]  N. Monk Oscillatory Expression of Hes1, p53, and NF-κB Driven by Transcriptional Time Delays , 2003, Current Biology.

[28]  Chiara Sabatti,et al.  Network component analysis: Reconstruction of regulatory signals in biological systems , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Paola Sebastiani,et al.  Cluster analysis of gene expression dynamics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Tommi S. Jaakkola,et al.  A new approach to analyzing gene expression time series data , 2002, RECOMB '02.

[31]  L. Breeden,et al.  Early Cell Cycle BoxMediated BoxMediated BoxMediated Transcription of CLN 3 and SWI 4 Contributes to the Proper Timing of the G 1toS 1toS 1toS toS toS Transition in Budding Yeast , 2001 .

[32]  L. Johnston,et al.  Swi5 controls a novel wave of cyclin synthesis in late mitosis. , 1998, Molecular biology of the cell.

[33]  David Page,et al.  Modelling regulatory pathways in E. coli from time series expression profiles , 2002, ISMB.