Large-scale learning of combinatorial transcriptional dynamics from gene expression

MOTIVATION Knowledge of the activation patterns of transcription factors (TFs) is fundamental to elucidate the dynamics of gene regulation in response to environmental conditions. Direct experimental measurement of TFs' activities is, however, challenging, resulting in a need to develop statistical tools to infer TF activities from mRNA expression levels of target genes. Current models, however, neglect important features of transcriptional regulation; in particular, the combinatorial nature of regulation, which is fundamental for signal integration, is not accounted for. RESULTS We present a novel method to infer combinatorial regulation of gene expression by multiple transcription factors in large-scale transcriptional regulatory networks. The method implements a factorial hidden Markov model with a non-linear likelihood to represent the interactions between the hidden transcription factors. We explore our model's performance on artificial datasets and demonstrate the applicability of our method on genome-wide scale for three expression datasets. The results obtained using our model are biologically coherent and provide a tool to explore the concealed nature of combinatorial transcriptional regulation. AVAILABILITY http://homepages.inf.ed.ac.uk/gsanguin/software.html.

[1]  Mahesan Niranjan,et al.  Reducing the algorithmic variability in transcriptome-based inference , 2010, Bioinform..

[2]  Chiara Sabatti,et al.  Bayesian sparse hidden components analysis for transcription regulation networks , 2005, Bioinform..

[3]  Neil D. Lawrence,et al.  Probabilistic inference of transcription factor concentrations and gene-specific regulatory activities , 2006, Bioinform..

[4]  T. Cooper,et al.  Roles of the Dal82p Domains in Allophanate/Oxalurate-dependent Gene Expression inSaccharomyces cerevisiae * , 2000, The Journal of Biological Chemistry.

[5]  Neil D. Lawrence,et al.  Modelling transcriptional regulation using Gaussian Processes , 2006, NIPS.

[6]  M. Barenco,et al.  Ranked prediction of p53 targets using hidden variable dynamic modeling , 2006, Genome Biology.

[7]  Raya Khanin,et al.  Bayesian model-based inference of transcription factor activity , 2007, BMC Bioinformatics.

[8]  Matthew J. Beal Variational algorithms for approximate Bayesian inference , 2003 .

[9]  Guido Sanguinetti,et al.  Transition of Escherichia coli from Aerobic to Micro-aerobic Conditions Involves Fast and Slow Reacting Regulatory Components* , 2007, Journal of Biological Chemistry.

[10]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[11]  A. Kudlicki,et al.  Logic of the Yeast Metabolic Cycle: Temporal Compartmentalization of Cellular Processes , 2005, Science.

[12]  References , 1971 .

[13]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[14]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[15]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[16]  Andreas Ruttor,et al.  Switching regulatory models of cellular stress response , 2009, Bioinform..

[17]  D. Stillman,et al.  Mutations in the Pho2 (Bas2) Transcription Factor That Differentially Affect Activation with Its Partner Proteins Bas1, Pho4, and Swi5* , 2002, The Journal of Biological Chemistry.

[18]  Neil D. Lawrence,et al.  TFInfer: a tool for probabilistic inference of transcription factor activities , 2010, Bioinform..

[19]  B. Daignan-Fornier,et al.  Coregulation of purine and histidine biosynthesis by the transcriptional activators BAS1 and BAS2. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[21]  Zhaojun Xu,et al.  Role of Gts1p in regulation of energy‐metabolism oscillation in continuous cultures of the yeast Saccharomyces cerevisiae , 2007, Yeast.

[22]  T. Cooper,et al.  Gat1p, a GATA family protein whose production is sensitive to nitrogen catabolite repression, participates in transcriptional activation of nitrogen-catabolic genes in Saccharomyces cerevisiae , 1996, Molecular and cellular biology.

[23]  Tom M. Mitchell,et al.  A Combined Expression-Interaction Model for Inferring the Temporal Activity of Transcription Factors , 2009, J. Comput. Biol..

[24]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[25]  Chiara Sabatti,et al.  Network component analysis: Reconstruction of regulatory signals in biological systems , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Guido Sanguinetti,et al.  Carbon Monoxide-releasing Antibacterial Molecules Target Respiration and Global Transcriptional Regulators* , 2009, Journal of Biological Chemistry.

[27]  Guido Sanguinetti,et al.  Learning combinatorial transcriptional dynamics from gene expression data , 2010, Bioinform..

[28]  Y. Xing,et al.  Mutations in yeast HAP2/HAP3 define a hybrid CCAAT box binding domain. , 1993, The EMBO journal.

[29]  L. Guarente,et al.  The HAP3 regulatory locus of Saccharomyces cerevisiae encodes divergent overlapping transcripts , 1988, Molecular and cellular biology.

[30]  Michael I. Jordan,et al.  Factorial Hidden Markov Models , 1995, Machine Learning.