TF-centered downstream gene set enrichment analysis: Inference of causal regulators by integrating TF-DNA interactions and protein post-translational modifications information

BackgroundInference of causal regulators responsible for gene expression changes under different conditions is of great importance but remains rather challenging. To date, most approaches use direct binding targets of transcription factors (TFs) to associate TFs with expression profiles. However, the low overlap between binding targets of a TF and the affected genes of the TF knockout limits the power of those methods.ResultsWe developed a TF-centered downstream gene set enrichment analysis approach to identify potential causal regulators responsible for expression changes. We constructed hierarchical and multi-layer regulation models to derive possible downstream gene sets of a TF using not only TF-DNA interactions, but also, for the first time, post-translational modifications (PTM) information. We verified our method in one expression dataset of large-scale TF knockout and another dataset involving both TF knockout and TF overexpression. Compared with the flat model using TF-DNA interactions alone, our method correctly identified five more actual perturbed TFs in large-scale TF knockout data and six more perturbed TFs in overexpression data. Potential regulatory pathways downstream of three perturbed regulators— SNF1, AFT1 and SUT1 —were given to demonstrate the power of multilayer regulation models integrating TF-DNA interactions and PTM information. Additionally, our method successfully identified known important TFs and inferred some novel potential TFs involved in the transition from fermentative to glycerol-based respiratory growth and in the pheromone response. Downstream regulation pathways of SUT1 and AFT1 were also supported by the mRNA and/or phosphorylation changes of their mediating TFs and/or “modulator” proteins.ConclusionsThe results suggest that in addition to direct transcription, indirect transcription and post-translational regulation are also responsible for the effects of TFs perturbation, especially for TFs overexpression. Many TFs inferred by our method are supported by literature. Multiple TF regulation models could lead to new hypotheses for future experiments. Our method provides a valuable framework for analyzing gene expression data to identify causal regulators in the context of TF-DNA interactions and PTM information.

[1]  Michael R. Green,et al.  Dissecting the Regulatory Circuitry of a Eukaryotic Genome , 1998, Cell.

[2]  Christian J Stoeckert,et al.  Clustering of genes into regulons using integrated modeling-COGRIM , 2007, Genome Biology.

[3]  M Aldea,et al.  The AFT1 Transcriptional Factor is Differentially Required for Expression of High‐Affinity Iron Uptake Genes in Saccharomyces cerevisiae , 1997, Yeast.

[4]  F. Cross,et al.  Ste12 and Mcm1 regulate cell cycle-dependent transcription of FAR1 , 1996, Molecular and cellular biology.

[5]  H. Boucherie,et al.  The Snf1 Protein Kinase Controls the Induction of Genes of the Iron Uptake Pathway at the Diauxic Shift in Saccharomyces cerevisiae* , 2003, Journal of Biological Chemistry.

[6]  Xiang-Jun Lu,et al.  Inferring Condition-Specific Modulation of Transcription Factor Activity in Yeast through Regulon-Based Analysis of Genomewide Expression , 2008, PloS one.

[7]  Li Wang,et al.  An integrative approach for causal gene identification and gene regulatory pathway inference , 2006, ISMB.

[8]  Sridhar Hannenhalli,et al.  PTM-Switchboard—a database of posttranslational modifications of transcription factors, the mediating enzymes and target genes , 2008, Nucleic Acids Res..

[9]  Tommi S. Jaakkola,et al.  Physical Network Models , 2004, J. Comput. Biol..

[10]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[11]  J. Winderickx,et al.  Inferring transcriptional modules from ChIP-chip, motif and microarray data , 2006, Genome Biology.

[12]  Nicola J. Rinaldi,et al.  Control of Pancreas and Liver Gene Expression by HNF Transcription Factors , 2004, Science.

[13]  Trey Ideker,et al.  Integrated Assessment and Prediction of Transcription Factor Binding , 2006, PLoS Comput. Biol..

[14]  Steve Hoberman,et al.  A computational model to define the molecular causes of type 2 diabetes mellitus. , 2005, Diabetes technology & therapeutics.

[15]  Weidong Tian,et al.  FuncBase : a resource for quantitative gene function annotation , 2010, Bioinform..

[16]  David C Fargo,et al.  Using ChIP-chip and ChIP-seq to study the regulation of gene expression: genome-wide localization studies reveal widespread regulation of transcription elongation. , 2009, Methods.

[17]  Jonathan Schug,et al.  Glucocorticoid Receptor-Dependent Gene Regulatory Networks , 2005, PLoS genetics.

[18]  S. Fields,et al.  A novel genetic system to detect protein–protein interactions , 1989, Nature.

[19]  M. Gerstein,et al.  Complex transcriptional circuitry at the G1/S transition in Saccharomyces cerevisiae. , 2002, Genes & development.

[20]  S. J. Deminoff,et al.  Rap1p requires Gcr1p and Gcr2p homodimers to activate ribosomal protein and glycolytic genes, respectively. , 2001, Genetics.

[21]  BMC Bioinformatics , 2005 .

[22]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[23]  Patrick J. Killion,et al.  Genetic reconstruction of a functional transcriptional regulatory network , 2007, Nature Genetics.

[24]  Toshio Tsukiyama,et al.  Regulated displacement of TBP from the PHO8 promoter in vivo requires Cbf1 and the Isw1 chromatin remodeling complex. , 2003, Molecular cell.

[25]  Bor-Sen Chen,et al.  Computational reconstruction of transcriptional regulatory modules of the yeast cell cycle , 2006, BMC Bioinformatics.

[26]  Haiyan Hu,et al.  An efficient algorithm to identify coordinately activated transcription factors. , 2010, Genomics.

[27]  George G. Roberts,et al.  Transcriptome profiling of Saccharomyces cerevisiae during a transition from fermentative to glycerol-based respiratory growth reveals extensive metabolic and structural remodeling , 2006, Molecular Genetics and Genomics.

[28]  Wei-Sheng Wu,et al.  Identifying gene regulatory modules of heat shock response in yeast , 2008, BMC Genomics.

[29]  Feng Gao,et al.  Defining transcriptional networks through integrative modeling of mRNA expression and transcription factor binding data , 2004, BMC Bioinformatics.

[30]  T. Jaakkola,et al.  Validation and refinement of gene-regulatory pathways on a network of physical interactions , 2005, Genome Biology.

[31]  Nicola J. Rinaldi,et al.  Computational discovery of gene modules and regulatory networks , 2003, Nature Biotechnology.

[32]  Katsuhiko Shirahige,et al.  Yeast Pho85 kinase is required for proper gene expression during the diauxic shift , 2004, Yeast.

[33]  Yonina C. Eldar,et al.  eQED: an efficient method for interpreting eQTL associations using protein networks , 2008, Molecular systems biology.

[34]  Carito Guziolowski,et al.  Inferring the role of transcription factors in regulatory networks , 2007, BMC Bioinformatics.

[35]  K. Williams,et al.  The eukaryotic response regulator Skn7p regulates calcineurin signaling through stabilization of Crz1p , 2001, The EMBO journal.

[36]  Bruce Futcher,et al.  The G1 Cyclin Cln3 Promotes Cell Cycle Entry via the Transcription Factor Swi6 , 2002, Molecular and Cellular Biology.

[37]  G. Santangelo,et al.  Glucose Signaling in Saccharomyces cerevisiae , 2006, Microbiology and Molecular Biology Reviews.

[38]  J. Hahn,et al.  Activation of the Saccharomyces cerevisiae Heat Shock Transcription Factor Under Glucose Starvation Conditions by Snf1 Protein Kinase* , 2004, Journal of Biological Chemistry.

[39]  M. Carlson,et al.  Yap1 Accumulates in the Nucleus in Response to Carbon Stress in Saccharomyces cerevisiae , 2003, Eukaryotic Cell.

[40]  Mitsuhiro Shimizu,et al.  Transcriptional Repression by the Pho4 Transcription Factor Controls the Timing of SNZ1 Expression , 2008, Eukaryotic Cell.

[41]  Yoshiharu Inoue,et al.  Activity of the Yap1 Transcription Factor in Saccharomyces cerevisiae Is Modulated by Methylglyoxal, a Metabolite Derived from Glycolysis , 2004, Molecular and Cellular Biology.

[42]  I. Simon,et al.  Backup in gene regulatory networks explains differences between binding and knockout results , 2009, Molecular systems biology.

[43]  R. Sharan,et al.  An initial blueprint for myogenic differentiation. , 2005, Genes & development.

[44]  M. Carlson,et al.  Yeast SNF1 protein kinase interacts with SIP4, a C6 zinc cluster transcriptional activator: a new role for SNF1 in the glucose response , 1996, Molecular and cellular biology.

[45]  Sean R. Collins,et al.  Functional Organization of the S. cerevisiae Phosphorylation Network , 2009, Cell.

[46]  Esko Ukkonen,et al.  Correlating gene promoters and expression in gene disruption experiments , 2002, ECCB.

[47]  Gustav Ammerer,et al.  A dual role for PP1 in shaping the Msn2‐dependent transcriptional response to glucose starvation , 2005, The EMBO journal.

[48]  T. Ideker,et al.  Supporting Online Material for A Systems Approach to Mapping DNA Damage Response Pathways , 2006 .

[49]  Roded Sharan,et al.  SPINE: a framework for signaling-regulatory pathway inference from cause-effect experiments , 2007, ISMB/ECCB.

[50]  R. Zitomer,et al.  Genetic Factors That Regulate the Attenuation of the General Stress Response of Yeast , 2005, Genetics.

[51]  J. François,et al.  Combinatorial control by the protein kinases PKA, PHO85 and SNF1 of transcriptional induction of the Saccharomyces cerevisiae GSY2 gene at the diauxic shift , 2004, Molecular Genetics and Genomics.

[52]  Charles Boone,et al.  Identifying transcription factor functions and targets by phenotypic activation , 2006, Proceedings of the National Academy of Sciences.

[53]  T. Hughes,et al.  Exploration of Essential Gene Functions via Titratable Promoter Alleles , 2004, Cell.

[54]  X. Zheng,et al.  Convergence of TOR-Nitrogen and Snf1-Glucose Signaling Pathways onto Gln3 , 2002, Molecular and Cellular Biology.

[55]  Tae-Min Kim,et al.  Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis , 2007, BMC Bioinformatics.

[56]  Valmik K. Vyas,et al.  Interaction of the Srb10 Kinase with Sip4, a Transcriptional Activator of Gluconeogenic Genes in Saccharomyces cerevisiae , 2001, Molecular and Cellular Biology.

[57]  Martin Vingron,et al.  PASTAA: identifying transcription factors associated with sets of co-regulated genes , 2008, Bioinform..

[58]  R. Young,et al.  A common set of gene regulatory networks links metabolism and growth inhibition. , 2004, Molecular cell.

[59]  T. Hughes,et al.  Signaling and circuitry of multiple MAPK pathways revealed by a matrix of global gene expression profiles. , 2000, Science.

[60]  M. Mann,et al.  Quantitative Phosphoproteomics Applied to the Yeast Pheromone Signaling Pathway*S , 2005, Molecular & Cellular Proteomics.