A reverse-engineering approach to dissect post-translational modulators of transcription factor’s activity from transcriptional data

Background: Transcription factors (TFs) act downstream of the major signalling pathways, functioning as master regulators of cell fate. Their activity is tightly regulated at the transcriptional, post-transcriptional and post-translational levels. Proteins that modify TF activity are not easily identified by experimental high-throughput methods.

Results: We developed a computational strategy, called Differential Multi-Information (DMI), to infer post-translational modulators of a transcription factor from a compendium of gene expression profiles (GEPs). DMI is built on the hypothesis that a modulator of a TF (e.g. a kinase or phosphatase), when expressed in the cell, will cause the TF target genes to be co-expressed. Conversely, when the modulator is not expressed, the TF will be inactive, resulting in a loss of co-regulation across its target genes. DMI detects such changes in target gene co-regulation for each candidate modulator, using a measure called Multi-Information. We validated the DMI approach on a compendium of 5,372 GEPs, showing its ability to correctly identify kinases regulating the activity of 14 different transcription factors.

Conclusions: DMI can be used in combination with experimental approaches such as high-throughput screening to improve both pathway and target discovery. An online web tool that lets users apply DMI to identify post-translational modulators of a transcription factor of interest can be found at http://dmi.tigem.it.
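The core idea, comparing how strongly a TF's target genes co-vary when a candidate modulator is highly expressed versus lowly expressed, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which Multi-Information estimator is used, so the sketch assumes jointly Gaussian expression values, for which multi-information reduces to minus one half of the log-determinant of the correlation matrix. The function names, the tertile split (`frac`), and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np


def gaussian_multi_information(X):
    """Multi-information (total correlation) of the columns of X.

    ASSUMPTION: the data are treated as jointly Gaussian, so the value is
    -0.5 * log det(R), with R the sample correlation matrix.
    X has shape (samples, genes).
    """
    R = np.corrcoef(X, rowvar=False)
    _sign, logdet = np.linalg.slogdet(R)
    return -0.5 * logdet


def dmi_score(targets, modulator, frac=1 / 3):
    """Difference in target co-regulation between samples where the
    candidate modulator is highly vs. lowly expressed.

    targets:   (samples, genes) expression of the TF's target genes
    modulator: (samples,) expression of the candidate modulator
    frac:      hypothetical choice: fraction of samples in each group
    """
    order = np.argsort(modulator)
    k = int(len(modulator) * frac)
    low, high = order[:k], order[-k:]
    return (gaussian_multi_information(targets[high])
            - gaussian_multi_information(targets[low]))


def demo(seed=0, n_samples=600, n_targets=5):
    """Synthetic check: a true modulator vs. an unrelated gene."""
    rng = np.random.default_rng(seed)
    modulator = rng.uniform(0.0, 1.0, n_samples)
    unrelated = rng.uniform(0.0, 1.0, n_samples)
    tf_signal = rng.normal(size=n_samples)
    # The TF drives its targets only when the modulator is expressed.
    active = (modulator > 0.5)[:, None]
    noise = rng.normal(scale=0.5, size=(n_samples, n_targets))
    targets = active * tf_signal[:, None] + noise
    return dmi_score(targets, modulator), dmi_score(targets, unrelated)


if __name__ == "__main__":
    true_score, null_score = demo()
    print(f"true modulator: {true_score:.3f}")
    print(f"unrelated gene: {null_score:.3f}")
```

In this toy setting the true modulator should receive a clearly positive score, while an unrelated gene should score near zero. On real expression data, a nonparametric estimator and a significance test would probably be needed in place of the Gaussian shortcut.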
