Systematic identification of transcriptional regulatory modules from protein–protein interaction networks

Transcription factors (TFs) combine with co-factors to form transcriptional regulatory modules (TRMs) that regulate gene expression programs with spatiotemporal specificity. Here we present a novel and generic method (rTRM) for the reconstruction of TRMs that integrates genomic information from TF binding, cell type-specific gene expression and protein–protein interactions. rTRM was applied to reconstruct the TRMs specific for embryonic stem cells (ESC) and hematopoietic stem cells (HSC), neural progenitor cells, trophoblast stem cells and distinct types of terminally differentiated CD4+ T cells. The ESC and HSC TRM predictions were highly precise, yielding 77 and 96 proteins, of which ∼75% have been independently shown to be involved in the regulation of these cell types. Furthermore, rTRM successfully identified a large number of bridging proteins with known roles in ESCs and HSCs, which could not have been identified using genomic approaches alone, as they lack the ability to bind specific DNA sequences. This highlights the advantage of rTRM over other methods that ignore PPI information, as proteins need to interact with other proteins to form complexes and perform specific functions. The prediction and experimental validation of the co-factors that endow master regulatory TFs with the capacity to select specific genomic sites, modulate the local epigenetic profile and integrate multiple signals will provide important mechanistic insights not only into how such TFs operate, but also into abnormal transcriptional states leading to disease.

[1]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2011 update , 2010, Nucleic Acids Res..

[2]  H. Schöler,et al.  Binding of Sp1 and Sp3 transcription factors to the Oct-4 gene promoter. , 1999, Cellular and molecular biology.

[3]  N. D. Clarke,et al.  Integration of External Signaling Pathways with the Core Transcriptional Network in Embryonic Stem Cells , 2008, Cell.

[4]  Juan M. Vaquerizas,et al.  Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities. , 2010, Genome research.

[5]  P. Cahan,et al.  The transcriptional landscape of hematopoietic stem cell ontogeny. , 2012, Cell stem cell.

[6]  Robert Gentleman,et al.  Using GOstats to test gene lists for GO term association , 2007, Bioinform..

[7]  D. Miranda-Saavedra,et al.  The IL-10/STAT3-mediated anti-inflammatory response: recent developments and future challenges , 2013, Briefings in functional genomics.

[8]  Warren Strober,et al.  Interactions among the transcription factors Runx1, RORγt and Foxp3 regulate the differentiation of interleukin 17–producing T cells , 2008, Nature Immunology.

[9]  Elisa Laurenti,et al.  Hematopoiesis: a human perspective. , 2012, Cell stem cell.

[10]  Bor-Sen Chen,et al.  Computational reconstruction of transcriptional regulatory modules of the yeast cell cycle , 2006, BMC Bioinformatics.

[11]  Raja Jothi,et al.  Genome-wide analyses of transcription factor GATA3-mediated gene regulation in distinct T cell types. , 2011, Immunity.

[12]  Li Chai,et al.  Sall4 modulates embryonic stem cell pluripotency and early embryonic development by the transcriptional regulation of Pou5f1 , 2006, Nature Cell Biology.

[13]  Raja Jothi,et al.  Nuclear adaptor Ldb1 regulates a transcriptional program essential for the maintenance of hematopoietic stem cells , 2011, Nature Immunology.

[14]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[15]  Martin C. Frith,et al.  Inferring transcription factor complexes from ChIP-seq data , 2011, Nucleic acids research.

[16]  P. Robson,et al.  Co‐Motif Discovery Identifies an Esrrb‐Sox2‐DNA Ternary Complex as a Mediator of Transcriptional Differences Between Mouse Embryonic and Epiblast Stem Cells , 2013, Stem cells.

[17]  William Stafford Noble,et al.  Quantifying similarity between motifs , 2007, Genome Biology.

[18]  Ole Winther,et al.  JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update , 2007, Nucleic Acids Res..

[19]  Nicola J. Rinaldi,et al.  Computational discovery of gene modules and regulatory networks , 2003, Nature Biotechnology.

[20]  W. Ouwehand,et al.  Combinatorial transcriptional control in blood stem/progenitor cells: genome-wide analysis of ten major transcriptional regulators. , 2010, Cell stem cell.

[21]  Berthold Göttgens,et al.  Deciphering transcriptional control mechanisms in hematopoiesis:the impact of high-throughput sequencing technologies. , 2011, Experimental hematology.

[22]  E. Rothenberg,et al.  Competition and collaboration: GATA-3, PU.1, and Notch signaling in early T-cell fate determination. , 2008, Seminars in immunology.

[23]  Jun Qin,et al.  Nanog and Oct4 associate with unique transcriptional repression complexes in embryonic stem cells , 2008, Nature Cell Biology.

[24]  R. Myers,et al.  Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data , 2005, Nucleic acids research.

[25]  I. Kitabayashi,et al.  Deregulated transcription factors in leukemia , 2011, International journal of hematology.

[26]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[27]  Michael Q. Zhang,et al.  Analysis of the Vertebrate Insulator Protein CTCF-Binding Sites in the Human Genome , 2007, Cell.

[28]  S. Orkin,et al.  An Extended Transcriptional Network for Pluripotency of Embryonic Stem Cells , 2008, Cell.

[29]  Thomas Lufkin,et al.  Reprogramming of fibroblasts into induced pluripotent stem cells with orphan nuclear receptor Esrrb , 2009, Nature Cell Biology.

[30]  M. Babu,et al.  An Expanded Oct4 Interaction Network: Implications for Stem Cell Biology, Development, and Disease , 2010, Cell stem cell.

[31]  Ci Chu,et al.  The Transcription Factor Zfp281 Controls Embryonic Stem Cell Pluripotency by Direct Activation and Repression of Target Genes , 2008, Stem cells.

[32]  Ioannis Xenarios,et al.  Hard-wired heterogeneity in blood stem cells revealed using a dynamic regulatory network model , 2013, Bioinform..

[33]  J. Baker,et al.  Endogenous retroviruses function as species-specific enhancer elements in the placenta , 2013, Nature Genetics.

[34]  D. Miranda-Saavedra,et al.  Genome-wide analysis of STAT3 binding in vivo predicts effectors of the anti-inflammatory response in macrophages. , 2012, Blood.

[35]  J. Nichols,et al.  BMP Induction of Id Proteins Suppresses Differentiation and Sustains Embryonic Stem Cell Self-Renewal in Collaboration with STAT3 , 2003, Cell.

[36]  Kathleen Marchal,et al.  Unveiling combinatorial regulation through the combination of ChIP information and in silico cis-regulatory module detection , 2012, Nucleic acids research.

[37]  D. Miranda-Saavedra,et al.  Genomic and computational approaches to dissect the mechanisms of STAT3’s universal and cell type-specific functions , 2013, JAK-STAT.

[38]  Diego Miranda-Saavedra,et al.  Distinct transcriptional regulatory modules underlie STAT3’s cell type-independent and cell type-specific functions , 2013, Nucleic acids research.

[39]  M. Kaplan,et al.  PU.1 Regulates TCR Expression by Modulating GATA-3 Activity1 , 2009, The Journal of Immunology.

[40]  Juan M. Vaquerizas,et al.  DNA-Binding Specificities of Human Transcription Factors , 2013, Cell.

[41]  Subhajyoti De,et al.  BloodExpress: a database of gene expression in mouse haematopoiesis , 2008, Nucleic Acids Res..

[42]  B. Panning,et al.  An RNAi Screen of Chromatin Proteins Identifies Tip60-p400 as a Regulator of Embryonic Stem Cell Identity , 2008, Cell.

[43]  Thomas Lengauer,et al.  Improved scoring of functional groups from gene expression data by decorrelating GO graph structure , 2006, Bioinform..

[44]  Martha L. Bulyk,et al.  UniPROBE: an online database of protein binding microarray data on protein–DNA interactions , 2008, Nucleic Acids Res..

[45]  M. Kubo,et al.  The Runx1 Transcription Factor Inhibits the Differentiation of Naive CD4+ T Cells into the Th2 Lineage by Repressing GATA3 Expression , 2003, The Journal of experimental medicine.

[46]  Zhenqing Ye,et al.  Cell type-specific binding patterns reveal that TCF7L2 can be tethered to the genome by association with GATA3 , 2012, Genome Biology.

[47]  L. Zon,et al.  SnapShot: hematopoiesis. , 2008, Cell.

[48]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[49]  Raja Jothi,et al.  esBAF Facilitates Pluripotency by Conditioning the Genome for LIF/STAT3Signalingand by Regulating Polycomb Function , 2011, Nature Cell Biology.

[50]  Benjamin L. Kidder,et al.  Examination of transcriptional networks reveals an important role for TCFAP2C, SMARCA4, and EOMES in trophoblast stem cell maintenance. , 2010, Genome research.

[51]  Pascal Barbry,et al.  Dax‐1 Knockdown in Mouse Embryonic Stem Cells Induces Loss of Pluripotency and Multilineage Differentiation , 2009, Stem cells.

[52]  R. Kellems,et al.  Regulation of murine Ada gene expression in the placenta by transcription factor RUNX1. , 2006, Placenta.

[53]  S. Teichmann,et al.  A HaemAtlas: characterizing gene expression in differentiated human blood cells , 2008, Blood.

[54]  Erik Splinter,et al.  The complex transcription regulatory landscape of our genome: control in three dimensions , 2011, The EMBO journal.

[55]  Stuart H. Orkin,et al.  A protein interaction network for pluripotency of embryonic stem cells , 2006, Nature.

[56]  Nicola Festuccia,et al.  Esrrb Is a Direct Nanog Target Gene that Can Substitute for Nanog Function in Pluripotent Cells , 2012, Cell stem cell.

[57]  B. Göttgens,et al.  Transcriptional regulatory networks in haematopoiesis. , 2008, Current opinion in genetics & development.

[58]  R. Jaenisch,et al.  SOX2 Co-Occupies Distal Enhancer Elements with Distinct POU Factors in ESCs and NPCs to Specify Cell State , 2013, PLoS genetics.

[59]  Judith A. Blake,et al.  The Mouse Genome Database (MGD): comprehensive resource for genetics and genomics of the laboratory mouse , 2011, Nucleic Acids Res..

[60]  Takashi Aoi,et al.  Generation of induced pluripotent stem cells without Myc from mouse and human fibroblasts , 2008, Nature Biotechnology.

[61]  Mark A. Dawson,et al.  The transcriptional program controlled by the stem cell leukemia gene Scl/Tal1 during early embryonic hematopoietic development. , 2009, Blood.

[62]  E. Birney,et al.  Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt , 2009, Nature Protocols.

[63]  Sarah A. Teichmann,et al.  Assessing Computational Methods of Cis-Regulatory Module Prediction , 2010, PLoS Comput. Biol..

[64]  Timothy L Bailey,et al.  A global role for KLF1 in erythropoiesis revealed by ChIP-seq in primary erythroid cells. , 2010, Genome research.

[65]  William Stafford Noble,et al.  Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors , 2012, Genome research.

[66]  D. Livingston,et al.  Cyclin A-kinase regulation of E2F-1 DNA binding function underlies suppression of an S phase checkpoint , 1995, Cell.

[67]  J. Yates,et al.  BRD7, a Novel PBAF-specific SWI/SNF Subunit, Is Required for Target Gene Activation and Repression in Embryonic Stem Cells* , 2008, Journal of Biological Chemistry.

[68]  Lucy J. Colwell,et al.  The emergence of protein complexes: quaternary structure, dynamics and allostery. Colworth Medal Lecture. , 2012, Biochemical Society transactions.

[69]  E. Li,et al.  Ring1b-mediated H2A Ubiquitination Associates with Inactive X Chromosomes and Is Involved in Initiation of X Inactivation* , 2004, Journal of Biological Chemistry.

[70]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[71]  Edgar Wingender,et al.  TFClass: an expandable hierarchical classification of human transcription factors , 2012, Nucleic Acids Res..