Unveiling combinatorial regulation through the combination of ChIP information and in silico cis-regulatory module detection

Computationally retrieving biologically relevant cis-regulatory modules (CRMs) is not straightforward. Because of the large number of candidates and the imperfection of the screening methods, many spurious CRMs are detected that are as high scoring as the biologically true ones. Using ChIP-information allows not only to reduce the regions in which the binding sites of the assayed transcription factor (TF) should be located, but also allows restricting the valid CRMs to those that contain the assayed TF (here referred to as applying CRM detection in a query-based mode). In this study, we show that exploiting ChIP-information in a query-based way makes in silico CRM detection a much more feasible endeavor. To be able to handle the large datasets, the query-based setting and other specificities proper to CRM detection on ChIP-Seq based data, we developed a novel powerful CRM detection method ‘CPModule’. By applying it on a well-studied ChIP-Seq data set involved in self-renewal of mouse embryonic stem cells, we demonstrate how our tool can recover combinatorial regulation of five known TFs that are key in the self-renewal of mouse embryonic stem cells. Additionally, we make a number of new predictions on combinatorial regulation of these five key TFs with other TFs documented in TRANSFAC.

[1]  J. Bijl,et al.  HOXA4 induces expansion of hematopoietic stem cells in vitro and confers enhancement of pro-B-cells in vivo. , 2012, Stem cells and development.

[2]  David J. Arenillas,et al.  Validation of Skeletal Muscle cis-Regulatory Module Predictions Reveals Nucleotide Composition Bias in Functional Enhancers , 2011, PLoS Comput. Biol..

[3]  Michael P. Snyder,et al.  A Large Gene Network in Immature Erythroid Cells Is Controlled by the Myeloid and B Cell Transcriptional Regulator PU.1 , 2011, PLoS genetics.

[4]  Martin C. Frith,et al.  Inferring transcription factor complexes from ChIP-seq data , 2011, Nucleic acids research.

[5]  G. Woodfield,et al.  Discovery of SMAD4 promoters, transcription factor binding sites and deletions in juvenile polyposis patients , 2011, Nucleic acids research.

[6]  Chyung-Ru Wang,et al.  Differential requirements for the Ets transcription factor Elf-1 in the development of NKT cells and NK cells. , 2011, Blood.

[7]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[8]  P. Koopman,et al.  Sry: the master switch in mammalian sex determination , 2010, Development.

[9]  Kathleen Marchal,et al.  Cis-regulatory module detection using constraint programming , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[10]  Sarah A. Teichmann,et al.  Assessing Computational Methods of Cis-Regulatory Module Prediction , 2010, PLoS Comput. Biol..

[11]  H. Schöler,et al.  Oct1 regulates trophoblast development during early mouse embryogenesis , 2010, Development.

[12]  James Bailey,et al.  is-rSNP: a novel technique for in silico regulatory SNP detection , 2010, BMC Bioinformatics.

[13]  Yongjun Tan,et al.  Foxm1 transcription factor is required for maintenance of pluripotency of P19 embryonal carcinoma cells , 2010, Nucleic acids research.

[14]  Stephen A. Ramsey,et al.  Genome-wide histone acetylation data improve prediction of mammalian transcription factor binding sites , 2010, Bioinform..

[15]  C. Bult,et al.  Expression of the transcription factor, TFII-I, during post-implantation mouse embryonic development , 2010, BMC Research Notes.

[16]  M. Facciotti,et al.  Evaluation of Algorithm Performance in ChIP-Seq Peak Detection , 2010, PloS one.

[17]  Lei Xia,et al.  Predicting nucleosome positioning using a duration Hidden Markov Model , 2010, BMC Bioinformatics.

[18]  M. Huss,et al.  Q&A: ChIP-seq technologies and the study of gene regulation , 2010, BMC Biology.

[19]  E. J. Stringer,et al.  The role of Cdx genes in the gut and in axial development. , 2010, Biochemical Society transactions.

[20]  Ariel S. Schwartz,et al.  An Atlas of Combinatorial Transcriptional Regulation in Mouse and Man , 2010, Cell.

[21]  P. D. de Groot,et al.  Profiling of promoter occupancy by PPARα in human hepatoma cells via ChIP-chip analysis , 2010, Nucleic acids research.

[22]  T. Evans,et al.  Gata4 directs development of cardiac-inducing endoderm from ES cells. , 2010, Developmental biology.

[23]  A. Mortazavi,et al.  Computation for ChIP-seq and RNA-seq studies , 2009, Nature Methods.

[24]  Peter Van Loo,et al.  Computational methods for the detection of cis-regulatory modules , 2009, Briefings Bioinform..

[25]  Howard Y. Chang,et al.  HOXA3 Modulates Injury-Induced Mobilization and Recruitment of Bone Marrow-Derived Cells , 2009, Stem cells.

[26]  M. Gerstein,et al.  Unlocking the secrets of the genome , 2009, Nature.

[27]  A. Visel,et al.  ChIP-seq accurately predicts tissue-specific activity of enhancers , 2009, Nature.

[28]  Min Jung Park,et al.  Homeodomain transcription factor CDX1 is required for the transcriptional induction of PPARγ in intestinal cell differentiation , 2009, FEBS letters.

[29]  T. Bailey,et al.  High-throughput chromatin information enables accurate tissue-specific prediction of transcription factor binding sites , 2008, Nucleic acids research.

[30]  Dennis B. Troup,et al.  NCBI GEO: archive for high-throughput functional genomic data , 2008, Nucleic Acids Res..

[31]  Finn Drabløs,et al.  Compo: composite motif discovery using discrete models , 2008, BMC Bioinformatics.

[32]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[33]  H. Arnold,et al.  Transcriptional Regulator BPTF/FAC1 Is Essential for Trophoblast Differentiation during Early Mouse Development , 2008, Molecular and Cellular Biology.

[34]  Peng Li,et al.  Mitochondrial shuttling of CAP1 promotes actin- and cofilin-dependent apoptosis , 2008, Journal of Cell Science.

[35]  Luc De Raedt,et al.  Constraint programming for itemset mining , 2008, KDD.

[36]  Raja Jothi,et al.  Genome-wide identification of in vivo protein–DNA binding sites from ChIP-Seq data , 2008, Nucleic acids research.

[37]  Dan Xie,et al.  Cross-species de novo identification of cis-regulatory modules with GibbsModule: application to gene regulation in embryonic stem cells. , 2008, Genome research.

[38]  N. D. Clarke,et al.  Integration of External Signaling Pathways with the Core Transcriptional Network in Embryonic Stem Cells , 2008, Cell.

[39]  G. Mills,et al.  Cancer stem cells contribute to cisplatin resistance in Brca1/p53-mediated mouse mammary tumors. , 2008, Cancer research.

[40]  J. Bernal Faculty Opinions recommendation of Locomotor deficiencies and aberrant development of subtype-specific GABAergic interneurons caused by an unliganded thyroid hormone receptor alpha1. , 2008 .

[41]  Sheng Zhong,et al.  A core Klf circuitry regulates self-renewal of embryonic stem cells , 2008, Nature Cell Biology.

[42]  D. W. Knowles,et al.  Transcription Factors Bind Thousands of Active and Inactive Regions in the Drosophila Blastoderm , 2008, PLoS biology.

[43]  Peter J. Stuckey,et al.  Efficient constraint propagation engines , 2006, TOPL.

[44]  Yves Moreau,et al.  ModuleMiner - improved computational detection of cis-regulatory modules: are there different modes of gene regulation in embryonic development and adult tissues? , 2008, Genome Biology.

[45]  Finn Drabløs,et al.  Assessment of composite motif discovery methods , 2008, BMC Bioinformatics.

[46]  R. Schmidt,et al.  Misexpression of Pou3f1 Results in Peripheral Nerve Hypomyelination and Axonal Loss , 2007, The Journal of Neuroscience.

[47]  Zhenguo Wu,et al.  JAK1–STAT1–STAT3, a key pathway promoting proliferation and preventing premature differentiation of myoblasts , 2007, The Journal of cell biology.

[48]  Nello Cristianini,et al.  MINI: Mining Informative Non-redundant Itemsets , 2007, PKDD.

[49]  R. Maas,et al.  Concerted action of Msx1 and Msx2 in regulating cranial neural crest cell differentiation during frontal bone development , 2007, Mechanisms of Development.

[50]  Mark Craven,et al.  A specialized learner for inferring structured cis-regulatory modules , 2006, BMC Bioinformatics.

[51]  Yang Shi,et al.  Essential Dosage-Dependent Functions of the Transcription Factor Yin Yang 1 in Late Embryonic Development and Cell Cycle Progression , 2006, Molecular and Cellular Biology.

[52]  Terrence S. Furey,et al.  The UCSC Genome Browser Database: update 2006 , 2005, Nucleic Acids Res..

[53]  Alexander E. Kel,et al.  TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes , 2005, Nucleic Acids Res..

[54]  Vladimir Petrovic,et al.  Forkhead Box M1 Regulates the Transcriptional Network of Genes Essential for Mitotic Progression and Genes Encoding the SCF (Skp2-Cks1) Ubiquitin Ligase , 2005, Molecular and Cellular Biology.

[55]  E. Davidson Genomic Regulatory Systems: Development and Evolution , 2005 .

[56]  Bin Li,et al.  Limitations and potentials of current motif discovery algorithms , 2005, Nucleic acids research.

[57]  R. Foreman,et al.  Foxd3 is required in the trophoblast progenitor cell lineage of the mouse embryo. , 2005, Developmental biology.

[58]  M. Sander,et al.  NKX6 transcription factor activity is required for α- andβ -cell development in the pancreas , 2005 .

[59]  Thomas Werner,et al.  MatInspector and beyond: promoter analysis based on transcription factor binding sites , 2005, Bioinform..

[60]  A. Look,et al.  TEF, an antiapoptotic bZIP transcription factor related to the oncogenic E2A-HLF chimera, inhibits cell growth by down-regulating expression of the common beta chain of cytokine receptors. , 2005, Blood.

[61]  Jun S. Liu,et al.  De novo cis-regulatory module elicitation for eukaryotic genomes. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[62]  Alan Ashworth,et al.  Targeting the DNA repair defect in BRCA mutant cells as a therapeutic strategy , 2005, Nature.

[63]  Jeffrey A Whitsett,et al.  Compensatory Roles of Foxa1 and Foxa2 during Lung Morphogenesis* , 2005, Journal of Biological Chemistry.

[64]  Ariel J. Levine,et al.  TGFβ/activin/nodal signaling is necessary for the maintenance of pluripotency in human embryonic stem cells , 2005 .

[65]  T. Werner,et al.  Linking disease-associated genes to regulatory networks via promoter organization , 2005, Nucleic acids research.

[66]  M. Sander,et al.  NKX6 transcription factor activity is required for alpha- and beta-cell development in the pancreas. , 2005, Development.

[67]  Ariel J. Levine,et al.  TGFbeta/activin/nodal signaling is necessary for the maintenance of pluripotency in human embryonic stem cells. , 2005, Development.

[68]  A. Leutz,et al.  Essential Requirement of CCAAT/Enhancer Binding Proteins in Embryogenesis , 2004, Molecular and Cellular Biology.

[69]  A. Groves,et al.  Expression of mouse Foxi class genes in early craniofacial development , 2004, Developmental dynamics : an official publication of the American Association of Anatomists.

[70]  I. Graef,et al.  A Field of Myocardial-Endocardial NFAT Signaling Underlies Heart Valve Morphogenesis , 2004, Cell.

[71]  W. Wong,et al.  CisModule: de novo discovery of cis-regulatory modules by hierarchical mixture modeling. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[72]  Joonsoo Kang,et al.  STAT5 Is Required for Thymopoiesis in a Development Stage-Specific Manner1 , 2004, The Journal of Immunology.

[73]  Bart De Moor,et al.  A genetic algorithm for the detection of new cis-regulatory modules in sets of coregulated genes , 2004, Bioinform..

[74]  J. Lieb,et al.  Evidence for nucleosome depletion at active regulatory regions genome-wide , 2004, Nature Genetics.

[75]  S. Pikkarainen,et al.  GATA transcription factors in the developing and adult heart. , 2004, Cardiovascular research.

[76]  P. Prinos,et al.  Cdx1 Autoregulation Is Governed by a Novel Cdx1-LEF1 Transcription Complex , 2004, Molecular and Cellular Biology.

[77]  Z. Weng,et al.  Detection of functional DNA motifs via statistical over-representation. , 2004, Nucleic acids research.

[78]  J. Lieb,et al.  ChIP-chip: considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments. , 2004, Genomics.

[79]  L. Sussel,et al.  The concerted activities of Pax4 and Nkx2.2 are essential to initiate pancreatic beta-cell differentiation. , 2004, Developmental biology.

[80]  I. Thesleff,et al.  Phenotypic Changes in Dentition of Runx2 Homozygote-null Mutant Mice , 2004, The journal of histochemistry and cytochemistry : official journal of the Histochemistry Society.

[81]  R. Braun,et al.  Androgen receptor function is required in Sertoli cells for the terminal differentiation of haploid spermatids , 2003, Development.

[82]  R. Shiekhattar,et al.  Isolation of human NURF: a regulator of Engrailed gene expression , 2003, The EMBO journal.

[83]  Bart De Moor,et al.  Computational detection of cis-regulatory modules , 2003, ECCB.

[84]  A. Fusco,et al.  Loss of Hmga1 gene function affects embryonic stem cell lymphohematopoietic differentiation , 2003, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[85]  Kathleen Marchal,et al.  INCLUSive: a web portal and service registry for microarray and regulatory sequence analysis , 2003, Nucleic Acids Res..

[86]  Martin C. Frith,et al.  Cluster-Buster: finding dense clusters of motifs in DNA sequences , 2003, Nucleic Acids Res..

[87]  A. Rudensky,et al.  Foxp3 programs the development and function of CD4+CD25+ regulatory T cells , 2003, Nature Immunology.

[88]  C. Deng,et al.  Disruption of Transforming Growth Factor-β Signaling in ELF β-Spectrin-Deficient Mice , 2003, Science.

[89]  C. Deng,et al.  Disruption of transforming growth factor-beta signaling in ELF beta-spectrin-deficient mice. , 2003, Science.

[90]  M. Busslinger,et al.  Nephric lineage specification by Pax2 and Pax8. , 2002, Genes & development.

[91]  J. Darnell,et al.  Signalling: STATs: transcriptional control and biological impact , 2002, Nature Reviews Molecular Cell Biology.

[92]  Jun S. Liu,et al.  An algorithm for finding protein–DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments , 2002, Nature Biotechnology.

[93]  M. Takeichi,et al.  Homeobox gene hoxa3 is essential for the formation of the carotid body in the mouse embryos. , 2002, Developmental biology.

[94]  A. Voss,et al.  Gcm1 expression defines three stages of chorio-allantoic interaction during placental development , 2002, Mechanisms of Development.

[95]  M. Goldsmith,et al.  STAT5 promotes multilineage hematolymphoid development in vivo through effects on early hematopoietic progenitor cells. , 2002, Blood.

[96]  B. Beverloo,et al.  The structure‐specific endonuclease Ercc1—Xpf is required for targeted gene replacement in embryonic stem cells , 2001, The EMBO journal.

[97]  Martin C. Frith,et al.  Detection of cis -element clusters in higher eukaryotic DNA , 2001, Bioinform..

[98]  E. Robertson,et al.  Mouse embryos lacking Smad1 signals display defects in extra-embryonic tissues and germ cell formation. , 2001, Development.

[99]  S. Chanda,et al.  Requirement for Pbx1 in skeletal patterning and programming chondrocyte proliferation and differentiation. , 2001, Development.

[100]  E. Fuchs,et al.  Tcf3 and Lef1 regulate lineage differentiation of multipotent stem cells in skin. , 2001, Genes & development.

[101]  C. Perez-Sanchez,et al.  Fhx (Foxj2) expression is activated during spermatogenesis and very early in embryonic development , 2000, Mechanisms of Development.

[102]  Takashi Tanaka,et al.  The biology of Stat4 and Stat6 , 2000, Oncogene.

[103]  R. Cardiff,et al.  Impact of progesterone receptor on cell-fate decisions during mammary gland development. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[104]  J. Farrar,et al.  Recruitment of Stat4 to the Human Interferon-α/β Receptor Requires Activated Stat2* , 2000, The Journal of Biological Chemistry.

[105]  J. Farrar,et al.  Recruitment of Stat4 to the human interferon-alpha/beta receptor requires activated Stat2. , 2000, The Journal of biological chemistry.

[106]  Tiansen Li,et al.  Requirement for the c-Maf transcription factor in crystallin gene regulation and lens development. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[107]  H Clevers,et al.  Wnt3a-/--like phenotype and limb deficiency in Lef1(-/-)Tcf1(-/-) mice. , 1999, Genes & development.

[108]  C. Lien,et al.  Control of early cardiac-specific transcription of Nkx2-5 by a GATA-dependent enhancer. , 1999, Development.

[109]  Hans Clevers,et al.  Depletion of epithelial stem-cell compartments in the small intestine of mice lacking Tcf-4 , 1998, Nature Genetics.

[110]  Ahmed Mansouri,et al.  Follicular cells of the thyroid gland require Pax8 gene function , 1998, Nature Genetics.

[111]  B. Morgan,et al.  Helios, a novel dimerization partner of Ikaros expressed in the earliest hematopoietic progenitors , 1998, Current Biology.

[112]  S. Orkin,et al.  Knock-in mutation of transcription factor GATA-3 into the GATA-1 locus: partial rescue of GATA-1 loss of function in erythroid cells. , 1998, Developmental biology.

[113]  T. Jacks Tumor suppressor gene mutations in mice. , 1999, Annual review of genetics.

[114]  C H Fox,et al.  The T/ebp null mouse: thyroid-specific enhancer-binding protein is essential for the organogenesis of the thyroid, lung, ventral forebrain, and pituitary. , 1996, Genes & development.

[115]  P. Gruss,et al.  Pax-2 controls multiple steps of urogenital development. , 1995, Development.

[116]  M. Nemer,et al.  Inhibition of transcription factor GATA-4 expression blocks in vitro cardiac muscle differentiation , 1995, Molecular and cellular biology.

[117]  W. Leonard,et al.  Regulation of cell-type-specific interleukin-2 receptor alpha-chain gene expression: potential role of physical interactions between Elf-1, HMG-I(Y), and NF-kappa B family proteins , 1995, Molecular and cellular biology.