A novel unbiased measure for motif co-occurrence predicts combinatorial regulation of transcription

BackgroundMultiple transcription factors (TFs) are involved in the generation of gene expression patterns, such as tissue-specific gene expression and pleiotropic immune responses. However, how combinations of TFs orchestrate diverse gene expression patterns is poorly understood. Here we propose a new measure for regulatory motif co-occurrence and a new methodology to systematically identify TF pairs significantly co-occurring in a set of promoter sequences.ResultsInitial analyses suggest that non-CpG promoters have a higher potential for combinatorial regulation than CpG island-associated promoters, and that co-occurrences are strongly influenced by motif similarity. We applied our method to large-scale gene expression data from various tissues, and showed how our measure for motif co-occurrence is not biased by motif over-representation. Our method identified, amongst others, the binding motifs of HNF1 and FOXP1 to be significantly co-occurring in promoters of liver/kidney specific genes. Binding sites tend to be positioned proximally to each other, suggesting interactions exist between this pair of transcription factors. Moreover, the binding sites of several TFs were found to co-occur with NF-κB and IRF sites in sets of genes with similar expression patterns in dendritic cells after Toll-like receptor stimulation. Of these, we experimentally verified that CCAAT enhancer binding protein alpha positively regulates its target promoters synergistically with NF-κB.ConclusionsBoth computational and experimental results indicate that the proposed method can clarify TF interactions that could not be observed by currently available prediction methods.

[1]  Jiang Qian,et al.  Identification of tissue-specific cis-regulatory modules based on interactions between transcription factors , 2007, BMC Bioinformatics.

[2]  Michael Q. Zhang,et al.  Identifying combinatorial regulation of transcription factors and binding motifs , 2004, Genome Biology.

[3]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[4]  J. T. Kadonaga,et al.  The RNA polymerase II core promoter. , 2003, Annual review of biochemistry.

[5]  Jun Hyoung Lee,et al.  Phenotypic engineering by reprogramming gene transcription using novel artificial transcription factors in Escherichia coli , 2008, Nucleic acids research.

[6]  Zachary D. Smith,et al.  Unbiased Reconstruction of a Mammalian Transcriptional Network Mediating Pathogen Responses , 2009 .

[7]  G. Church,et al.  Genome-wide co-occurrence of promoter elements reveals a cis-regulatory cassette of rRNA transcription motifs in Saccharomyces cerevisiae. , 2002, Genome research.

[8]  Xueping Yu,et al.  Genome-wide prediction and characterization of interactions between transcription factors in Saccharomyces cerevisiae , 2006, Nucleic acids research.

[9]  C. Nerlov The C/EBP family of transcription factors: a paradigm for interaction between gene expression and proliferation control. , 2007, Trends in cell biology.

[10]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[11]  Ariel S. Schwartz,et al.  An Atlas of Combinatorial Transcriptional Regulation in Mouse and Man , 2010, Cell.

[12]  Kenta Nakai,et al.  Modeling tissue-specific structural patterns in human and mouse promoters , 2009, Nucleic acids research.

[13]  Michael Q. Zhang,et al.  Identifying cooperativity among transcription factors controlling the cell cycle in yeast. , 2003, Nucleic acids research.

[14]  E. Furlong,et al.  Combinatorial binding predicts spatio-temporal cis-regulatory activity , 2009, Nature.

[15]  K. Nakai,et al.  Two different classes of co-occurring motif pairs found by a novel visualization method in human promoter regions , 2008, BMC Genomics.

[16]  Alexander E. Kel,et al.  TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes , 2005, Nucleic Acids Res..

[17]  Xiang-Sun Zhang,et al.  Predicting eukaryotic transcriptional cooperativity by Bayesian network integration of genome-wide data , 2009, Nucleic acids research.

[18]  Jerrold M. Ward,et al.  Hepatocyte Nuclear Factor 4α (Nuclear Receptor 2A1) Is Essential for Maintenance of Hepatic Gene Expression and Lipid Homeostasis , 2001, Molecular and Cellular Biology.

[19]  Daniel Aguilar,et al.  Topological comparison of methods for predicting transcriptional cooperativity in yeast , 2008, BMC Genomics.

[20]  Sankar Ghosh,et al.  NF-κB, the first quarter-century: remarkable progress and outstanding questions. , 2012, Genes & development.

[21]  Junbai Wang,et al.  A new framework for identifying combinatorial regulation of transcription factors: A case study of the yeast cell cycle , 2007, J. Biomed. Informatics.

[22]  Michael A. Beer,et al.  Discriminative prediction of mammalian enhancers from DNA sequence. , 2011, Genome research.

[23]  G. Church,et al.  Identifying regulatory networks by combinatorial analysis of promoter elements , 2001, Nature Genetics.

[24]  Ole Winther,et al.  JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update , 2007, Nucleic Acids Res..

[25]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[26]  Martin Vingron,et al.  CpG-depleted promoters harbor tissue-specific transcription factor binding signals—implications for motif overrepresentation analyses , 2009, Nucleic acids research.

[27]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Ernest Fraenkel,et al.  Core transcriptional regulatory circuitry in human hepatocytes , 2006, Molecular systems biology.

[29]  R. Tjian,et al.  Transcription regulation and animal diversity , 2003, Nature.

[30]  Hongyu Zhao,et al.  Statistical methods to infer cooperative binding among transcription factors in Saccharomyces cerevisiae , 2008, Bioinform..

[31]  Vladimir B. Bajic,et al.  Content analysis of the core promoter region of human genes , 2003, Silico Biol..

[32]  P. Sternberg,et al.  Transcriptional network underlying Caenorhabditis elegans vulval development. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Osamu Takeuchi,et al.  Enhanced TLR-mediated NF-IL6–dependent gene expression by Trib1 deficiency , 2007, The Journal of experimental medicine.

[34]  E. Siggia,et al.  Analysis of Combinatorial cis-Regulation in Synthetic and Genomic Promoters , 2008, Nature.

[35]  S. Cereghini Liver‐enriched transcription factors and hepatocyte differentiation , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[36]  Kenta Nakai,et al.  DBTSS provides a tissue specific dynamic view of Transcription Start Sites , 2009, Nucleic Acids Res..

[37]  M. Frommer,et al.  CpG islands in vertebrate genomes. , 1987, Journal of molecular biology.

[38]  Bin Li,et al.  Uncovering a Macrophage Transcriptional Program by Integrating Evidence from Motif Scanning and Expression Dynamics , 2008, PLoS Comput. Biol..

[39]  Martin S. Taylor,et al.  Genome-wide analysis of mammalian promoter architecture and evolution , 2006, Nature Genetics.

[40]  L. Hood,et al.  A Genomic Regulatory Network for Development , 2002, Science.

[41]  Boris Lenhard,et al.  Mammalian RNA polymerase II core promoters: insights from genome-wide studies , 2007, Nature Reviews Genetics.

[42]  Bing Ren,et al.  PU.1 and C/EBPα synergistically program distinct response to NF-κB activation through establishing monocyte specific enhancers , 2011, Proceedings of the National Academy of Sciences.

[43]  E. Davidson,et al.  The hardwiring of development: organization and function of genomic regulatory systems. , 1997, Development.

[44]  Nicola J. Rinaldi,et al.  Control of Pancreas and Liver Gene Expression by HNF Transcription Factors , 2004, Science.

[45]  C. Janeway,et al.  MyD88 is an adaptor protein in the hToll/IL-1 receptor family signaling pathways. , 1998, Molecular cell.

[46]  I. Talianidis,et al.  Modulation of hepatic gene expression by hepatocyte nuclear factor 1. , 1997, Science.

[47]  S. Levy,et al.  Predicting transcription factor synergism. , 2002, Nucleic acids research.

[48]  William Stafford Noble,et al.  Quantifying similarity between motifs , 2007, Genome Biology.