Genome-wide decoding of hierarchical modular structure of transcriptional regulation by cis-element and expression clustering

MOTIVATION: A holistic approach to studying cellular processes identifies both gene-expression changes and the regulatory elements that promote them. Cellular regulatory processes can be viewed as transcriptional modules (TMs): groups of coexpressed genes regulated by groups of transcription factors (TFs). We set out to devise a method that identifies TMs without imposing arbitrary thresholds on TM size or number.

METHOD: If gene expression is determined by the TFs that bind to a gene's promoter, then clustering genes by their TF binding sites (cis-elements) should produce gene groups similar to those obtained by clustering on gene expression. Intersections between the expression-based and cis-element-based gene clusters reveal TMs. A statistical significance assigned to each TM allows identification of regulatory units of any size.

RESULTS: Our method correctly identifies the number and sizes of TMs on simulated datasets. We demonstrate that the TMs obtained from yeast experimental data are biologically relevant by comparing them with MIPS and GO categories. Our modules are in statistically significant agreement with TMs reported by other research groups. This work suggests that there is no preferential division of biological processes into regulatory units; each degree of partitioning exposes a slice of the biological network, revealing a hierarchical modular organization of transcriptional regulation.
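The intersection step described under METHOD can be illustrated with a minimal sketch. The abstract does not say which significance test or multiple-testing correction is used, so the hypergeometric tail probability and the Bonferroni adjustment below are assumptions chosen for illustration, and the function names (`hypergeom_tail`, `candidate_modules`) and the toy gene labels are hypothetical.

```python
from itertools import product
from math import comb


def hypergeom_tail(k, n_genes, size_a, size_b):
    """P(overlap >= k) when sets of size_a and size_b are drawn from n_genes.

    Assumption: a hypergeometric null model; the source does not name its test.
    """
    total = comb(n_genes, size_b)
    upper = min(size_a, size_b)
    hits = sum(
        comb(size_a, i) * comb(n_genes - size_a, size_b - i)
        for i in range(k, upper + 1)
    )
    return hits / total


def candidate_modules(expr_clusters, cis_clusters, n_genes, alpha=0.05):
    """Intersect every expression cluster with every cis-element cluster.

    Returns (expression cluster, cis cluster, shared genes, adjusted p) for
    intersections whose Bonferroni-adjusted p-value is at most alpha.
    No minimum module size is imposed; significance alone filters.
    """
    n_tests = len(expr_clusters) * len(cis_clusters)
    modules = []
    for (e_name, e_genes), (c_name, c_genes) in product(
        expr_clusters.items(), cis_clusters.items()
    ):
        shared = e_genes & c_genes
        if not shared:
            continue
        p = hypergeom_tail(len(shared), n_genes, len(e_genes), len(c_genes))
        p_adj = min(1.0, p * n_tests)  # Bonferroni (an assumption)
        if p_adj <= alpha:
            modules.append((e_name, c_name, sorted(shared), p_adj))
    return sorted(modules, key=lambda m: m[3])


if __name__ == "__main__":
    # Toy data: 20 hypothetical genes, two clusterings that mostly agree.
    all_genes = {f"g{i}" for i in range(20)}
    expr = {"E1": {f"g{i}" for i in range(5)}}
    expr["E2"] = all_genes - expr["E1"]
    cis = {"C1": {"g0", "g1", "g2", "g3", "g5"}}
    cis["C2"] = all_genes - cis["C1"]
    for e_name, c_name, genes, p_adj in candidate_modules(expr, cis, 20):
        print(e_name, c_name, len(genes), f"{p_adj:.4f}")
```

Because the filter is a p-value, not a size cutoff, small and large intersections are treated alike. Repeating the procedure at different clustering granularities would give the successive "slices" that the RESULTS paragraph refers to.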
