Predicting rules on organization of cis-regulatory elements, taking the order of elements into account

MOTIVATION In eukaryotes, rules regarding organization of cis-regulatory elements are complex. They sometimes govern multiple kinds of elements and positional restrictions on elements. RESULTS We propose a method for detecting rules, by which the order of elements is restricted. The order restriction is expressed as element patterns. We extract all the element patterns that occur in promoter regions of at least the specified number of genes. Then, we find significant patterns based on the expression similarity of genes with promoter regions containing each of the extracted patterns. When we applied our method to Saccharomyces cerevisiae, we detected significant patterns overlooked by previous methods, thus demonstrating the utility of our method for analyses of eukaryotic gene regulation. We also suggest that several types of element organization exist: (i) those in which only the order of elements is important, (ii) order and distance both are important and (iii) only the combination of elements is important. AVAILABILITY The program for extracting element patterns is available upon request.

[1]  K Rippe,et al.  Action at a distance: DNA-looping and initiation of transcription. , 1995, Trends in biochemical sciences.

[2]  G. Pesole,et al.  WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences. , 1992, Nucleic acids research.

[3]  M. Gerstein,et al.  Relating whole-genome expression data with protein-protein interactions. , 2002, Genome research.

[4]  Kathleen Marchal,et al.  A Gibbs sampling method to detect over-represented motifs in the upstream regions of co-expressed genes , 2001, RECOMB.

[5]  M. Künzler,et al.  Amino Acid and Adenine Cross-pathway Regulation Act through the Same 5′-TGACTC-3′ Motif in the Yeast HIS7 Promoter* , 1996, The Journal of Biological Chemistry.

[6]  Alexander E. Kel,et al.  Automatic Annotation of Genomic Regulatory Sequences by Searching for Composite Clusters , 2001, Pacific Symposium on Biocomputing.

[7]  Alexander E. Kel,et al.  TRANSCompel®: a database on composite regulatory elements in eukaryotic genes , 2002, Nucleic Acids Res..

[8]  M. Tompa,et al.  Discovery of novel transcription factor binding sites by statistical overrepresentation. , 2002, Nucleic acids research.

[9]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[10]  B. Stillman,et al.  Yeast autonomously replicating sequence binding factor is involved in nucleotide excision repair. , 1999, Genes & development.

[11]  George M. Church,et al.  Regulatory Networks Revealed by Transcriptional Profiling of Damaged Saccharomyces cerevisiae Cells: Rpn4 Links Base Excision Repair with Proteasomes , 2000, Molecular and Cellular Biology.

[12]  Thomas Werner,et al.  Functional promoter modules can be detected by formal models independent of overall nucleotide sequence similarity , 1999, Bioinform..

[13]  T. Graves,et al.  Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. , 2001, Genome research.

[14]  Michael B. Eisen,et al.  Visualizing associations between genome sequences and gene expression data using genome-mean expression profiles , 2001, ISMB.

[15]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[16]  E. O’Shea,et al.  Phosphorylation of the transcription factor PHO4 by a cyclin-CDK complex, PHO80-PHO85. , 1994, Science.

[17]  Esko Ukkonen,et al.  Correlating gene promoters and expression in gene disruption experiments , 2002, ECCB.

[18]  Peter W. Markstein,et al.  Genome-wide analysis of clustered Dorsal binding sites identifies putative target genes in the Drosophila embryo , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[19]  I. Graham,et al.  A Reb1p‐binding site is required for efficient activation of the yeast RAP1 gene, but multiple binding sites for Rap1p are not essential , 1994, Molecular microbiology.

[20]  G. Church,et al.  Identifying regulatory networks by combinatorial analysis of promoter elements , 2001, Nature Genetics.

[21]  G. Rubin,et al.  Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[22]  L. Karns,et al.  Histone H3 transcription in Saccharomyces cerevisiae is controlled by multiple cell cycle activation sites and a constitutive negative regulatory element , 1992, Molecular and cellular biology.

[23]  Gary D. Stormo,et al.  Identifying DNA and protein patterns with statistically significant alignments of multiple sequences , 1999, Bioinform..

[24]  L. Breeden,et al.  A novel Mcm1-dependent element in the SWI4, CLN3, CDC6, and CDC47 promoters activates M/G1-specific transcription. , 1997, Genes & development.

[25]  B. Redruello,et al.  Multiple regulatory elements control the expression of the yeast ACR1 gene , 1999, FEBS letters.

[26]  E. O’Shea,et al.  Genetic evidence for a morphogenetic function of the Saccharomyces cerevisiae Pho85 cyclin-dependent kinase. , 2001, Genetics.

[27]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[28]  H. Bussemaker,et al.  Regulatory element detection using correlation with expression , 2001, Nature Genetics.

[29]  Douglas L. Brutlag,et al.  BioProspector: Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-Expressed Genes , 2000, Pacific Symposium on Biocomputing.

[30]  T. Werner,et al.  Regulatory context is a crucial part of gene function. , 2002, Trends in genetics : TIG.

[31]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[32]  William Noble Grundy,et al.  Meta-MEME: motif-based hidden Markov models of protein families , 1997, Comput. Appl. Biosci..

[33]  Michele Caselle,et al.  Correlating overrepresented upstream motifs to gene expression: a computational approach to regulatory element discovery in eukaryotes , 2001, BMC Bioinformatics.

[34]  D. Botstein,et al.  The transcriptional program of sporulation in budding yeast. , 1998, Science.

[35]  R. Sharan,et al.  Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells. , 2003, Genome research.

[36]  Martin C. Frith,et al.  Detection of cis -element clusters in higher eukaryotic DNA , 2001, Bioinform..

[37]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[38]  E. Wingender,et al.  Recognition of NFATp/AP-1 composite elements within genes induced upon the activation of immune cells. , 1999, Journal of molecular biology.

[39]  H. Friesen,et al.  NDT80 and the Meiotic Recombination Checkpoint Regulate Expression of Middle Sporulation-Specific Genes in Saccharomyces cerevisiae , 1998, Molecular and Cellular Biology.

[40]  T. Hughes,et al.  Signaling and circuitry of multiple MAPK pathways revealed by a matrix of global gene expression profiles. , 2000, Science.

[41]  W. Wasserman,et al.  A predictive model for regulatory sequences directing liver-specific transcription. , 2001, Genome research.

[42]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[43]  W. H. Mager,et al.  Analysis of upstream activation sites of yeast ribosomal protein genes. , 1987, Nucleic acids research.

[44]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[45]  F. Estruch,et al.  Convergence of the Target of Rapamycin and the Snf1 Protein Kinase Pathways in the Regulation of the Subcellular Localization of Msn2, a Transcriptional Activator of STRE (Stress Response Element)-regulated Genes* , 2002, The Journal of Biological Chemistry.