Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome

A major challenge in interpreting genome sequences is understanding how the genome encodes the information that specifies when and where a gene will be expressed. The first step in this process is the identification of regions of the genome that contain regulatory information. In higher eukaryotes, this cis-regulatory information is organized into modular units [cis-regulatory modules (CRMs)] of a few hundred base pairs. A common feature of these cis-regulatory modules is the presence of multiple binding sites for multiple transcription factors. Here, we evaluate the extent to which the tendency for transcription factor binding sites to be clustered can be used as the basis for the computational identification of cis-regulatory modules. By using published DNA binding specificity data for five transcription factors active in the early Drosophila embryo, we identified genomic regions containing unusually high concentrations of predicted binding sites for these factors. A significant fraction of these binding site clusters overlap known CRMs that are regulated by these factors. In addition, many of the remaining clusters are adjacent to genes expressed in a pattern characteristic of genes regulated by these factors. We tested one of the newly identified clusters, mapping upstream of the gap gene giant (gt), and show that it acts as an enhancer that recapitulates the posterior expression pattern of gt.

[1]  G. Struhl,et al.  A molecular gradient in early Drosophila embryos and its role in specifying the body pattern , 1986, Nature.

[2]  P. V. von Hippel,et al.  Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. , 1987, Journal of molecular biology.

[3]  Marek Mlodzik,et al.  Expression of the caudal gene in the germ line of Drosophila: Formation of an RNA and protein gradient during early embryogenesis , 1987, Cell.

[4]  P. Ingham,et al.  The spatial and temporal deployment of Dfd and Scr transcripts throughout development of Drosophila. , 1987, Development.

[5]  M. Levine,et al.  Gap genes define the limits of antennapedia and bithorax gene expression during early development in Drosophila. , 1988, The EMBO journal.

[6]  C. Thummel,et al.  Vectors for Drosophila P-element-mediated transformation and tissue culture transfection. , 1988, Gene.

[7]  V. Pirrotta,et al.  A novel spatial transcription pattern associated with the segmentation gene, giant, of Drosophila. , 1989, The EMBO journal.

[8]  C. S. Parker,et al.  Transcriptional control of Drosophila fushi tarazu zebra stripe expression. , 1989, Genes & development.

[9]  C. S. Parker,et al.  The caudal gene product is a direct activator of fushi tarazu transcription during Drosophila embryogenesis , 1989, Nature.

[10]  Tom Maniatis,et al.  Early and late periodic patterns of even skipped expression are controlled by distinct regulatory elements that respond to different spatial cues , 1989, Cell.

[11]  M. Levine,et al.  Autoregulatory and gap gene response elements of the even‐skipped promoter of Drosophila. , 1989, The EMBO journal.

[12]  Establishment of the Deformed expression stripe requires the combinatorial action of coordinate, gap and pair‐rule proteins. , 1990, The EMBO journal.

[13]  M. Levine,et al.  Control of the initiation of homeotic gene expression by the gap genes giant and tailless in Drosophila. , 1990, Developmental biology.

[14]  E. Wieschaus,et al.  Molecular analysis of odd‐skipped, a zinc finger encoding segmentation gene with a novel pair‐rule expression pattern. , 1990, The EMBO journal.

[15]  Norbert Perrimon,et al.  The orthodenticle gene is regulated by bicoid and torso and specifies Drosophila head development , 1990, Nature.

[16]  V. Pirrotta,et al.  Interactions of the Drosophila gap gene giant with maternal and zygotic pattern-forming genes. , 1991, Development.

[17]  C. S. Parker,et al.  Synthetic oligonucleotides recreate Drosophila fushi tarazu zebra-stripe expression. , 1991, Genes & development.

[18]  M. Levine,et al.  Spatial regulation of the gap gene giant during Drosophila development. , 1991, Development.

[19]  M. Levine,et al.  Regulation of a segmentation stripe by overlapping activators and repressors in the Drosophila embryo. , 1991, Science.

[20]  M. Levine,et al.  Mutually repressive interactions between the gap genes giant and Krüppel define middle body regions of the Drosophila embryo. , 1991, Development.

[21]  V. Pirrotta,et al.  The giant gene of Drosophila encodes a b-ZIP DNA-binding protein that regulates the expression of other segmentation gap genes. , 1992, Development.

[22]  M. Levine,et al.  Regulation of even‐skipped stripe 2 in the Drosophila embryo. , 1992, The EMBO journal.

[23]  C. Nüsslein-Volhard,et al.  The origin of pattern and polarity in the Drosophila embryo , 1992, Cell.

[24]  AC Tose Cell , 1993, Cell.

[25]  H. Jäckle,et al.  A Drosophila homologue of human Sp1 is a head-specific segmentation gene , 1993, Nature.

[26]  S. Poole,et al.  Regulation of expression domains and effects of ectopic expression reveal gap gene-like properties of the linked pdm genes of Drosophila , 1993, Mechanisms of Development.

[27]  H. Jäckle,et al.  Trans- and cis-acting requirements for blastodermal expression of the head gap gene buttonhead , 1995, Mechanisms of Development.

[28]  Norbert Perrimon,et al.  Activation of posterior gap gene expression in the Drosophila blastoderm , 1995, Nature.

[29]  M. Levine,et al.  The eve stripe 2 enhancer employs multiple modes of transcriptional synergy. , 1996, Development.

[30]  Q. Gao,et al.  orthodenticle regulation during embryonic head development in Drosophila , 1996, Mechanisms of Development.

[31]  A. Mccarthy Development , 1996, Current Opinion in Neurobiology.

[32]  M. Levine,et al.  Regulation of two pair-rule stripes by a single enhancer in the Drosophila embryo. , 1996, Developmental biology.

[33]  H. Jäckle,et al.  Gene regulation in the Drosophila embryo. , 1996, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[34]  H. Jäckle,et al.  A cascade of transcriptional control leading to axis determination in Drosophila , 1997, Journal of cellular physiology.

[35]  D Kosman,et al.  Concentration-dependent patterning by an ectopic expression domain of the Drosophila gap gene knirps. , 1997, Development.

[36]  K. Roeder,et al.  A statistical model for locating regulatory regions in genomic DNA. , 1997, Journal of molecular biology.

[37]  J. Fickett,et al.  Identification of regulatory regions which confer muscle-specific gene expression. , 1998, Journal of molecular biology.

[38]  D. S. Fields,et al.  Specificity, free energy and information content in protein-DNA interactions. , 1998, Trends in biochemical sciences.

[39]  N. Patel,et al.  Functional analysis of eve stripe 2 enhancer evolution in Drosophila: rules governing conservation and change. , 1998, Development.

[40]  G. Korge,et al.  The Pipsqueak Protein of Drosophila melanogasterBinds to GAGA Sequences through a Novel DNA-binding Domain* , 1998, The Journal of Biological Chemistry.

[41]  Q Gao,et al.  Targeting gene expression to the head: the Drosophila orthodenticle gene is a direct target of the Bicoid morphogen. , 1998, Development.

[42]  M. Fujioka,et al.  Analysis of an even-skipped rescue transgene reveals both composite and discrete neuronal and early blastoderm enhancers, and multi-stripe positioning by gap gene repressor gradients. , 1999, Development.

[43]  Gary D. Stormo,et al.  Identifying DNA and protein patterns with statistically significant alignments of multiple sequences , 1999, Bioinform..

[44]  Andreas Wagner,et al.  Genes regulated cooperatively by one or more transcription factors and their identification in whole eukaryotic genomes , 1999, Bioinform..

[45]  M. Fujioka,et al.  The even-skipped locus is contained in a 16-kb chromatin domain. , 1999, Developmental biology.

[46]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[47]  C. Lawrence,et al.  Human-mouse genome comparisons to locate regulatory sites , 2000, Nature Genetics.

[48]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[49]  N. Patel,et al.  Evidence for stabilizing selection in a eukaryotic enhancer element , 2000, Nature.

[50]  Gary D. Stormo,et al.  DNA binding sites: representation and discovery , 2000, Bioinform..

[51]  Xin Chen,et al.  The TRANSFAC system on gene expression regulation , 2001, Nucleic Acids Res..

[52]  Martin C. Frith,et al.  Detection of cis -element clusters in higher eukaryotic DNA , 2001, Bioinform..

[53]  Bret J. Pearson,et al.  Drosophila Neuroblasts Sequentially Express Transcription Factors which Specify the Temporal Identity of Their Neuronal Progeny , 2001, Cell.

[54]  L. Pennacchio,et al.  Genomic strategies to identify mammalian regulatory sequences , 2001, Nature Reviews Genetics.

[55]  W. Wasserman,et al.  A predictive model for regulatory sequences directing liver-specific transcription. , 2001, Genome research.

[56]  N. Gostling,et al.  From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design , 2002, Heredity.

[57]  M. Laubichler Review of: Carroll, Sean B., Jennifer K. Grenier and Scott D. Weatherbee: From DNA to diversity : molecular genetics and the evolution of animal design. Malden, Mass [u.a.]: Blackwell Science 2001 , 2003 .