Conservation of regulatory elements between two species of Drosophila

Background: One of the important goals in the post-genomic era is to determine the regulatory elements within the non-coding DNA of a given organism's genome. The identification of functional cis-regulatory modules has proven difficult since the component factor binding sites are small and the rules governing their arrangement are poorly understood. However, the genomes of suitably diverged species help to predict regulatory elements based on the generally accepted assumption that conserved blocks of genomic sequence are likely to be functional. To judge the efficacy of strategies that prefilter by sequence conservation it is important to know to what extent the converse assumption holds, namely that functional elements common to both species will fall within these conserved blocks. The recently completed sequence of a second Drosophila species provides an opportunity to test this assumption for one of the experimentally best studied regulatory networks in multicellular organisms, the body patterning of the fly embryo.

Results: We find that 50%–70% of known binding sites reside in conserved sequence blocks, but these percentages are not greatly enriched over what is expected by chance. Finally, a computational genome-wide search in both species for regulatory modules based on clusters of binding sites suggests that genes central to the regulatory network are consistently recovered.

Conclusions: Our results indicate that binding sites remain clustered for these "core modules" while not necessarily residing in conserved blocks. This is an important clue as to how regulatory information is encoded in the genome and how modules evolve.
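
The comparison described in the Results (the share of known binding sites inside conserved blocks versus the share expected by chance) can be illustrated with a small sketch. The abstract does not say how conserved blocks were defined or which null model was used, so everything below is an assumption made for illustration: the function names `fraction_in_blocks` and `chance_fraction`, the toy coordinates, the requirement that a site lie entirely within a block, and the null model that places sites of the same lengths uniformly at random in the region.

```python
import random
from bisect import bisect_right


def fraction_in_blocks(sites, blocks):
    """Fraction of sites lying entirely inside one conserved block.

    sites and blocks are lists of half-open (start, end) intervals;
    blocks must be sorted by start and must not overlap.
    """
    if not sites:
        return 0.0
    starts = [b[0] for b in blocks]
    hits = 0
    for s, e in sites:
        # Index of the last block that starts at or before the site start.
        i = bisect_right(starts, s) - 1
        if i >= 0 and e <= blocks[i][1]:
            hits += 1
    return hits / len(sites)


def chance_fraction(sites, blocks, region_len, trials=1000, seed=0):
    """Mean fraction in blocks when sites are placed uniformly at random.

    Assumed null model (not taken from the paper): each site keeps its
    length but gets a random start within a region of length region_len.
    """
    rng = random.Random(seed)
    lengths = [e - s for s, e in sites]
    total = 0.0
    for _ in range(trials):
        shuffled = []
        for n in lengths:
            s = rng.randrange(0, region_len - n + 1)
            shuffled.append((s, s + n))
        total += fraction_in_blocks(shuffled, blocks)
    return total / trials


# Toy data, purely hypothetical coordinates.
blocks = [(100, 200), (400, 550)]
sites = [(110, 120), (190, 205), (420, 430), (600, 610)]

observed = fraction_in_blocks(sites, blocks)
expected = chance_fraction(sites, blocks, region_len=1000)
print(f"observed fraction in conserved blocks: {observed:.2f}")
print(f"fraction expected by chance:           {expected:.2f}")
```

An observed fraction close to the chance value would correspond to the "not greatly enriched" outcome reported above. A real analysis would also need a significance estimate and a null model suited to the genomic region under study.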
