Highly parallel assays of tissue-specific enhancers in whole Drosophila embryos

Transcriptional enhancers are a primary mechanism by which tissue-specific gene expression is achieved. Despite the importance of these regulatory elements in development, in responses to environmental stress and in disease, testing enhancer activity in animals remains tedious, and only a minority of enhancers have been characterized. Here we describe 'enhancer-FACS-seq' (eFS) for highly parallel identification of active, tissue-specific enhancers in Drosophila melanogaster embryos. Analysis of enhancers identified by eFS as active in mesodermal tissues revealed enrichment of DNA binding site motifs for known mesodermal transcription factors and for putative, previously uncharacterized ones. Naive Bayes classifiers using transcription factor binding site motifs accurately predicted mesodermal enhancer activity. Application of eFS to other cell types and organisms should accelerate the cataloging of enhancers and the understanding of how transcriptional regulation is encoded in them.
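To make the classification step concrete, the following is a minimal sketch of a naive Bayes classifier over motif features. It is illustrative only: the abstract does not state which naive Bayes variant, feature encoding, or training procedure was used, so the Bernoulli (presence/absence) model, the Laplace smoothing parameter `alpha`, the function names, the placeholder motif columns (`motif_A`, `motif_B`, `motif_C`), and the toy data are all assumptions made for this example and do not represent measured enhancer activity.

```python
import math


def train_bernoulli_nb(X, y, alpha=1.0):
    """Fit a Bernoulli naive Bayes model on binary motif-presence features.

    X: list of rows, one per candidate enhancer; each row holds 0/1 flags
       saying whether a given motif occurs in that sequence (assumed encoding).
    y: list of class labels (here 1 = active, 0 = inactive).
    alpha: Laplace smoothing pseudocount (assumed value).
    """
    n_features = len(X[0])
    model = {}
    for label in sorted(set(y)):
        rows = [x for x, lab in zip(X, y) if lab == label]
        # Smoothed probability that each motif is present in this class.
        p_present = [
            (sum(r[j] for r in rows) + alpha) / (len(rows) + 2 * alpha)
            for j in range(n_features)
        ]
        log_prior = math.log(len(rows) / len(y))
        model[label] = (log_prior, p_present)
    return model


def log_posteriors(model, x):
    """Return the unnormalized log posterior of each class for one row."""
    scores = {}
    for label, (log_prior, p_present) in model.items():
        score = log_prior
        for flag, p in zip(x, p_present):
            # Features are treated as independent given the class.
            score += math.log(p if flag else 1.0 - p)
        scores[label] = score
    return scores


def predict(model, x):
    """Return the class with the highest log posterior."""
    scores = log_posteriors(model, x)
    return max(scores, key=scores.get)


# Toy, made-up data: columns are motif_A, motif_B, motif_C (hypothetical).
X = [
    [1, 1, 0], [1, 0, 0], [1, 1, 1], [0, 1, 0],  # labeled active
    [0, 0, 1], [0, 0, 0], [0, 1, 1], [0, 0, 1],  # labeled inactive
]
y = [1, 1, 1, 1, 0, 0, 0, 0]

model = train_bernoulli_nb(X, y)
print(predict(model, [1, 1, 0]))
print(predict(model, [0, 0, 1]))
```

In practice, features could equally be motif match counts or scores rather than presence flags, and performance would need to be assessed on held-out sequences; neither choice is specified in the abstract.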
