Flynet: a genomic resource for Drosophila melanogaster transcriptional regulatory networks

Motivation: The highly coordinated expression of thousands of genes in an organism is regulated by the concerted action of transcription factors, chromatin proteins and epigenetic mechanisms. High-throughput experimental data for genome wide in vivo protein–DNA interactions and epigenetic marks are becoming available from large projects, such as the model organism ENCyclopedia Of DNA Elements (modENCODE) and from individual labs. Dissemination and visualization of these datasets in an explorable form is an important challenge. Results: To support research on Drosophila melanogaster transcription regulation and make the genome wide in vivo protein–DNA interactions data available to the scientific community as a whole, we have developed a system called Flynet. Currently, Flynet contains 101 datasets for 38 transcription factors and chromatin regulator proteins in different experimental conditions. These factors exhibit different types of binding profiles ranging from sharp localized peaks to broad binding regions. The protein–DNA interaction data in Flynet was obtained from the analysis of chromatin immunoprecipitation experiments on one color and two color genomic tiling arrays as well as chromatin immunoprecipitation followed by massively parallel sequencing. A web-based interface, integrated with an AJAX based genome browser, has been built for queries and presenting analysis results. Flynet also makes available the cis-regulatory modules reported in literature, known and de novo identified sequence motifs across the genome, and other resources to study gene regulation. Contact: grossman@uic.edu Availability: Flynet is available at https://www.cistrack.org/flynet/. Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  H. Karas,et al.  TRANSFAC database as a bridge between sequence data libraries and biological function. , 1997, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[2]  Julie M. Sullivan,et al.  FlyMine: an integrated database for Drosophila and Anopheles genomics , 2007, Genome Biology.

[3]  Clifford A. Meyer,et al.  Model-based analysis of tiling-arrays for ChIP-chip , 2006, Proceedings of the National Academy of Sciences.

[4]  Steven M. Gallo,et al.  REDfly 2.0: an integrated database of cis-regulatory modules and transcription factor binding sites in Drosophila , 2007, Nucleic Acids Res..

[5]  Sarah A. Teichmann,et al.  DBD: a transcription factor prediction database , 2005, Nucleic Acids Res..

[6]  Wyeth W. Wasserman,et al.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles , 2004, Nucleic Acids Res..

[7]  D. Botstein,et al.  Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF , 2001, Nature.

[8]  Manolis Kellis,et al.  Reliable prediction of regulator targets using 12 Drosophila genomes. , 2007, Genome research.

[9]  Jun S. Liu,et al.  An algorithm for finding protein–DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments , 2002, Nature Biotechnology.

[10]  Sarah A. Teichmann,et al.  FlyTF: a systematic review of site-specific transcription factors in the fruit fly Drosophila melanogaster , 2006, Bioinform..

[11]  Terry Speed,et al.  Genomic profiling and expression studies reveal both positive and negative activities for the Drosophila Myb MuvB/dREAM complex in proliferating cells. , 2007, Genes & development.

[12]  Sündüz Keleş,et al.  Mixture Modeling for Genome‐Wide Localization of Transcription Factors , 2007, Biometrics.

[13]  J. Robert Manak,et al.  Stability and Dynamics of Polycomb Target Sites in Drosophila Development , 2008, PLoS genetics.

[14]  Mark D. Biggin,et al.  NELF and GAGA Factor Are Linked to Promoter-Proximal Pausing at Many Genes in Drosophila , 2008, Molecular and Cellular Biology.

[15]  D. W. Knowles,et al.  Transcription Factors Bind Thousands of Active and Inactive Regions in the Drosophila Blastoderm , 2008, PLoS biology.

[16]  Danny Reinberg,et al.  Is there a code embedded in proteins that is based on post-translational modifications? , 2008, Nature Reviews Molecular Cell Biology.

[17]  L. Stein,et al.  JBrowse: a next-generation genome browser. , 2009, Genome research.

[18]  Ankush Mittal,et al.  Computational annotation of transcription factor binding sites in D. Melanogaster developmental genes. , 2006, Genome informatics. International Conference on Genome Informatics.

[19]  M. Gerstein,et al.  Structure and evolution of transcriptional regulatory networks. , 2004, Current opinion in structural biology.

[20]  M. Q. Zhang,et al.  Identification of human gene core promoters in silico. , 1998, Genome research.

[21]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[22]  Helen E. Parkinson,et al.  ArrayExpress—a public database of microarray experiments and gene expression profiles , 2006, Nucleic Acids Res..

[23]  Jean-Stéphane Varré,et al.  Efficient and accurate P-value computation for Position Weight Matrices , 2007, Algorithms for Molecular Biology.

[24]  Dennis B. Troup,et al.  NCBI GEO: archive for high-throughput functional genomic data , 2008, Nucleic Acids Res..

[25]  Michael B. Eisen,et al.  Association of cohesin and Nipped-B with transcriptionally active regions of the Drosophila melanogaster genome , 2008, Chromosoma.

[26]  Richard Bourgon,et al.  Genome-wide analysis of Polycomb targets in Drosophila melanogaster , 2006, Nature Genetics.

[27]  Zhao Wang,et al.  Unravelling the world of cis-regulatory elements , 2007, Medical & Biological Engineering & Computing.

[28]  John J. Wyrick,et al.  Genome-wide location and function of DNA binding proteins. , 2000, Science.

[29]  Robert Tjian,et al.  Novel TRF1/BRF target genes revealed by genome‐wide analysis of Drosophila Pol III transcription , 2007, The EMBO journal.

[30]  H. Ueda,et al.  A functional genomics strategy reveals clockwork orange as a transcriptional regulator in the Drosophila circadian clock. , 2007, Genes & development.

[31]  Tae Hoon Kim,et al.  Genome-wide analysis of protein-DNA interactions. , 2006, Annual review of genomics and human genetics.

[32]  G. Church,et al.  Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. , 2000, Journal of molecular biology.

[33]  Casey M. Bergman,et al.  Drosophila DNase I footprint database: a systematic genome annotation of transcription factor binding sites in the fruitfly, Drosophila melanogaster , 2005, Bioinform..

[34]  Myles Brown,et al.  xMAN: extreme MApping of OligoNucleotides , 2008, BMC Genomics.

[35]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.