Studying the functional conservation of cis-regulatory modules and their transcriptional output

BackgroundCis-regulatory modules (CRMs) are distinct, genomic regions surrounding the target gene that can independently activate the promoter to drive transcription. The activation of a CRM is controlled by the binding of a certain combination of transcription factors (TFs). It would be of great benefit if the transcriptional output mediated by a specific CRM could be predicted. Of equal benefit would be identifying in silico a specific CRM as the driver of the expression in a specific tissue or situation. We extend a recently developed biochemical modeling approach to manage both prediction tasks. Given a set of TFs, their protein concentrations, and the positions and binding strengths of each of the TFs in a putative CRM, the model predicts the transcriptional output of the gene. Our approach predicts the location of the regulating CRM by using predicted TF binding sites in regions near the gene as input to the model and searching for the region that yields a predicted transcription rate most closely matching the known rate.ResultsHere we show the ability of the model on the example of one of the CRMs regulating the eve gene, MSE2. A model trained on the MSE2 in D. melanogaster was applied to the surrounding sequence of the eve gene in seven other Drosophila species. The model successfully predicts the correct MSE2 location and output in six out of eight Drosophila species we examine.ConclusionThe model is able to generalize from D. melanogaster to other Drosophila species and accurately predicts the location and transcriptional output of MSE2 in those species. However, we also show that the current model is not specific enough to function as a genome-wide CRM scanner, because it incorrectly predicts other genomic regions to be MSE2s.

[1]  E. Segal,et al.  Predicting expression patterns from regulatory sequence in Drosophila segmentation , 2008, Nature.

[2]  S. Salzberg,et al.  Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura , 2004, Genome Biology.

[3]  N. Patel,et al.  Functional analysis of eve stripe 2 enhancer evolution in Drosophila: rules governing conservation and change. , 1998, Development.

[4]  David H. Sharp,et al.  Transcriptional Control in Drosophila , 2003, Complexus.

[5]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[6]  David H. Sharp,et al.  Dynamical Analysis of Regulatory Interactions in the Gap Gene System of Drosophila melanogaster , 2004, Genetics.

[7]  E. Davidson,et al.  Transcriptional regulatory cascades in development: Initial rates, not steady state, determine network kinetics , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  A. Sandelin,et al.  Applied bioinformatics for the identification of regulatory elements , 2004, Nature Reviews Genetics.

[9]  Z. Weng,et al.  Detection of functional DNA motifs via statistical over-representation. , 2004, Nucleic acids research.

[10]  M. Levine,et al.  Regulation of even‐skipped stripe 2 in the Drosophila embryo. , 1992, The EMBO journal.

[11]  Chenna Ramu,et al.  SIRW: a web server for the Simple Indexing and Retrieval System that combines sequence motif searches with keyword searches , 2003, Nucleic Acids Res..

[12]  Alexander E. Kel,et al.  TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes , 2005, Nucleic Acids Res..

[13]  Rodrigo Lopez,et al.  Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[14]  Pavel A. Pevzner,et al.  Combinatorial Approaches to Finding Subtle Signals in DNA Sequences , 2000, ISMB.

[15]  Robert P Zinzen,et al.  A novel multifunctional factor involved in trans-splicing of chloroplast introns in Chlamydomonas , 2006, Nucleic acids research.

[16]  J. Davies,et al.  Molecular Biology of the Cell , 1983, Bristol Medico-Chirurgical Journal.

[17]  David H. Sharp,et al.  Quantitative and predictive model of transcriptional control of the Drosophila melanogaster even skipped gene , 2006, Nature Genetics.

[18]  Enrique Blanco,et al.  Multiple non-collinear TF-map alignments of promoter regions , 2007, BMC Bioinformatics.