Modeling transcription programs: inferring binding site activity and dose-response model optimization

The modeling of transcription regulation programs is a major focus of today's biology. The challenge is to utilize diverse high-throughput data (gene expression, promoter binding site localization assays, protein expression) in order to infer the mechanistic models of transcription control. We propose a new model which integrates transcription factor-gene affinities, protein abundance and gene expression levels. Transcription factor binding site activity is represented by a dose-affinity-response function, and regulation is assumed to be a combinatorial function of the activities of the binding sites in the gene's promoter sites.We develop algorithms that infer the model given complete data and give a fast polynomial time algorithm under reasonable assumptions. We also show how to assess initial values of missing data (notably protein abundance) using a novel framework for active motif detection, which may be of independent interest. We test the various components of the framework on gene expression data related to carbohydrate metabolism in yeast. The results demonstrate the high specificity and sensitivity of the approach and its advantages over extant motif activity detection methods. We are also able to predict new active motifs in the galactose pathway.A key feature of our method is the global approach to transcription factor activity and to the relation between this activity and promoter signals. We use dozens of genes, with many different promoter signals and expression levels in order to draw conclusions on the function of a single transcription factor. This provides us the robustness necessary in order to overcome the considerable level of noise in the data.

[1]  Yaniv Ziv,et al.  Revealing modular organization in the yeast transcriptional network , 2002, Nature Genetics.

[2]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[3]  D. Botstein,et al.  Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF , 2001, Nature.

[4]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[5]  Xin Chen,et al.  The TRANSFAC system on gene expression regulation , 2001, Nucleic Acids Res..

[6]  H. Bussemaker,et al.  Regulatory element detection using correlation with expression , 2001, Nature Genetics.

[7]  Ron Shamir,et al.  Chain functions and scoring functions in genetic networks , 2003, ISMB.

[8]  G. Church,et al.  Identifying regulatory networks by combinatorial analysis of promoter elements , 2001, Nature Genetics.

[9]  Roger E Bumgarner,et al.  Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. , 2001, Science.

[10]  Michael B. Eisen,et al.  Visualizing associations between genome sequences and gene expression data using genome-mean expression profiles , 2001, ISMB.

[11]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[12]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[13]  Roded Sharan,et al.  Discovering statistically significant biclusters in gene expression data , 2002, ISMB.

[14]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[15]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[16]  Nir Friedman,et al.  From promoter sequence to expression: a probabilistic framework , 2002, RECOMB '02.

[17]  Ron Shamir,et al.  Computational expansion of genetic networks , 2001, ISMB.

[18]  John J. Wyrick,et al.  Genome-wide location and function of DNA binding proteins. , 2000, Science.

[19]  Henry V. Baker,et al.  Understanding the Growth Phenotype of the Yeastgcr1 Mutant in Terms of Global Genomic Expression Patterns , 2000, Journal of bacteriology.