Identification of functional transcription factor binding sites using closely related Saccharomyces species.

Comparative genomics provides a rapid means of identifying functional DNA elements by their sequence conservation between species. Transcription factor binding sites (TFBSs) may constitute a significant fraction of these conserved sequences, but the annotation of specific TFBSs is complicated by the fact that these short, degenerate sequences may frequently be conserved by chance rather than functional constraint. To identify intergenic sequences that function as TFBSs, we calculated the probability of binding site conservation between Saccharomyces cerevisiae and its two closest relatives under a neutral model of evolution. We found that this probability is <5% for 134 of 163 transcription factor binding motifs, implying that we can reliably annotate binding sites for the majority of these transcription factors by conservation alone. Although our annotation relies on a number of assumptions, mutations in five of five conserved Ume6 binding sites and three of four conserved Ndt80 binding sites show Ume6- and Ndt80-dependent effects on gene expression. We also found that three of five unconserved Ndt80 binding sites show Ndt80-dependent effects on gene expression. Together these data imply that although sequence conservation can be reliably used to predict functional TFBSs, unconserved sequences might also make a significant contribution to a species' biology.

[1]  G. Stormo,et al.  Non-independence of Mnt repressor-operator interaction determined by a new quantitative multiple fluorescence relative affinity (QuMFRA) assay. , 2001, Nucleic acids research.

[2]  A. Mitchell,et al.  Bipartite structure of an early meiotic upstream activation sequence from Saccharomyces cerevisiae , 1993, Molecular and cellular biology.

[3]  A. Wilson,et al.  Two types of molecular evolution. Evidence from studies of interspecific hybridization. , 1974, Proceedings of the National Academy of Sciences of the United States of America.

[4]  S. Salzberg,et al.  Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura , 2004, Genome Biology.

[5]  Ronald W. Davis,et al.  Functional profiling of the Saccharomyces cerevisiae genome , 2002, Nature.

[6]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[7]  Jason E Stajich,et al.  The effects of selection against spurious transcription factor binding sites. , 2003, Molecular biology and evolution.

[8]  E. Davidson,et al.  Genomic cis-regulatory logic: experimental and computational analysis of a sea urchin gene. , 1998, Science.

[9]  A. Vershon,et al.  Transcriptional regulation of meiosis in yeast. , 2000, Current opinion in cell biology.

[10]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[11]  W. Miller,et al.  Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. , 2000, Science.

[12]  Alan M. Moses,et al.  Position specific variation in the rate of evolution in transcription factor binding sites , 2003, BMC Evolutionary Biology.

[13]  N. Patel,et al.  Functional analysis of eve stripe 2 enhancer evolution in Drosophila: rules governing conservation and change. , 1998, Development.

[14]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[15]  Klaudia Walter,et al.  Highly Conserved Non-Coding Sequences Are Associated with Vertebrate Development , 2004, PLoS biology.

[16]  D. Kinney,et al.  Yeast shuttle and integrative vectors with multiple cloning sites suitable for construction of lacZ fusions. , 1986, Gene.

[17]  M. Levine,et al.  Regulation of a segmentation stripe by overlapping activators and repressors in the Drosophila embryo. , 1991, Science.

[18]  L. Fulton,et al.  Finding Functional Features in Saccharomyces Genomes by Phylogenetic Footprinting , 2003, Science.

[19]  S. Chu,et al.  Gametogenesis in yeast is regulated by a transcriptional cascade dependent on Ndt80. , 1998, Molecular cell.

[20]  M. Kreitman,et al.  Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences. , 2001, Genome research.

[21]  G. Church,et al.  Nucleotides of transcription factor binding sites exert interdependent effects on the binding affinities of transcription factors. , 2002, Nucleic acids research.

[22]  Jens Stoye,et al.  Benchmarking tools for the alignment of functional noncoding DNA , 2004, BMC Bioinformatics.

[23]  K. Benjamin,et al.  Sum1 and Ndt80 Proteins Compete for Binding to Middle Sporulation Element Sequences That Control Meiotic Gene Expression , 2003, Molecular and Cellular Biology.

[24]  Christopher D. Brown,et al.  Noncoding regulatory sequences of Ciona exhibit strong correspondence between evolutionary constraint and functional importance. , 2004, Genome research.

[25]  I. Dawes,et al.  Regulation of gene expression during meiosis in Saccharomyces cerevisiae: SPR3 is controlled by both ABFI and a new sporulation control element , 1997, Molecular and cellular biology.

[26]  Jeffrey H. Chuang,et al.  Genome-wide regulatory complexity in yeast promoters: separation of functionally conserved and neutral sequence. , 2005, Genome research.

[27]  Michael Q. Zhang,et al.  SCPD: a promoter database of the yeast Saccharomyces cerevisiae , 1999, Bioinform..

[28]  Ronald W. Davis,et al.  The Ume6 regulon coordinates metabolic and meiotic gene expression in yeast , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[29]  R. E. Esposito,et al.  UME6 is a key regulator of nitrogen repression and meiotic development. , 1994, Genes & development.

[30]  A. Mitchell,et al.  Positive control of yeast meiotic genes by the negative regulator UME6 , 1995, Molecular and cellular biology.

[31]  Jon D. McAuliffe,et al.  Phylogenetic Shadowing of Primate Sequences to Find Functional Regions of the Human Genome , 2003, Science.

[32]  R. E. Esposito,et al.  UME6 is a central component of a developmental regulatory switch controlling meiosis-specific gene expression. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[33]  J. Costas,et al.  Turnover of binding sites for transcription factors involved in early Drosophila development. , 2003, Gene.

[34]  Thomas E. Royce,et al.  Distribution of NF-κB-binding sites across human chromosome 22 , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[35]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[36]  A. Clark,et al.  Evolution of transcription factor binding sites in Mammalian gene regulatory regions: conservation and turnover. , 2002, Molecular biology and evolution.

[37]  D. Cox,et al.  Noncoding sequences conserved in a limited number of mammals in the SIM2 interval are frequently functional. , 2004, Genome research.

[38]  A. Willems,et al.  Studies on the transformation of intact yeast cells by the LiAc/SS‐DNA/PEG procedure , 1995, Yeast.

[39]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[40]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[41]  J. Drake A constant rate of spontaneous mutation in DNA-based microbes. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Jeffrey H. Chuang,et al.  Functional Bias and Spatial Organization of Genes in Mutational Hot and Cold Regions in the Human Genome , 2004, PLoS biology.

[43]  A. Clark,et al.  Tracing the evolutionary history of Drosophila regulatory regions with models that identify transcription factor binding sites. , 2003, Molecular biology and evolution.

[44]  D. Botstein,et al.  The transcriptional program of sporulation in budding yeast. , 1998, Science.

[45]  N. Patel,et al.  Evidence for stabilizing selection in a eukaryotic enhancer element , 2000, Nature.

[46]  A. Wilson,et al.  Two types of molecular evolution , 1974 .

[47]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[48]  R. Nielsen,et al.  Detecting Selection in Noncoding Regions of Nucleotide Sequences , 2004, Genetics.

[49]  H. Kishino,et al.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA , 2005, Journal of Molecular Evolution.

[50]  Gary D. Stormo,et al.  Identifying DNA and protein patterns with statistically significant alignments of multiple sequences , 1999, Bioinform..

[51]  H. Akashi,et al.  Gene expression and molecular evolution. , 2001, Current opinion in genetics & development.