Core promoter sequence in yeast is a major determinant of expression level

The core promoter is the regulatory sequence to which RNA polymerase is recruited and where it acts to initiate transcription. Here, we present the first comprehensive study of yeast core promoters, providing massively parallel measurements of core promoter activity and of TSS locations and relative usage for thousands of native and designed sequences. We found core promoter activity to be highly correlated to the activity of the entire promoter and that sequence variation in different core promoter regions substantially tunes its activity in a predictable way. We also show that location, orientation, and flanking bases critically affect TATA element function, that transcription initiation in highly active core promoters is focused within a narrow region, that poly(dA:dT) orientation has a functional consequence at the 3' end of promoters, and that orthologous core promoters across yeast species have conserved activities. Our results demonstrate the importance of core promoters in the quantitative study of gene regulation.

[1]  Sebastian J Maerkl,et al.  Mapping the fine structure of a eukaryotic promoter input-output function , 2013, Nature Genetics.

[2]  L. Guarente,et al.  Each of three "TATA elements" specifies a subset of the transcription initiation sites at the CYC-1 promoter of Saccharomyces cerevisiae. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[3]  K. Struhl,et al.  A wide variety of DNA sequences can functionally replace a yeast TATA element for transcriptional activation. , 1990, Genes & development.

[4]  B. Hall,et al.  DNA sequence elements required for transcription initiation of the Schizosaccharomyces pombe ADH gene in Saccharomyces cerevisiae , 1990, Molecular and General Genetics MGG.

[5]  J. Söding,et al.  A Conserved GA Element in TATA-Less RNA Polymerase II Promoters , 2011, PloS one.

[6]  F. Dietrich,et al.  Mapping of transcription start sites in Saccharomyces cerevisiae using 5′ SAGE , 2005, Nucleic acids research.

[7]  R. Milo,et al.  Promoters maintain their relative activity levels under different growth conditions , 2013, Molecular systems biology.

[8]  Timothy R. Hughes,et al.  G+C content dominates intrinsic nucleosome occupancy , 2009, BMC Bioinformatics.

[9]  E. Segal,et al.  Poly(da:dt) Tracts: Major Determinants of Nucleosome Organization This Review Comes from a Themed Issue on Protein-nucleic Acid Interactions Edited , 2022 .

[10]  E. Segal,et al.  Molecular dissection of the genetic mechanisms that underlie expression conservation in orthologous yeast ribosomal promoters , 2014, Genome research.

[11]  B. Pugh,et al.  Identification and Distinct Regulation of Yeast TATA Box-Containing Genes , 2004, Cell.

[12]  Irene K. Moore,et al.  The DNA-encoded nucleosome organization of a eukaryotic genome , 2009, Nature.

[13]  D. Brow,et al.  Quantitative Analysis of in Vivo Initiator Selection by Yeast RNA Polymerase II Supports a Scanning Model* , 2006, Journal of Biological Chemistry.

[14]  M. Bulyk,et al.  Genomic regions flanking E-box binding sites influence DNA binding specificity of bHLH transcription factors through DNA shape. , 2013, Cell reports.

[15]  J. Friesen,et al.  A sequence pattern that occurs at the transcription initiation region of yeast RNA polymerase II promoters. , 1990, Nucleic acids research.

[16]  E. Segal,et al.  In pursuit of design principles of regulatory sequences , 2014, Nature Reviews Genetics.

[17]  Fuminori Sugihara,et al.  Highly redundant function of multiple AT-rich sequences as core promoter elements in the TATA-less RPS5 promoter of Saccharomyces cerevisiae , 2010, Nucleic Acids Res..

[18]  M. Hattori,et al.  A large-scale full-length cDNA analysis to explore the budding yeast transcriptome , 2006, Proceedings of the National Academy of Sciences.

[19]  B. Simmons,et al.  A single-base resolution map of an archaeal transcriptome. , 2010, Genome research.

[20]  J. Lis,et al.  DNA melting on yeast RNA polymerase II promoters. , 1993, Science.

[21]  Yaniv Lubling,et al.  Distinct Modes of Regulation by Chromatin Encoded through Nucleosome Positioning Signals , 2008, PLoS Comput. Biol..

[22]  M. Gerstein,et al.  The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing , 2008, Science.

[23]  J. T. Kadonaga,et al.  The RNA polymerase II core promoter. , 2003, Annual review of biochemistry.

[24]  S. Hahn,et al.  Architecture of the Yeast RNA Polymerase II Open Complex and Regulation of Activity by TFIIF , 2011, Molecular and Cellular Biology.

[25]  S. Hahn,et al.  A DNA-tethered cleavage probe reveals the path for promoter DNA in the yeast preinitiation complex , 2006, Nature Structural &Molecular Biology.

[26]  S. Brodie,et al.  Promoter-Specific Shifts in Transcription Initiation Conferred by Yeast TFIIB Mutations Are Determined by the Sequence in the Immediate Vicinity of the Start Sites , 2001, Molecular and Cellular Biology.

[27]  David Tollervey,et al.  Coding-Sequence Determinants of Gene Expression in Escherichia coli , 2009, Science.

[28]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[29]  T. Juven-Gershon,et al.  Core Promoter Functions in the Regulation of Gene Expression of Drosophila Dorsal Target Genes* , 2014, The Journal of Biological Chemistry.

[30]  Z. Yakhini,et al.  Inferring gene regulatory logic from high-throughput measurements of thousands of systematically designed promoters , 2012, Nature Biotechnology.

[31]  Eran Segal,et al.  Manipulating nucleosome disfavoring sequences allows fine-tune regulation of gene expression in yeast , 2012, Nature Genetics.

[32]  Eytan Ruppin,et al.  Translation efficiency is determined by both codon bias and folding energy , 2010, Proceedings of the National Academy of Sciences.

[33]  Z. Yakhini,et al.  Unraveling determinants of transcription factor binding outside the core binding site , 2015, Genome research.

[34]  Eran Segal,et al.  Deciphering the rules by which 5′-UTR sequences affect protein expression in yeast , 2013, Proceedings of the National Academy of Sciences.

[35]  Eran Segal,et al.  Sequence features of yeast and human core promoters that are predictive of maximal promoter activity , 2013, Nucleic acids research.

[36]  David Botstein,et al.  Promoter-specific binding of Rap1 revealed by genome-wide maps of protein–DNA association , 2001, Nature Genetics.

[37]  K. Struhl,et al.  Tc, an unusual promoter element required for constitutive transcription of the yeast HIS3 gene , 1990, Molecular and cellular biology.

[38]  N. D. Clarke,et al.  Differential binding of the related transcription factors Pho4 and Cbf1 can tune the sensitivity of promoters to different levels of an induction signal , 2013, Nucleic acids research.

[39]  R. Mitra,et al.  TATA is a modular component of synthetic promoters. , 2010, Genome research.

[40]  Randy Z. Wu,et al.  Positioned and G/C-capped poly(dA:dT) tracts associate with the centers of nucleosome-free regions in yeast promoters. , 2010, Genome research.

[41]  A. Regev,et al.  Conservation and evolvability in regulatory networks: the evolution of ribosomal regulation in yeast. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[42]  B. Pugh,et al.  Genome-wide structure and organization of eukaryotic pre-initiation complexes , 2011, Nature.

[43]  Emily M. LeProust,et al.  Synthesis of high-quality libraries of long (150mer) oligonucleotides by a novel depurination controlled process , 2010, Nucleic acids research.

[44]  M. Hampsey,et al.  Mechanism of Start Site Selection by RNA Polymerase II , 2011, The Journal of Biological Chemistry.

[45]  K. Struhl,et al.  Saturation mutagenesis of a yeast his3 "TATA element": genetic evidence for a specific TATA-binding protein. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Roger D Kornberg,et al.  Structural Basis of Transcription: An RNA Polymerase II-TFIIB Cocrystal at 4.5 Angstroms , 2004, Science.

[47]  L. A. Stargell,et al.  The Stability of the TFIIA-TBP-DNA Complex Is Dependent on the Sequence of the TATAAA Element* , 2001, The Journal of Biological Chemistry.