Genomic Organization of Human Transcription Initiation Complexes

The human genome is pervasively transcribed, yet only a small fraction is coding. Here we address whether this non-coding transcription arises at promoters, and detail the interactions of initiation factors TATA box binding protein (TBP), transcription factor IIB (TFIIB) and RNA polymerase (Pol) II. Using ChIP-exo (chromatin immunoprecipitation with lambda exonuclease digestion followed by high-throughput sequencing), we identify approximately 160,000 transcription initiation complexes across the human K562 genome, and more in other cancer genomes. Only about 5% associate with messenger RNA genes. The remainder associates with non-polyadenylated non-coding transcription. Regardless, Pol II moves into a transcriptionally paused state, and TBP and TFIIB remain at the promoter. Remarkably, the vast majority of locations contain the four core promoter elements— upstream TFIIB recognition element (BREu), TATA, downstream TFIIB recognition element (BREd), and initiator element (INR)—in constrained positions. All but the INR also reside at Pol III promoters, where TBP makes similar contacts. This comprehensive and high-resolution genome-wide detection of the initiation machinery produces a consolidated view of transcription initiation events from yeast to humans at Pol II/III TATA-containing/TATA-less coding and non-coding genes.

[1]  T. Gingeras,et al.  Genome-wide transcription and the implications for genomic organization , 2007, Nature Reviews Genetics.

[2]  Daniel Schulz,et al.  Transcriptome Surveillance by Selective Termination of Noncoding RNA Synthesis , 2013, Cell.

[3]  T. Lowe,et al.  Widespread Use of TATA Elements in the Core Promoters for RNA Polymerases III, II, and I in Fission Yeast , 2001, Molecular and Cellular Biology.

[4]  R. Tjian,et al.  Binding of TAFs to core elements directs promoter selectivity by RNA polymerase II , 1995, Cell.

[5]  P. Sharp,et al.  Five intermediate complexes in transcription initiation by RNA polymerase II , 1989, Cell.

[6]  Istvan Albert,et al.  GeneTrack - a genomic data processing and visualization framework , 2008, Bioinform..

[7]  John T. Lis,et al.  Transcription Regulation Through Promoter-Proximal Pausing of RNA Polymerase II , 2008, Science.

[8]  J. Lis,et al.  Genome-wide dynamics of Pol II elongation and its interplay with promoter proximal pausing, chromatin, and exons , 2014, eLife.

[9]  S. Jackson,et al.  Mechanism of TATA-binding protein recruitment to a TATA-less class III promoter , 1992, Cell.

[10]  M. Ptashne,et al.  Transcriptional activation by recruitment , 1997, Nature.

[11]  J. T. Kadonaga,et al.  Regulation of gene expression via the core promoter and the basal transcriptional machinery. , 2010, Developmental biology.

[12]  M. Gut,et al.  Supplemental information for : “ CpG islands and GC content dictate nucleosome depletion in a transcription independent manner at mammalian promoters ” , 2012 .

[13]  B. Pugh,et al.  Comprehensive Genome-wide Protein-DNA Interactions Detected at Single-Nucleotide Resolution , 2011, Cell.

[14]  Jason H. Moore,et al.  Missing heritability and strategies for finding the underlying causes of complex disease , 2010, Nature Reviews Genetics.

[15]  P. Cramer,et al.  RNA polymerase II–TFIIB structure and mechanism of transcription initiation , 2009, Nature.

[16]  B. Pugh,et al.  ChIP‐exo Method for Identifying Genomic Location of DNA‐Binding Proteins with Near‐Single‐Nucleotide Accuracy , 2012, Current protocols in molecular biology.

[17]  D. Gilmour,et al.  Promoter proximal pausing on genes in metazoans , 2009, Chromosoma.

[18]  J. Zeitlinger,et al.  RNA polymerase II pausing during development , 2014, Development.

[19]  André L. Martins,et al.  Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers , 2014, Nature Genetics.

[20]  D. Brutlag,et al.  A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[21]  K. Kinzler,et al.  The Antisense Transcriptomes of Human Cells , 2008, Science.

[22]  Martin S. Taylor,et al.  Genome-wide analysis of mammalian promoter architecture and evolution , 2006, Nature Genetics.

[23]  Yuan He,et al.  Structural visualization of key steps in human transcription initiation , 2013, Nature.

[24]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[25]  K. Struhl,et al.  A wide variety of DNA sequences can functionally replace a yeast TATA element for transcriptional activation. , 1990, Genes & development.

[26]  T Lagrange,et al.  New core promoter element in RNA polymerase II-dependent transcription: sequence-specific DNA binding by transcription factor IIB. , 1998, Genes & development.

[27]  Wensheng Deng,et al.  A core promoter element downstream of the TATA box that is recognized by TFIIB. , 2005, Genes & development.

[28]  R. Cortese,et al.  Transcription by RNA polymerase III. , 1983, Current topics in developmental biology.

[29]  B. Pugh,et al.  Identification and Distinct Regulation of Yeast TATA Box-Containing Genes , 2004, Cell.

[30]  J. Lis,et al.  RNA polymerase II interacts with the promoter region of the noninduced hsp70 gene in Drosophila melanogaster cells. , 1986, Molecular and cellular biology.

[31]  R. Young,et al.  A Chromatin Landmark and Transcription Initiation at Most Promoters in Human Cells , 2007, Cell.

[32]  Jean-Christophe Aude,et al.  Genomic binding of Pol III transcription machinery and relationship with TFIIS transcription factor distribution in mouse embryonic stem cells , 2011, Nucleic acids research.

[33]  Philipp Kapranov,et al.  Dark Matter RNA: Existence, Function, and Controversy , 2012, Front. Gene..

[34]  Christophe Malabat,et al.  Widespread bidirectional promoters are the major source of cryptic transcripts in yeast , 2009, Nature.

[35]  Patrick Cramer,et al.  Review Conservation between the Rna Polymerase I, Ii, and Iii Transcription Initiation Machineries , 2022 .

[36]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[37]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[38]  Transcription by RNA Polymerase III , 1988 .

[39]  Timothy J. Durham,et al.  "Systematic" , 1966, Comput. J..

[40]  Bryan J Venters,et al.  A barrier nucleosome model for statistical positioning of nucleosomes throughout the yeast genome. , 2008, Genome research.

[41]  Leah Barrera,et al.  A high-resolution map of active promoters in the human genome , 2005, Nature.

[42]  Huiming Zhang,et al.  Active DNA demethylation in plants and animals. , 2012, Cold Spring Harbor symposia on quantitative biology.

[43]  Kimberly Glass,et al.  All and only CpG containing sequences are enriched in promoters abundantly bound by RNA polymerase II in multiple tissues , 2008, BMC Genomics.

[44]  J. Maguire,et al.  Integrative analysis of the melanoma transcriptome. , 2010, Genome research.

[45]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[46]  T. Tatusova,et al.  NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2006, Nucleic Acids Research.

[47]  S. Sainsbury,et al.  Structure and function of the initially transcribing RNA polymerase II–TFIIB complex , 2012, Nature.

[48]  Leighton J. Core,et al.  Regulating RNA polymerase pausing and transcription elongation in embryonic stem cells. , 2011, Genes & development.

[49]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[50]  Gene W. Yeo,et al.  Divergent Transcription from Active Promoters , 2008, Science.

[51]  L. Steinmetz,et al.  Bidirectional promoters generate pervasive transcription in yeast , 2009, Nature.

[52]  B. Pugh,et al.  Genome-wide structure and organization of eukaryotic pre-initiation complexes , 2011, Nature.

[53]  Jun Wang,et al.  Transcriptional pause release is a rate-limiting step for somatic cell reprogramming. , 2014, Cell stem cell.

[54]  Jon W. Huss,et al.  BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources , 2009, Genome Biology.

[55]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[56]  Leighton J. Core,et al.  Precise Maps of RNA Polymerase Reveal How Promoters Direct Initiation and Pausing , 2013, Science.

[57]  William Stafford Noble,et al.  Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors , 2012, Genome research.

[58]  D. Baltimore,et al.  The “initiator” as a transcription control element , 1989, Cell.

[59]  G. Kreiman,et al.  Widespread transcription at neuronal activity-regulated enhancers , 2010, Nature.

[60]  Nadav S. Bar,et al.  Landscape of transcription in human cells , 2012, Nature.