High-throughput identification of long-range regulatory elements and their target promoters in the human genome

Enhancer elements are essential for tissue-specific gene regulation during mammalian development. Although these regulatory elements are often distant from their target genes, they affect gene expression by recruiting transcription factors to specific promoter regions. Because of this long-range action, the annotation of enhancer element–target promoter pairs remains elusive. Here, we developed a novel analysis methodology that takes advantage of Hi-C data to comprehensively identify these interactions throughout the human genome. To do this, we used a geometric distribution-based model to identify DNA–DNA interaction hotspots that contact gene promoters with high confidence. We observed that these promoter-interacting hotspots significantly overlap with known enhancer-associated histone modifications and DNase I hypersensitive sites. Thus, we defined thousands of candidate enhancer elements by incorporating these features, and found that they have a significant propensity to be bound by p300, an enhancer binding transcription factor. Furthermore, we revealed that their target genes are significantly bound by RNA Polymerase II and demonstrate tissue-specific expression. Finally, we uncovered that these elements are generally found within 1 Mb of their targets, and often regulate multiple genes. In total, our study presents a novel high-throughput workflow for confident, genome-wide discovery of enhancer–target promoter pairs, which will significantly improve our understanding of these regulatory interactions.

[1]  Raymond K. Auerbach,et al.  A User's Guide to the Encyclopedia of DNA Elements (ENCODE) , 2011, PLoS biology.

[2]  M. Levine Transcriptional Enhancers in Animal Development and Evolution , 2010, Current Biology.

[3]  V. Corces,et al.  Tissue‐specific transcriptional enhancers may act in trans on the gene located in the homologous chromosome: the molecular basis of transvection in Drosophila. , 1990, The EMBO journal.

[4]  A. Tanay,et al.  Three-Dimensional Folding and Functional Organization Principles of the Drosophila Genome , 2012, Cell.

[5]  Matthew D. Young,et al.  ChIP-seq analysis reveals distinct H3K27me3 profiles that correlate with transcriptional activity , 2011, Nucleic acids research.

[6]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[7]  William Stafford Noble,et al.  A Three-Dimensional Model of the Yeast Genome , 2010, Nature.

[8]  M. Kimmel,et al.  Conflict of interest statement. None declared. , 2010 .

[9]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[10]  David Haussler,et al.  ENCODE whole-genome data in the UCSC genome browser (2011 update) , 2010, Nucleic Acids Res..

[11]  K. Pollard,et al.  Detection of nonneutral substitution rates on mammalian phylogenies. , 2010, Genome research.

[12]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[13]  Ting Wang,et al.  ENCODE whole-genome data in the UCSC Genome Browser , 2009, Nucleic Acids Res..

[14]  A. Visel,et al.  ChIP-seq accurately predicts tissue-specific activity of enhancers , 2009, Nature.

[15]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[16]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[17]  D. S. Gross,et al.  Nuclease hypersensitive sites in chromatin. , 1988, Annual review of biochemistry.

[18]  Hideki Tanizawa,et al.  Mapping of long-range associations throughout the fission yeast genome reveals global genome organization linked to transcriptional regulation , 2010, Nucleic acids research.

[19]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[20]  Michael R. Green,et al.  Transcriptional regulatory elements in the human genome. , 2006, Annual review of genomics and human genetics.

[21]  Emery H. Bresnick,et al.  Integration of Hi-C and ChIP-seq data reveals distinct types of chromatin linkages , 2012, Nucleic acids research.

[22]  J B Lawrence,et al.  Molecular cloning and functional analysis of the adenovirus E1A-associated 300-kD protein (p300) reveals a protein with properties of a transcriptional adaptor. , 1994, Genes & development.

[23]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[24]  J. Banerji,et al.  Expression of a β-globin gene is enhanced by remote SV40 DNA sequences , 1981, Cell.

[25]  J. Banerji,et al.  Expression of a beta-globin gene is enhanced by remote SV40 DNA sequences. , 1981, Cell.

[26]  Dimitris Thanos,et al.  Transcription factors mediate long-range enhancer–promoter interactions , 2009, Proceedings of the National Academy of Sciences.

[27]  M. Bucan,et al.  Promoter features related to tissue specificity as measured by Shannon entropy , 2005, Genome Biology.

[28]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[29]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[30]  S. McKnight,et al.  Transcriptional control signals of a eukaryotic protein-coding gene. , 1982, Science.

[31]  Reza Kalhor,et al.  Genome architectures revealed by tethered chromosome conformation capture and population-based modeling , 2011, Nature Biotechnology.

[32]  R. Young,et al.  Histone H3K27ac separates active from poised enhancers and predicts developmental state , 2010, Proceedings of the National Academy of Sciences.

[33]  Marcel H. Schulz,et al.  Integrative analysis of genomic, functional and protein interaction data predicts long-range enhancer-target gene interactions , 2010, Nucleic acids research.

[34]  K. Zhao,et al.  Characterization of genome-wide enhancer-promoter interactions reveals co-expression of interacting genes and modes of higher order chromatin organization , 2012, Cell Research.

[35]  Richard Axel,et al.  Interchromosomal Interactions and Olfactory Receptor Choice , 2006, Cell.

[36]  A. Gutierrez-Hartmann,et al.  ETS transcription factors in endocrine systems , 2007, Trends in Endocrinology & Metabolism.

[37]  Y. Ruan,et al.  ChIP‐based methods for the identification of long‐range chromatin interactions , 2009, Journal of cellular biochemistry.

[38]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.

[39]  Nathaniel D. Heintzman,et al.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome , 2007, Nature Genetics.

[40]  S. Elgin,et al.  The chromatin structure of specific genes: II. Disruption of chromatin structure during gene activity , 1979, Cell.

[41]  Raymond K. Auerbach,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[42]  Ieuan Clay,et al.  The transcriptional interactome: gene expression in 3D. , 2010, Current opinion in genetics & development.

[43]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[44]  Michael A. Beer,et al.  Discriminative prediction of mammalian enhancers from DNA sequence. , 2011, Genome research.

[45]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[46]  M. Blanchette,et al.  Discovery of regulatory elements by a computational method for phylogenetic footprinting. , 2002, Genome research.

[47]  J. Dekker,et al.  The long-range interaction landscape of gene promoters , 2012, Nature.

[48]  P. Ryvkin,et al.  Genome-Wide Double-Stranded RNA Sequencing Reveals the Functional Significance of Base-Paired RNAs in Arabidopsis , 2010, PLoS genetics.

[49]  Timothy J. Durham,et al.  Systematic analysis of chromatin state dynamics in nine human cell types , 2011, Nature.

[50]  A. Nienhuis,et al.  Transcriptional regulation of fetal to adult hemoglobin switching: new therapeutic opportunities. , 2011, Blood.

[51]  Timothy J. Durham,et al.  "Systematic" , 1966, Comput. J..

[52]  Keji Zhao,et al.  Genome-wide prediction of conserved and nonconserved enhancers by histone acetylation patterns. , 2006, Genome research.

[53]  Paul T. Groth,et al.  The ENCODE (ENCyclopedia Of DNA Elements) Project , 2004, Science.