Cell Specificity of Human Regulatory Annotations and Their Genetic Effects on Gene Expression

Varshney et al. directly compare five widely-used annotations of active regulatory elements: stretch, super, and typical enhancers; highoccupancy target (HOT) regions; and broad domains in four human cell types. Overall, their results suggest that current... Epigenomic signatures from histone marks and transcription factor (TF)-binding sites have been used to annotate putative gene regulatory regions. However, a direct comparison of these diverse annotations is missing, and it is unclear how genetic variation within these annotations affects gene expression. Here, we compare five widely used annotations of active regulatory elements that represent high densities of one or more relevant epigenomic marks—“super” and “typical” (nonsuper) enhancers, stretch enhancers, high-occupancy target (HOT) regions, and broad domains—across the four matched human cell types for which they are available. We observe that stretch and super enhancers cover cell type-specific enhancer “chromatin states,” whereas HOT regions and broad domains comprise more ubiquitous promoter states. Expression quantitative trait loci (eQTL) in stretch enhancers have significantly smaller effect sizes compared to those in HOT regions. Strikingly, chromatin accessibility QTL in stretch enhancers have significantly larger effect sizes compared to those in HOT regions. These observations suggest that stretch enhancers could harbor genetically primed chromatin to enable changes in TF binding, possibly to drive cell type-specific responses to environmental stimuli. Our results suggest that current eQTL studies are relatively underpowered or could lack the appropriate environmental context to detect genetic effects in the most cell type-specific “regulatory annotations,” which likely contributes to infrequent colocalization of eQTL with genome-wide association study signals.

[1]  Kaur Alasoo,et al.  Shared genetic effects on chromatin and gene expression indicate a role for enhancer priming in immune response , 2018, Nature Genetics.

[2]  Mark Gerstein,et al.  The ModERN Resource: Genome-Wide Binding Profiles for Hundreds of Drosophila and Caenorhabditis elegans Transcription Factors , 2017, Genetics.

[3]  D. Ucar,et al.  Chromatin interaction networks revealed unique connectivity patterns of broad H3K4me3 domains and super enhancers in 3D chromatin , 2017, Scientific Reports.

[4]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[5]  Hailiang Huang,et al.  Fine-mapping inflammatory bowel disease loci to single variant resolution , 2017, Nature.

[6]  Jian Yan,et al.  Mice deficient of Myc super-enhancer region reveal differential control mechanism between normal and pathological growth , 2017, eLife.

[7]  G. Hon,et al.  Multiplexed Engineering and Analysis of Combinatorial Enhancer Activity in Single Cells. , 2017, Molecular cell.

[8]  Alexander Gusev,et al.  Functional Architectures of Local and Distal Regulation of Gene Expression in Multiple Human Tissues. , 2017, American journal of human genetics.

[9]  Laura J. Scott,et al.  Genetic regulatory signatures underlying islet gene expression and type 2 diabetes , 2017, Proceedings of the National Academy of Sciences.

[10]  Jennifer A. Mitchell,et al.  Enhancers and super-enhancers have an equivalent regulatory role in embryonic stem cells through regulation of single or multiple genes , 2017, Genome research.

[11]  Sridhar Rao,et al.  Super-Enhancers at the Nanog Locus Differentially Regulate Neighboring Pluripotency-Associated Genes. , 2016, Cell reports.

[12]  L. Pennacchio,et al.  Genetic dissection of the α-globin super-enhancer in vivo , 2016, Nature Genetics.

[13]  Laura J. Scott,et al.  The genetic regulatory signature of type 2 diabetes in human skeletal muscle , 2016, Nature Communications.

[14]  L. Hennighausen,et al.  Hierarchy within the mammary STAT5-driven Wap super-enhancer , 2016, Nature Genetics.

[15]  Roland Eils,et al.  Active medulloblastoma enhancers reveal subgroup-specific cellular origins , 2016, Nature.

[16]  Xiaochen Bo,et al.  Genome-wide identification and characterisation of HOT regions in the human genome , 2016, bioRxiv.

[17]  Eric Haugen,et al.  Large-scale identification of sequence variants impacting human transcription factor occupancy in vivo , 2015, Nature Genetics.

[18]  Jonathan K. Pritchard,et al.  WASP: allele-specific software for robust molecular quantitative trait locus discovery , 2015, Nature Methods.

[19]  Xi Chen,et al.  Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor-suppressor genes , 2015, Nature Genetics.

[20]  Ji Zhang,et al.  GREGOR: evaluating global enrichment of trait-associated variants in epigenomic features using a systematic, data-driven approach , 2015, Bioinform..

[21]  Stephen C. J. Parker,et al.  Motif signatures in stretch enhancers are enriched for disease-associated genetic variants , 2015, Epigenetics & Chromatin.

[22]  Charles Y. Lin,et al.  Convergence of developmental and oncogenic signaling pathways at transcriptional super-enhancers. , 2015, Molecular cell.

[23]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[24]  Peter J. Bickel,et al.  Comparative analysis of regulatory information and circuits across distant species , 2014, Nature.

[25]  Aaron C. Daugherty,et al.  H3K4me3 Breadth Is Linked to Cell Identity and Transcriptional Consistency , 2014, Cell.

[26]  K. Tan,et al.  Global view of enhancer–promoter interactome in human cells , 2014, Proceedings of the National Academy of Sciences.

[27]  M. Lupien,et al.  Combinatorial effects of multiple enhancer variants in linkage disequilibrium dictate levels of gene expression to confer susceptibility to common traits , 2014, Genome research.

[28]  Mark I. McCarthy,et al.  Pancreatic islet enhancer clusters enriched in type 2 diabetes risk–associated variants , 2013, Nature Genetics.

[29]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[30]  R. Young,et al.  Super-Enhancers in the Control of Cell Identity and Disease , 2013, Cell.

[31]  Stephen C. J. Parker,et al.  Chromatin stretch enhancer states drive cell-specific gene regulation and harbor human disease risk variants , 2013, Proceedings of the National Academy of Sciences.

[32]  Howard Y. Chang,et al.  Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.

[33]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[34]  Caleb Webber,et al.  GAT: a simulation framework for testing the association of genomic intervals , 2013, Bioinform..

[35]  David A. Orlando,et al.  Master Transcription Factors and Mediator Establish Super-Enhancers at Key Cell Identity Genes , 2013, Cell.

[36]  David A. Orlando,et al.  Selective Inhibition of Tumor Oncogenes by Disruption of Super-Enhancers , 2013, Cell.

[37]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[38]  Buhm Han,et al.  Chromatin marks identify critical cell types for fine mapping complex trait variants , 2012 .

[39]  Sven Rahmann,et al.  Snakemake--a scalable bioinformatics workflow engine. , 2012, Bioinformatics.

[40]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[41]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[42]  William Stafford Noble,et al.  Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors , 2012, Genome research.

[43]  Barry J Dickson,et al.  HOT regions function as patterned developmental enhancers and have a distinct cis-regulatory signature. , 2012, Genes & development.

[44]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[45]  Joseph K. Pickrell,et al.  DNaseI sensitivity QTLs are a major determinant of human expression variation , 2011, Nature.

[46]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[47]  Timothy J. Durham,et al.  Systematic analysis of chromatin state dynamics in nine human cell types , 2011, Nature.

[48]  Gos Micklem,et al.  Supporting Online Material Materials and Methods Figs. S1 to S50 Tables S1 to S18 References Identification of Functional Elements and Regulatory Circuits by Drosophila Modencode , 2022 .

[49]  Mazhar Adli,et al.  Genome-wide chromatin maps derived from limited numbers of hematopoietic progenitors , 2010, Nature Methods.

[50]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[51]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[52]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[53]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[54]  T. Mikkelsen,et al.  Genome-wide maps of chromatin state in pluripotent and lineage-committed cells , 2007, Nature.

[55]  Ling V. Sun,et al.  Hotspots of transcription factor colocalization in the genome of Drosophila melanogaster , 2006, Proceedings of the National Academy of Sciences.

[56]  James A. Cuff,et al.  A Bivalent Chromatin Structure Marks Key Developmental Genes in Embryonic Stem Cells , 2006, Cell.

[57]  M. Bucan,et al.  Promoter features related to tissue specificity as measured by Shannon entropy , 2005, Genome Biology.

[58]  D. S. Gross,et al.  Nuclease hypersensitive sites in chromatin. , 1988, Annual review of biochemistry.