Modeling cis-regulation with a compendium of genome-wide histone H3K27ac profiles

Model-based analysis of regulation of gene expression (MARGE) is a framework for interpreting the relationship between the H3K27ac chromatin environment and differentially expressed gene sets. The framework has three main functions: MARGE-potential, MARGE-express, and MARGE-cistrome. MARGE-potential defines a regulatory potential (RP) for each gene as the sum of H3K27ac ChIP-seq signals weighted by a function of genomic distance from the transcription start site. The MARGE framework includes a compendium of RPs derived from 365 human and 267 mouse H3K27ac ChIP-seq data sets. Relative RPs, scaled using this compendium, are superior to superenhancers in predicting BET (bromodomain and extraterminal domain) -inhibitor repressed genes. MARGE-express, which uses logistic regression to retrieve relevant H3K27ac profiles from the compendium to accurately model a query set of differentially expressed genes, was tested on 671 diverse gene sets from MSigDB. MARGE-cistrome adopts a novel semisupervised learning approach to identify cis-regulatory elements regulating a gene set. MARGE-cistrome exploits information from H3K27ac signal at DNase I hypersensitive sites identified from published human and mouse DNase-seq data. We tested the framework on newly generated RNA-seq and H3K27ac ChIP-seq profiles upon siRNA silencing of multiple transcriptional and epigenetic regulators in a prostate cancer cell line, LNCaP-abl. MARGE-cistrome can predict the binding sites of silenced transcription factors without matched H3K27ac ChIP-seq data. Even when the matching H3K27ac ChIP-seq profiles are available, MARGE leverages public H3K27ac profiles to enhance these data. This study demonstrates the advantage of integrating a large compendium of historical epigenetic data for genomic studies of transcriptional regulation.

[1]  Gordon K Smyth,et al.  Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2004, Statistical applications in genetics and molecular biology.

[2]  Nathan C. Sheffield,et al.  Predicting cell-type–specific gene expression from regions of open chromatin , 2012, Genome research.

[3]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[4]  Shane J. Neph,et al.  An expansive human regulatory lexicon encoded in transcription factor footprints , 2012, Nature.

[5]  Manolis Kellis,et al.  Large-scale epigenome imputation improves data quality and disease variant enrichment , 2015, Nature Biotechnology.

[6]  David A. Orlando,et al.  Master Transcription Factors and Mediator Establish Super-Enhancers at Key Cell Identity Genes , 2013, Cell.

[7]  P. Phillips,et al.  A novel kinase, AATYK induces and promotes neuronal differentiation in a human neuroblastoma (SH-SY5Y) cell line. , 2000, Brain research. Molecular brain research.

[8]  Clifford A. Meyer,et al.  Identifying and mitigating bias in next-generation sequencing methods for chromatin biology , 2014, Nature Reviews Genetics.

[9]  Howard Y. Chang,et al.  Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.

[10]  J. Pike,et al.  Selective Distal Enhancer Control of the Mmp13 Gene Identified through Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) Genomic Deletions* , 2015, The Journal of Biological Chemistry.

[11]  Qian Wang,et al.  A comprehensive view of nuclear receptor cancer cistromes. , 2011, Cancer research.

[12]  A. Shilatifard,et al.  Zic2 is an enhancer-binding factor required for embryonic stem cell specification , 2015, Molecular cell.

[13]  S. Knapp,et al.  RVX-208, an inhibitor of BET transcriptional regulators with selectivity for the second bromodomain , 2013, Proceedings of the National Academy of Sciences.

[14]  Tom Misteli,et al.  The double bromodomain protein Brd4 binds to acetylated chromatin during interphase and mitosis , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Julia A. Lasserre,et al.  Histone modification levels are predictive for gene expression , 2010, Proceedings of the National Academy of Sciences.

[16]  Y. Wada,et al.  Brd2 is required for cell cycle exit and neuronal differentiation through the E2F1 pathway in mouse neuroepithelial cells. , 2012, Biochemical and biophysical research communications.

[17]  R. Young,et al.  Histone H3K27ac separates active from poised enhancers and predicts developmental state , 2010, Proceedings of the National Academy of Sciences.

[18]  Ryan A. Flynn,et al.  A unique chromatin signature uncovers early developmental enhancers in humans , 2011, Nature.

[19]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[20]  T. Mikkelsen,et al.  Genome-wide maps of chromatin state in pluripotent and lineage-committed cells , 2007, Nature.

[21]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[22]  R. Young,et al.  Super-Enhancers in the Control of Cell Identity and Disease , 2013, Cell.

[23]  K. Pienta,et al.  A hierarchical network of transcription factors governs androgen receptor-dependent prostate cancer growth. , 2007, Molecular cell.

[24]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[25]  G. K. Davis,et al.  Phenotypic robustness conferred by apparently redundant transcriptional enhancers , 2010, Nature.

[26]  Helga Thorvaldsdóttir,et al.  Molecular signatures database (MSigDB) 3.0 , 2011, Bioinform..

[27]  Matthew C. Canver,et al.  BCL11A enhancer dissection by Cas9-mediated in situ saturating mutagenesis , 2015, Nature.

[28]  R. Krumlauf,et al.  Long-range regulation by shared retinoic acid response elements modulates dynamic expression of posterior Hoxb genes in CNS development. , 2014, Developmental biology.

[29]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[30]  Manolis Kellis,et al.  Discovery and Characterization of Chromatin States for Systematic Annotation of the Human Genome , 2011, RECOMB.

[31]  G. Bourque,et al.  Transposable elements have rewired the core regulatory network of human embryonic stem cells , 2010, Nature Genetics.

[32]  Wouter de Laat,et al.  A Regulatory Archipelago Controls Hox Genes Transcription in Digits , 2011, Cell.

[33]  C. Verfaillie,et al.  Zic3 enhances the generation of mouse induced pluripotent stem cells. , 2013, Stem cells and development.

[34]  David A. Orlando,et al.  Selective Inhibition of Tumor Oncogenes by Disruption of Super-Enhancers , 2013, Cell.

[35]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[36]  Sven Rahmann,et al.  Genome analysis , 2022 .

[37]  M. Levine,et al.  Shadow Enhancers as a Source of Evolutionary Novelty , 2008, Science.

[38]  Clifford A. Meyer,et al.  Genome-wide analysis of estrogen receptor binding sites , 2006, Nature Genetics.

[39]  Hanfei Sun,et al.  Target analysis by integration of transcriptome and ChIP-seq data with BETA , 2013, Nature Protocols.

[40]  Eric Legius,et al.  PRC2 loss amplifies Ras-driven transcription and confers sensitivity to BRD4-based therapies , 2014, Nature.

[41]  E. Birney,et al.  High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells. , 2011, Genome research.

[42]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[43]  W. Wong,et al.  ChIP-Seq of transcription factors predicts absolute and differential gene expression in embryonic stem cells , 2009, Proceedings of the National Academy of Sciences.

[44]  Katy A. Muzikar,et al.  Repression of DNA-binding dependent glucocorticoid receptor-mediated gene expression , 2009, Proceedings of the National Academy of Sciences.

[45]  E. Furlong,et al.  Transcription factors: from enhancer binding to developmental control , 2012, Nature Reviews Genetics.

[46]  Eric S. Lander,et al.  Comparative Epigenomic Analysis of Murine and Human Adipogenesis , 2010, Cell.

[47]  William Stafford Noble,et al.  Integrative annotation of chromatin elements from ENCODE data , 2012, Nucleic acids research.

[48]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[49]  Jun S. Liu,et al.  Inference of transcriptional regulation in cancers , 2015, Proceedings of the National Academy of Sciences.

[50]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[51]  Mark Groudine,et al.  The hypersensitive sites of the murine β-globin locus control region act independently to affect nuclear localization and transcriptional elongation. , 2012, Blood.

[52]  M. Daly,et al.  Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). , 2005, Genome research.

[53]  Hongkai Ji,et al.  A comparative analysis of genome-wide chromatin immunoprecipitation data for mammalian transcription factors , 2006, Nucleic acids research.

[54]  Jonathan Schug,et al.  Genome-wide analysis reveals conserved and divergent features of Notch1/RBPJ binding in human and murine T-lymphoblastic leukemia cells , 2011, Proceedings of the National Academy of Sciences.

[55]  Nathan C. Sheffield,et al.  The accessible chromatin landscape of the human genome , 2012, Nature.

[56]  David Z. Chen,et al.  Architecture of the human regulatory network derived from ENCODE data , 2012, Nature.

[57]  Shane J. Neph,et al.  Circuitry and Dynamics of Human Transcription Factor Regulatory Networks , 2012, Cell.

[58]  William Stafford Noble,et al.  Global mapping of protein-DNA interactions in vivo by digital genomic footprinting , 2009, Nature Methods.

[59]  Nathaniel D. Heintzman,et al.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome , 2007, Nature Genetics.

[60]  Howard Y. Chang,et al.  Control of somatic tissue differentiation by the long non-coding RNA TINCR , 2012, Nature.

[61]  Timothy J. Durham,et al.  Systematic analysis of chromatin state dynamics in nine human cell types , 2011, Nature.

[62]  Charles Y. Lin,et al.  Discovery and characterization of super-enhancer-associated dependencies in diffuse large B cell lymphoma. , 2013, Cancer cell.

[63]  D. Price P-TEFb, a Cyclin-Dependent Kinase Controlling Elongation by RNA Polymerase II , 2000, Molecular and Cellular Biology.

[64]  M. Kretz,et al.  A LncRNA-MAF:MAFB transcription factor network regulates epidermal differentiation. , 2015, Developmental cell.

[65]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[66]  Elhanan Borenstein,et al.  Conservation of trans-acting circuitry during mammalian regulatory evolution , 2014, Nature.

[67]  Christopher J. Ott,et al.  BET bromodomain inhibition targets both c-Myc and IL7R in high-risk acute lymphoblastic leukemia. , 2012, Blood.

[68]  J. Brady,et al.  The bromodomain protein Brd4 is a positive regulatory component of P-TEFb and stimulates RNA polymerase II-dependent transcription. , 2005, Molecular cell.

[69]  Michael Levine,et al.  Multiple enhancers ensure precision of gap gene-expression patterns in the Drosophila embryo , 2011, Proceedings of the National Academy of Sciences.

[70]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.