Early enhancer establishment and regulatory locus complexity shape transcriptional programs in hematopoietic differentiation

We carried out an integrative analysis of enhancer landscape and gene expression dynamics during hematopoietic differentiation using DNase-seq, histone mark ChIP-seq and RNA sequencing to model how the early establishment of enhancers and regulatory locus complexity govern gene expression changes at cell state transitions. We found that high-complexity genes—those with a large total number of DNase-mapped enhancers across the lineage—differ architecturally and functionally from low-complexity genes, achieve larger expression changes and are enriched for both cell type–specific and transition enhancers, which are established in hematopoietic stem and progenitor cells and maintained in one differentiated cell fate but lost in others. We then developed a quantitative model to accurately predict gene expression changes from the DNA sequence content and lineage history of active enhancers. Our method suggests a new mechanistic role for PU.1 at transition peaks during B cell specification and can be used to correct assignments of enhancers to genes.

[1]  B Calabretta,et al.  Overexpression of the zinc finger protein MZF1 inhibits hematopoietic development from embryonic stem cells: correlation with negative regulation of CD34 and c-myb promoter activity , 1995, Molecular and cellular biology.

[2]  R. Young,et al.  Super-Enhancers in the Control of Cell Identity and Disease , 2013, Cell.

[3]  Renaud Gaujoux,et al.  A flexible R package for nonnegative matrix factorization , 2010, BMC Bioinformatics.

[4]  B. Kee,et al.  Molecular Basis of Hematopoiesis , 2009 .

[5]  Jon C. Aster,et al.  NOTCH1–RBPJ complexes drive target gene expression through dynamic interactions with superenhancers , 2013, Proceedings of the National Academy of Sciences.

[6]  Jiangwen Zhang,et al.  Transcriptional regulation of the Ikzf1 locus. , 2013, Blood.

[7]  Yuka Kanno,et al.  STATs Shape the Active Enhancer Landscape of T Cell Populations , 2012, Cell.

[8]  J. Dekker,et al.  The long-range interaction landscape of gene promoters , 2012, Nature.

[9]  Graham M Lord,et al.  T-bet: a bridge between innate and adaptive immunity , 2013, Nature Reviews Immunology.

[10]  R. Young,et al.  Histone H3K27ac separates active from poised enhancers and predicts developmental state , 2010, Proceedings of the National Academy of Sciences.

[11]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[12]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[13]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[14]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[15]  David R. Hunter,et al.  mixtools: An R Package for Analyzing Mixture Models , 2009 .

[16]  Robert Gentleman,et al.  Software for Computing and Annotating Genomic Ranges , 2013, PLoS Comput. Biol..

[17]  M. Bonke,et al.  Chromatin "Prepattern" and Histone Modifiers in a Fate Choice for Liver and Pancreas , 2011 .

[18]  Marc D. Perry,et al.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia , 2012, Genome research.

[19]  M. Crossley,et al.  PU.1 and Haematopoietic Cell Fate: Dosage Matters , 2011, International journal of cell biology.

[20]  S. Hannenhalli,et al.  Enhancer networks revealed by correlated DNAse hypersensitivity states of enhancers , 2013, Nucleic acids research.

[21]  Heidi Dvinge,et al.  PeakAnalyzer: Genome-wide annotation of chromatin binding and modification loci , 2010, BMC Bioinformatics.

[22]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.

[23]  S. Smale Pioneer factors in embryonic stem cells and differentiation. , 2010, Current opinion in genetics & development.

[24]  Aaron C. Daugherty,et al.  H3K4me3 Breadth Is Linked to Cell Identity and Transcriptional Consistency , 2014, Cell.

[25]  Cameron S. Osborne,et al.  The pluripotent regulatory circuitry connecting promoters to their long-range interacting elements , 2015, Genome research.

[26]  R. Hill,et al.  Long range regulation of the sonic hedgehog gene. , 2014, Current opinion in genetics & development.

[27]  E. Rothenberg The chromatin landscape and transcription factors in T cell programming. , 2014, Trends in immunology.

[28]  Shane J. Neph,et al.  Developmental Fate and Cellular Maturity Encoded in Human Regulatory DNA Landscapes , 2013, Cell.

[29]  Philip A. Ewels,et al.  Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C , 2015, Nature Genetics.

[30]  J. Keith Joung,et al.  Interactome Maps of Mouse Gene Regulatory Domains Reveal Basic Principles of Transcriptional Regulation , 2013, Cell.

[31]  Shane J. Neph,et al.  Foxp3 Exploits a Pre-Existent Enhancer Landscape for Regulatory T Cell Lineage Specification , 2012, Cell.

[32]  Nathan C. Sheffield,et al.  The accessible chromatin landscape of the human genome , 2012, Nature.

[33]  Peter J. Bickel,et al.  Measuring reproducibility of high-throughput experiments , 2011, 1110.4705.

[34]  David A. Orlando,et al.  Master Transcription Factors and Mediator Establish Super-Enhancers at Key Cell Identity Genes , 2013, Cell.

[35]  Dong-er Zhang,et al.  CCAAT/Enhancer-binding Protein Activates the CD14 Promoter and Mediates Transforming Growth Factor β Signaling in Monocyte Development* , 1999, The Journal of Biological Chemistry.

[36]  Ryan A. Flynn,et al.  A unique chromatin signature uncovers early developmental enhancers in humans , 2011, Nature.

[37]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[38]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[39]  J. Lieb,et al.  What are super-enhancers? , 2014, Nature Genetics.

[40]  Lee E. Edsall,et al.  A map of the cis-regulatory sequences in the mouse genome , 2012, Nature.

[41]  Bradley E. Bernstein,et al.  Genome-wide Chromatin State Transitions Associated with Developmental and Environmental Cues , 2013, Cell.

[42]  James A. Cuff,et al.  A Bivalent Chromatin Structure Marks Key Developmental Genes in Embryonic Stem Cells , 2006, Cell.

[43]  N. Friedman,et al.  Chromatin state dynamics during blood formation , 2014, Science.

[44]  A. Rudensky,et al.  Role of conserved non-coding DNA elements in the Foxp3 gene in regulatory T-cell fate , 2010, Nature.

[45]  Jane M J Lin,et al.  Identification and Characterization of Cell Type–Specific and Ubiquitous Chromatin Regulatory Structures in the Human Genome , 2007, PLoS genetics.

[46]  Christina S. Leslie,et al.  SeqGL Identifies Context-Dependent Binding Signals in Genome-Wide Regulatory Element Maps , 2015, PLoS Comput. Biol..

[47]  Stephen C. J. Parker,et al.  Chromatin stretch enhancer states drive cell-specific gene regulation and harbor human disease risk variants , 2013, Proceedings of the National Academy of Sciences.

[48]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[49]  Howard Y. Chang,et al.  Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.

[50]  Jill M Dowen,et al.  Control of Cell Identity Genes Occurs in Insulated Neighborhoods in Mammalian Chromosomes , 2014, Cell.

[51]  Cory Y. McLean,et al.  GREAT improves functional interpretation of cis-regulatory regions , 2010, Nature Biotechnology.

[52]  Timothy J. Durham,et al.  "Systematic" , 1966, Comput. J..