Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity.

The human body contains thousands of unique cell types, each with specialized functions. Cell identity is governed in large part by gene transcription programs, which are determined by regulatory elements encoded in DNA. To identify regulatory elements active in seven cell lines representative of diverse human cell types, we used DNase-seq and FAIRE-seq (Formaldehyde Assisted Isolation of Regulatory Elements) to map "open chromatin." Over 870,000 DNaseI or FAIRE sites, which correspond tightly to nucleosome-depleted regions, were identified across the seven cell lines, covering nearly 9% of the genome. The combination of DNaseI and FAIRE is more effective than either assay alone in identifying likely regulatory elements, as judged by coincidence with transcription factor binding locations determined in the same cells. Open chromatin common to all seven cell types tended to be at or near transcription start sites and to be coincident with CTCF binding sites, while open chromatin sites found in only one cell type were typically located away from transcription start sites and contained DNA motifs recognized by regulators of cell-type identity. We show that open chromatin regions bound by CTCF are potent insulators. We identified clusters of open regulatory elements (COREs) that were physically near each other and whose appearance was coordinated among one or more cell types. Gene expression and RNA Pol II binding data support the hypothesis that COREs control gene activity required for the maintenance of cell-type identity. This publicly available atlas of regulatory elements may prove valuable in identifying noncoding DNA sequence variants that are causally linked to human disease.

[1]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[2]  Sarah C. R. Elgin,et al.  The chromatin structure of specific genes: I. Evidence for higher order domains of defined DNA sequence , 1979, Cell.

[3]  K. Kok,et al.  Nuclease-hypersensitive sites in chromatin of the estrogen-inducible apoVLDL II gene of chicken. , 1985, Nucleic acids research.

[4]  J. Dooley,et al.  Orphan receptors. , 1993, British Journal of Cancer.

[5]  F. Sladek,et al.  The orphan receptors COUP-TF and HNF-4 serve as accessory factors required for induction of phosphoenolpyruvate carboxykinase gene transcription by glucocorticoids. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[6]  H. Schöler,et al.  Formation of Pluripotent Stem Cells in the Mammalian Embryo Depends on the POU Transcription Factor Oct4 , 1998, Cell.

[7]  A. West,et al.  The Protein CTCF Is Required for the Enhancer Blocking Activity of Vertebrate Insulators , 1999, Cell.

[8]  R. Rice,et al.  Functional AP1 and CRE response elements in the human keratinocyte transglutaminase promoter mediating Whn suppression. , 2000, Gene.

[9]  T. Taniguchi,et al.  IRF family of transcription factors as regulators of host defense. , 2001, Annual review of immunology.

[10]  T. Frayling,et al.  A distant upstream promoter of the HNF-4alpha gene connects the transcription factors involved in maturity-onset diabetes of the young. , 2001, Human molecular genetics.

[11]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[12]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[13]  E. Seto,et al.  Histone modifications. , 2003, Methods.

[14]  R. Flavell,et al.  Interchromosomal associations between alternatively expressed loci , 2005, Nature.

[15]  R. Fisher Statistical methods for research workers , 1927, Protoplasma.

[16]  J. Lieb,et al.  Cell Cycle–Specified Fluctuation of Nucleosome Occupancy at Gene Promoters , 2006, PLoS genetics.

[17]  T. Wolfsberg,et al.  DNase-chip: a high-resolution method to identify DNase I hypersensitive sites using tiled microarrays , 2006, Nature Methods.

[18]  M. Daly,et al.  Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). , 2005, Genome research.

[19]  K. K. Deal,et al.  Distant regulatory elements in a Sox10‐βGEO BAC transgene are required for expression of Sox10 in the enteric nervous system and other neural crest‐derived tissues , 2006, Developmental dynamics : an official publication of the American Association of Anatomists.

[20]  Alexander E. Kel,et al.  TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes , 2005, Nucleic Acids Res..

[21]  Jane M J Lin,et al.  Identification and Characterization of Cell Type–Specific and Ubiquitous Chromatin Regulatory Structures in the Human Genome , 2007, PLoS genetics.

[22]  V. Iyer,et al.  FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) isolates active regulatory elements from human chromatin. , 2007, Genome research.

[23]  Panayiotis V. Benos,et al.  STAMP: a web tool for exploring DNA-binding motif similarities , 2007, Nucleic Acids Res..

[24]  R. Young,et al.  A Chromatin Landmark and Transcription Initiation at Most Promoters in Human Cells , 2007, Cell.

[25]  E. Dejana,et al.  Foxs and Ets in the transcriptional regulation of endothelial cell differentiation and angiogenesis. , 2007, Biochimica et biophysica acta.

[26]  Jonghwan Kim,et al.  Mapping the chromosomal targets of STAT1 by Sequence Tag Analysis of Genomic Enrichment (STAGE). , 2007, Genome research.

[27]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[28]  M. Brand,et al.  Nucleosome and transcription activator antagonism at human β-globin locus control region DNase I hypersensitive sites , 2007, Nucleic acids research.

[29]  Terrence S. Furey,et al.  F-Seq: a feature density estimator for high-throughput sequence tags , 2008, Bioinform..

[30]  J. D. Engel,et al.  GATA1-related leukaemias , 2008, Nature Reviews Cancer.

[31]  Ole Winther,et al.  JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update , 2007, Nucleic Acids Res..

[32]  Z. Weng,et al.  High-Resolution Mapping and Characterization of Open Chromatin across the Genome , 2008, Cell.

[33]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[34]  R. Durbin,et al.  Mapping short DNA sequencing reads and variants calling using mapping quality scores ( Supplementary Text ) , 2008 .

[35]  P. Giresi,et al.  Isolation of active regulatory elements from eukaryotic chromatin using FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements). , 2009, Methods.

[36]  A. Sharov,et al.  Exhaustive Search for Over-represented DNA Sequence Motifs with CisFinder , 2009, DNA research : an international journal for rapid publication of reports on genes and genomes.

[37]  Jan Komorowski,et al.  Differential binding and co-binding pattern of FOXA1 and FOXA3 and their relation to H3K4me3 in HepG2 cells revealed by ChIP-seq , 2009, Genome Biology.

[38]  Victor X. Jin,et al.  Genomic Targets of the KRAB and SCAN Domain-containing Zinc Finger Protein 263* , 2009, The Journal of Biological Chemistry.

[39]  G. Crawford,et al.  Mapping regulatory elements by DNaseI hypersensitivity chip (DNase-Chip). , 2009, Methods in molecular biology.

[40]  Sayan Mukherjee,et al.  Evidence-ranked motif identification , 2010, Genome Biology.

[41]  Henriette O'Geen,et al.  Discovering hematopoietic mechanisms through genome-wide analysis of GATA factor chromatin occupancy. , 2009, Molecular cell.

[42]  William Stafford Noble,et al.  Global mapping of protein-DNA interactions in vivo by digital genomic footprinting , 2009, Nature Methods.

[43]  Martha L. Bulyk,et al.  UniPROBE: an online database of protein binding microarray data on protein–DNA interactions , 2008, Nucleic Acids Res..

[44]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[45]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.

[46]  Bas E. Dutilh,et al.  Genome-Wide Profiling of p63 DNA–Binding Sites Identifies an Element that Regulates Gene Expression during Limb Development in the 7q21 SHFM1 Locus , 2010, PLoS genetics.

[47]  E. Birney,et al.  Heritable Individual-Specific and Allele-Specific Chromatin Signatures in Humans , 2010, Science.

[48]  Stephen C. J. Parker,et al.  Global epigenomic analysis of primary human pancreatic islets provides insights into type 2 diabetes susceptibility loci. , 2010, Cell metabolism.

[49]  Zhi Xie,et al.  hPDI: a database of experimental human protein-DNA interactions , 2010, Bioinform..

[50]  G. Crawford,et al.  DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells. , 2010, Cold Spring Harbor protocols.

[51]  J. Dekker,et al.  Genomics tools for the unraveling of chromosome architecture , 2010, Nature Biotechnology.

[52]  Karen L. Mohlke,et al.  A map of open chromatin in human pancreatic islets , 2010, Nature Genetics.

[53]  Eugene Bolotin,et al.  Integrated approach for the identification of human hepatocyte nuclear factor 4α target genes using protein binding microarrays , 2010, Hepatology.

[54]  M. Reid,et al.  The Gerbich blood group system: a review , 2010, Immunohematology.

[55]  M. Gerstein,et al.  Close association of RNA polymerase II and many transcription factors with Pol III genes , 2010, Proceedings of the National Academy of Sciences.

[56]  David J. Arenillas,et al.  JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles , 2009, Nucleic Acids Res..

[57]  Timothy J. Durham,et al.  "Systematic" , 1966, Comput. J..

[58]  Jacob F. Degner,et al.  Sequence and Chromatin Accessibility Data Accurate Inference of Transcription Factor Binding from Dna Material Supplemental Open Access , 2022 .

[59]  F. Lu,et al.  Role of redox signaling regulation in propyl gallate-induced apoptosis of human leukemia cells. , 2011, Food and chemical toxicology : an international journal published for the British Industrial Biological Research Association.

[60]  Timothy J. Durham,et al.  Systematic analysis of chromatin state dynamics in nine human cell types , 2011, Nature.

[61]  Ryan A. Flynn,et al.  A unique chromatin signature uncovers early developmental enhancers in humans , 2011, Nature.

[62]  P. Cockerill Structure and function of active chromatin and DNase I hypersensitive sites , 2011, The FEBS journal.

[63]  Raymond K. Auerbach,et al.  A User's Guide to the Encyclopedia of DNA Elements (ENCODE) , 2011, PLoS biology.

[64]  E. Birney,et al.  High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells. , 2011, Genome research.