Comprehensive Identification and Annotation of Cell Type-Specific and Ubiquitous CTCF-Binding Sites in the Human Genome

Chromatin insulators are DNA elements that regulate the level of gene expression either by preventing gene silencing through the maintenance of heterochromatin boundaries or by preventing gene activation by blocking interactions between enhancers and promoters. CCCTC-binding factor (CTCF), a ubiquitously expressed 11-zinc-finger DNA-binding protein, is the only protein implicated in the establishment of insulators in vertebrates. While CTCF has been implicated in diverse regulatory functions, CTCF has only been studied in a limited number of cell types across human genome. Thus, it is not clear whether the identified cell type-specific differences in CTCF-binding sites are functionally significant. Here, we identify and characterize cell type-specific and ubiquitous CTCF-binding sites in the human genome across 38 cell types designated by the Encyclopedia of DNA Elements (ENCODE) consortium. These cell type-specific and ubiquitous CTCF-binding sites show uniquely versatile transcriptional functions and characteristic chromatin features. In addition, we confirm the insulator barrier function of CTCF-binding and explore the novel function of CTCF in DNA replication. These results represent a critical step toward the comprehensive and systematic understanding of CTCF-dependent insulators and their versatile roles in the human genome.

[1]  Jane M J Lin,et al.  Identification and Characterization of Cell Type–Specific and Ubiquitous Chromatin Regulatory Structures in the Human Genome , 2007, PLoS genetics.

[2]  J. Tower,et al.  Functionally distinct, sequence-specific replicator and origin elements are required for Drosophila chorion gene amplification. , 2001, Genes & development.

[3]  Steven J. M. Jones,et al.  Circos: an information aesthetic for comparative genomics. , 2009, Genome research.

[4]  Michael O Dorschner,et al.  Sequencing newly replicated DNA reveals widespread plasticity in human replication timing , 2009, Proceedings of the National Academy of Sciences.

[5]  A. West,et al.  Insulators: many functions, many mechanisms. , 2002, Genes & development.

[6]  Kristian Helin,et al.  Genome-wide mapping of Polycomb target genes unravels their roles in cell fate transitions. , 2006, Genes & development.

[7]  P. Neiman,et al.  CTCF, a conserved nuclear factor required for optimal transcriptional activity of the chicken c-myc gene, is an 11-Zn-finger protein differentially expressed in multiple forms , 1993, Molecular and cellular biology.

[8]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[9]  Jeannie T. Lee Molecular Links between X-Inactivation and Autosomal Imprinting: X-Inactivation as a Driving Force for the Evolution of Imprinting? , 2003, Current Biology.

[10]  S Miyano,et al.  Open source clustering software. , 2004, Bioinformatics.

[11]  Shirley M. Tilghman,et al.  CTCF mediates methylation-sensitive enhancer-blocking activity at the H19/Igf2 locus , 2000, Nature.

[12]  Olivier Cuvier,et al.  Genome-Wide Mapping of Boundary Element-Associated Factor (BEAF) Binding Sites in Drosophila melanogaster Links BEAF to Transcription , 2009, Molecular and Cellular Biology.

[13]  Alexander E. Kel,et al.  TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes , 2005, Nucleic Acids Res..

[14]  J. Zlatanova,et al.  CTCF and its protein partners: divide and rule? , 2009, Journal of Cell Science.

[15]  V. Corces,et al.  A chromatin insulator determines the nuclear localization of DNA. , 2000, Molecular cell.

[16]  T. Mikkelsen,et al.  Genome-wide maps of chromatin state in pluripotent and lineage-committed cells , 2007, Nature.

[17]  R Ohlsson,et al.  CTCF is a uniquely versatile transcription regulator linked to epigenetics and disease. , 2001, Trends in genetics : TIG.

[18]  David J. Arenillas,et al.  JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles , 2009, Nucleic Acids Res..

[19]  R. Kamakaka,et al.  Chromatin insulators. , 2006, Annual review of genetics.

[20]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[21]  Tobias Straub,et al.  Active promoters and insulators are marked by the centrosomal protein 190 , 2009, The EMBO journal.

[22]  Timothy J. Durham,et al.  "Systematic" , 1966, Comput. J..

[23]  Hui Ling Chen,et al.  CTCF Mediates Interchromosomal Colocalization Between Igf2/H19 and Wsb1/Nf1 , 2006, Science.

[24]  A. West,et al.  The Protein CTCF Is Required for the Enhancer Blocking Activity of Vertebrate Insulators , 1999, Cell.

[25]  B D Athey,et al.  Chromatin fibers are left-handed double helices with diameter and mass per unit length that depend on linker length. , 1986, Biophysical journal.

[26]  Boris Adryan,et al.  CTCF Genomic Binding Sites in Drosophila and the Organisation of the Bithorax Complex , 2007, PLoS genetics.

[27]  Martha L. Bulyk,et al.  UniPROBE, update 2011: expanded content and search tools in the online database of protein-binding microarray data on protein–DNA interactions , 2010, Nucleic Acids Res..

[28]  Antoine H. F. M. Peters,et al.  Polycomb group proteins Ezh2 and Rnf2 direct genomic contraction and imprinted repression in early mouse embryos. , 2008, Developmental cell.

[29]  V. Corces,et al.  CTCF: Master Weaver of the Genome , 2009, Cell.

[30]  Victor V Lobanenkov,et al.  Functional association of CTCF with the insulator upstream of the H19 gene is parent of origin-specific and methylation-sensitive , 2000, Current Biology.

[31]  S. Tapscott,et al.  CTCF-binding sites flank CTG/CAG repeats and form a methylation-sensitive insulator at the DM1 locus , 2001, Nature Genetics.

[32]  I. Simon,et al.  Developmental regulation of DNA replication timing at the human β globin locus , 2001 .

[33]  G. Pfeifer,et al.  Maternal-specific footprints at putative CTCF sites in the H19 imprinting control region give evidence for insulator function , 2000, Current Biology.

[34]  R. Young,et al.  A Chromatin Landmark and Transcription Initiation at Most Promoters in Human Cells , 2007, Cell.

[35]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[36]  D. Ward,et al.  Delineation of DNA replication time zones by fluorescence in situ hybridization. , 1992, The EMBO journal.

[37]  Jesse R. Raab,et al.  Insulators and promoters: closer than we think , 2010, Nature Reviews Genetics.

[38]  The role of CTCF in regulating nuclear organization , 2008, The Journal of experimental medicine.

[39]  Victor V Lobanenkov,et al.  A novel sequence-specific DNA binding protein which interacts with three regularly spaced direct repeats of the CCCTC-motif in the 5'-flanking sequence of the chicken c-myc gene. , 1990, Oncogene.

[40]  Michael Gribskov,et al.  Combining evidence using p-values: application to sequence homology searches , 1998, Bioinform..

[41]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[42]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[43]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[44]  Victor G Corces,et al.  Three subclasses of a Drosophila insulator show distinct and cell type-specific genomic distributions. , 2009, Genes & development.

[45]  Christopher D. Brown,et al.  A Comprehensive Map of Insulator Elements for the Drosophila Genome , 2010, PLoS genetics.

[46]  Elissa P. Lei,et al.  Coordinated control of dCTCF and gypsy chromatin insulators in Drosophila. , 2007, Molecular cell.

[47]  M. Bartolomei,et al.  Transgenic RNAi Reveals Essential Function for CTCF in H19 Gene Imprinting , 2004, Science.

[48]  I. Simon,et al.  Developmental regulation of DNA replication timing at the human beta globin locus. , 2001, The EMBO journal.

[49]  E. Rubio,et al.  Thec-myc Insulator Element and Matrix Attachment Regions Definethe c-myc ChromosomalDomain , 2003, Molecular and Cellular Biology.

[50]  D. Timm,et al.  Asymmetries in the nucleosome core particle at 2.5 A resolution. , 2000, Acta crystallographica. Section D, Biological crystallography.

[51]  David Haussler,et al.  ENCODE whole-genome data in the UCSC genome browser (2011 update) , 2010, Nucleic Acids Res..

[52]  G. Felsenfeld,et al.  Methylation of a CTCF-dependent boundary controls imprinted expression of the Igf2 gene , 2000, Nature.

[53]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[54]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[55]  Victor G Corces,et al.  Boundary elements and nuclear organization , 2004, Biology of the cell.

[56]  Xiaochen Bo,et al.  Genome-wide analysis of the relationships between DNaseI HS, histone modifications and gene expression reveals distinct modes of chromatin domains , 2011, Nucleic acids research.

[57]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[58]  Douglas A. Hosack,et al.  Identifying biological themes within lists of genes with EASE , 2003, Genome Biology.

[59]  A. Vostrov,et al.  A Region to the N-terminal Side of the CTCF Zinc Finger Domain Is Essential for Activating Transcription from the Amyloid Precursor Protein Promoter* , 2002, The Journal of Biological Chemistry.

[60]  J. Zlatanova,et al.  CCCTC-binding factor: to loop or to bridge , 2009, Cellular and Molecular Life Sciences.

[61]  A. Vostrov,et al.  The zinc finger protein CTCF binds to the APBbeta domain of the amyloid beta-protein precursor promoter. Evidence for a role in transcriptional activation. , 1997, The Journal of biological chemistry.

[62]  Victor G Corces,et al.  Chromatin insulators: regulatory mechanisms and epigenetic inheritance. , 2008, Molecular cell.

[63]  S. Berger The complex language of chromatin regulation during transcription , 2007, Nature.

[64]  G. Felsenfeld,et al.  Insulators: exploiting transcriptional and epigenetic mechanisms , 2006, Nature Reviews Genetics.

[65]  Satoru Miyano,et al.  Open source clustering software , 2004 .

[66]  A. Krumm,et al.  Targeted Deletion of Multiple CTCF-Binding Elements in the Human C-MYC Gene Reveals a Requirement for CTCF in C-MYC Expression , 2009, PloS one.

[67]  T. Mikkelsen,et al.  Genome-scale DNA methylation maps of pluripotent and differentiated cells , 2008, Nature.

[68]  E. J. Kuhn,et al.  Genomic insulators: connecting properties to mechanism. , 2003, Current opinion in cell biology.

[69]  J. D. Engel,et al.  Human β-Globin Locus Control Region HS5Contains CTCF- and Developmental Stage-Dependent Enhancer-BlockingActivity in ErythroidCells , 2003, Molecular and Cellular Biology.

[70]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.

[71]  Nathaniel D. Heintzman,et al.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome , 2007, Nature Genetics.

[72]  J. Tower,et al.  A transcriptional insulator element, the su(Hw) binding site, protects a chromosomal DNA replication origin from position effects , 1997, Molecular and cellular biology.

[73]  Raymond K. Auerbach,et al.  A User's Guide to the Encyclopedia of DNA Elements (ENCODE) , 2011, PLoS biology.

[74]  Dustin E. Schones,et al.  Dynamic Regulation of Nucleosome Positioning in the Human Genome , 2008, Cell.

[75]  V. Corces,et al.  Chromatin insulators: lessons from the fly. , 2009, Briefings in functional genomics & proteomics.

[76]  Cizhong Jiang,et al.  Nucleosome positioning and gene regulation: advances through genomics , 2009, Nature Reviews Genetics.

[77]  C. Disteche,et al.  Boundaries between chromosomal domains of X inactivation and escape bind CTCF and lack CpG methylation during early development. , 2005, Developmental cell.

[78]  William Stafford Noble,et al.  Unsupervised segmentation of continuous genomic data , 2007, Bioinform..

[79]  T. Mikkelsen,et al.  Systematic discovery of regulatory motifs in conserved regions of the human genome, including thousands of CTCF insulator sites , 2007, Proceedings of the National Academy of Sciences.

[80]  Manolis Kellis,et al.  Discovery and Characterization of Chromatin States for Systematic Annotation of the Human Genome , 2011, RECOMB.

[81]  Jeannie T. Lee,et al.  Evidence that homologous X-chromosome pairing requires transcription and Ctcf protein , 2007, Nature Genetics.

[82]  M. Bodén,et al.  Associating transcription factor-binding site motifs with target GO terms and target genes , 2008, Nucleic acids research.

[83]  Z. Weng,et al.  The Insulator Binding Protein CTCF Positions 20 Nucleosomes around Its Binding Sites across the Human Genome , 2008, PLoS genetics.

[84]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[85]  Dustin E. Schones,et al.  Global analysis of the insulator binding protein CTCF in chromatin barrier regions reveals demarcation of active and repressive domains. , 2008, Genome research.

[86]  Raja Jothi,et al.  Genome-wide identification of in vivo protein–DNA binding sites from ChIP-Seq data , 2008, Nucleic acids research.

[87]  Z. Weng,et al.  High-Resolution Mapping and Characterization of Open Chromatin across the Genome , 2008, Cell.

[88]  Chee Seng Chan,et al.  CTCF-Mediated Functional Chromatin Interactome in Pluripotent Cells , 2011, Nature Genetics.

[89]  Ravi Sachidanandam,et al.  Genome wide ChIP-chip analyses reveal important roles for CTCF in Drosophila genome organization. , 2009, Developmental biology.

[90]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[91]  V. Corces,et al.  Chromatin insulators and boundaries: effects on transcription and nuclear organization. , 2001, Annual review of genetics.

[92]  R. Ghirlando,et al.  Chromatin boundaries and chromatin domains. , 2004, Cold Spring Harbor symposia on quantitative biology.

[93]  William Stafford Noble,et al.  Quantifying similarity between motifs , 2007, Genome Biology.

[94]  Eric S. Lander,et al.  Genomic Maps and Comparative Analysis of Histone Modifications in Human and Mouse , 2005, Cell.

[95]  Victor V Lobanenkov,et al.  Thyroid hormone‐regulated enhancer blocking: cooperation of CTCF and thyroid hormone receptor , 2003, The EMBO journal.

[96]  Mikael Bodén,et al.  Assigning roles to DNA regulatory motifs using comparative genomics , 2010, Bioinform..

[97]  R. Renkawitz,et al.  Modular structure of a chicken lysozyme silencer: Involvement of an unusual thyroid hormone receptor binding site , 1990, Cell.

[98]  Michael Q. Zhang,et al.  Analysis of the Vertebrate Insulator Protein CTCF-Binding Sites in the Human Genome , 2007, Cell.

[99]  P. Neiman,et al.  An exceptionally conserved transcriptional repressor, CTCF, employs different combinations of zinc fingers to bind diverged promoter sequences of avian and mammalian c-myc oncogenes , 1996, Molecular and cellular biology.