Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline

Mapping genome-wide data to human subtelomeres has been problematic due to the incomplete assembly and challenges of low-copy repetitive DNA elements. Here, we provide updated human subtelomere sequence assemblies that were extended by filling telomere-adjacent gaps using clone-based resources. A bioinformatic pipeline incorporating multiread mapping for annotation of the updated assemblies using short-read data sets was developed and implemented. Annotation of subtelomeric sequence features as well as mapping of CTCF and cohesin binding sites using ChIP-seq data sets from multiple human cell types confirmed that CTCF and cohesin bind within 3 kb of the start of terminal repeat tracts at many, but not all, subtelomeres. CTCF and cohesin co-occupancy were also enriched near internal telomere-like sequence (ITS) islands and the nonterminal boundaries of subtelomere repeat elements (SREs) in transformed lymphoblastoid cell lines (LCLs) and human embryonic stem cell (ES) lines, but were not significantly enriched in the primary fibroblast IMR90 cell line. Subtelomeric CTCF and cohesin sites predicted by ChIP-seq using our bioinformatics pipeline (but not predicted when only uniquely mapping reads were considered) were consistently validated by ChIP-qPCR. The colocalized CTCF and cohesin sites in SRE regions are candidates for mediating long-range chromatin interactions in the transcript-rich SRE region. A public browser for the integrated display of short-read sequence-based annotations relative to key subtelomere features such as the start of each terminal repeat tract, SRE identity and organization, and subtelomeric gene models was established.

[1]  G. Benson,et al.  Tandem repeats finder: a program to analyze DNA sequences. , 1999, Nucleic acids research.

[2]  J. Campisi,et al.  The senescence-associated secretory phenotype: the dark side of tumor suppression. , 2010, Annual review of pathology.

[3]  Heather C. Mefford,et al.  The complex structure and dynamic evolution of human subtelomeres , 2002, Nature Reviews Genetics.

[4]  B. Roe,et al.  Sequence comparison of human and yeast telomeres identifies structurally distinct subtelomeric domains. , 1997, Human molecular genetics.

[5]  H. Riethman,et al.  A role for CTCF and cohesin in subtelomere chromatin organization, TERRA transcription, and telomere end protection , 2012, The EMBO journal.

[6]  D. Dorsett Cohesin: genomic insights into controlling gene transcription and development. , 2011, Current opinion in genetics & development.

[7]  International Human Genome Sequencing Consortium Finishing the euchromatic sequence of the human genome , 2004 .

[8]  Li Hsu,et al.  Elevated Rates of Sister Chromatid Exchange at Chromosome Ends , 2007, PLoS genetics.

[9]  Harold Riethman,et al.  Human telomere structure and biology. , 2008, Annual review of genomics and human genetics.

[10]  E. Gilson,et al.  Identification of a perinuclear positioning element in human subtelomeres that requires A‐type lamins and CTCF , 2009, The EMBO journal.

[11]  E. Gilson,et al.  D4Z4 as a prototype of CTCF and lamins-dependent insulator in human cells , 2010, Nucleus.

[12]  P. Park,et al.  Design and analysis of ChIP-seq experiments for DNA-binding proteins , 2008, Nature Biotechnology.

[13]  Marc D. Perry,et al.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia , 2012, Genome research.

[14]  Vishwanath R Iyer,et al.  Genome-wide Studies of CCCTC-binding Factor (CTCF) and Cohesin Provide Insight into Chromatin Structure and Regulation* , 2012, The Journal of Biological Chemistry.

[15]  Colin N. Dewey,et al.  Discovering Transcription Factor Binding Sites in Highly Repetitive Regions of Genomes with Multi-Read Analysis of ChIP-Seq Data , 2011, PLoS Comput. Biol..

[16]  Matthew T. Maurano,et al.  Widespread plasticity in CTCF occupancy linked to DNA methylation , 2012, Genome research.

[17]  G. Thomas,et al.  Allele-specific relative telomere lengths are inherited , 2006, Human Genetics.

[18]  J. Kissil,et al.  Cohesins localize with CTCF at the KSHV latency control region and at cellular c‐myc and H19/Igf2 insulators , 2008, The EMBO journal.

[19]  H. Riethman,et al.  Mapping and initial analysis of human subtelomeric sequence assemblies. , 2003, Genome research.

[20]  Shan Jiang,et al.  Telomerase reactivation reverses tissue degeneration in aged telomerase deficient mice , 2010, Nature.

[21]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[22]  P. Lieberman,et al.  Telomere Repeat Binding Factors TRF1, TRF2, and hRAP1 Modulate Replication of Epstein-Barr Virus OriP , 2003, Journal of Virology.

[23]  Joshua M. Korn,et al.  Mapping and sequencing of structural variation from eight human genomes , 2008, Nature.

[24]  J. Lingner,et al.  Molecular Dissection of Telomeric Repeat-Containing RNA Biogenesis Unveils the Presence of Distinct and Multiple Regulatory Pathways , 2010, Molecular and Cellular Biology.

[25]  J. Hess,et al.  MLL Associates with Telomeres and Regulates Telomeric Repeat-Containing RNA Transcription , 2009, Molecular and Cellular Biology.

[26]  Michael Brudno,et al.  SHRiMP: Accurate Mapping of Short Color-space Reads , 2009, PLoS Comput. Biol..

[27]  A. Jeffreys,et al.  Mechanisms underlying telomere repeat turnover, revealed by hypervariable variant repeat distribution patterns in the human Xp/Yp telomere. , 1995, The EMBO journal.

[28]  D. Church,et al.  Spidey: a tool for mRNA-to-genomic alignments. , 2001, Genome research.

[29]  David J. Reiss,et al.  CTCF physically links cohesin to chromatin , 2008, Proceedings of the National Academy of Sciences.

[30]  M. Blasco,et al.  Cohesin‐SA1 deficiency drives aneuploidy and tumourigenesis in mice due to impaired replication of telomeres , 2012, The EMBO journal.

[31]  M. MacDonald,et al.  The telomeric 60 kb of chromosome arm 4p is homologous to telomeric regions on 13p, 15p, 21p, and 22p. , 1992, Genomics.

[32]  Barbara J. Trask,et al.  Human subtelomeres are hot spots of interchromosomal recombination and segmental duplication , 2005, Nature.

[33]  R. Shiekhattar,et al.  Cell cycle regulation of chromatin at an origin of DNA replication , 2005, The EMBO journal.

[34]  V. Corces,et al.  CTCF: Master Weaver of the Genome , 2009, Cell.

[35]  David Kipling,et al.  Structural stability and chromosome-specific telomere length is governed by cis-acting determinants in humans. , 2006, Human molecular genetics.

[36]  Jianrong Wang,et al.  A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags , 2010, Bioinform..

[37]  B. Trask,et al.  Transcriptional activity of multiple copies of a subtelomerically located olfactory receptor gene that is polymorphic in number and location. , 2001, Human molecular genetics.

[38]  Ronald A. DePinho,et al.  Linking functional decline of telomeres, mitochondria and stem cells during ageing , 2010, Nature.

[39]  Gautier Koscielny,et al.  Ensembl 2012 , 2011, Nucleic Acids Res..

[40]  Victor G Corces,et al.  Chromatin insulators: regulatory mechanisms and epigenetic inheritance. , 2008, Molecular cell.

[41]  J. Shay,et al.  Does a sentinel or a subset of short telomeres determine replicative senescence? , 2004, Molecular biology of the cell.

[42]  Stephan Sauer,et al.  Cohesins Functionally Associate with CTCF on Mammalian Chromosome Arms , 2008, Cell.

[43]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[44]  R. Chawla,et al.  CpG-island promoters drive transcription of human telomeres. , 2009, RNA.

[45]  Tatiana A. Tatusova,et al.  NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy , 2011, Nucleic Acids Res..

[46]  C. Azzalin,et al.  Telomeric Repeat–Containing RNA and RNA Surveillance Factors at Mammalian Chromosome Ends , 2007, Science.

[47]  T. Hirano At the heart of the chromosome: SMC proteins in action , 2006, Nature Reviews Molecular Cell Biology.

[48]  E. Eichler,et al.  A Human Genome Structural Variation Sequencing Resource Reveals Insights into Mutational Mechanisms , 2010, Cell.

[49]  M. Blasco,et al.  Developmentally regulated transcription of mammalian telomeres by DNA-dependent RNA polymerase II , 2008, Nature Cell Biology.

[50]  R. Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[51]  A. Decottignies,et al.  Telomere length regulates TERRA levels through increased trimethylation of telomeric H3K9 and HP1α , 2012, Nature Structural &Molecular Biology.

[52]  E. Gilson,et al.  The D4Z4 Macrosatellite Repeat Acts as a CTCF and A-Type Lamins-Dependent Insulator in Facio-Scapulo-Humeral Dystrophy , 2009, PLoS genetics.

[53]  E. Gilson,et al.  The human TTAGGG repeat factors 1 and 2 bind to a subset of interstitial telomeric sequences and satellite repeats , 2011, Cell Research.

[54]  B. Trask,et al.  Human Subtelomeric WASH Genes Encode a New Subclass of the WASP Family , 2007, PLoS genetics.

[55]  H. Riethman,et al.  TERRA RNA binding to TRF2 facilitates heterochromatin formation and ORC recruitment at telomeres. , 2009, Molecular cell.

[56]  K. Nasmyth,et al.  The structure and function of SMC and kleisin complexes. , 2005, Annual review of biochemistry.

[57]  W. Vach,et al.  The pattern of chromosome-specific variations in telomere length in humans is determined by inherited, telomere-near factors and is maintained throughout life , 2003, Mechanisms of Ageing and Development.

[58]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[59]  R. Chen,et al.  Human telomeric proteins occupy selective interstitial sites , 2011, Cell Research.

[60]  H. Riethman Human subtelomeric copy number variations , 2009, Cytogenetic and Genome Research.

[61]  Victor V Lobanenkov,et al.  Does CTCF mediate between nuclear organization and gene expression? , 2010, BioEssays : news and reviews in molecular, cellular and developmental biology.

[62]  Susan Smith,et al.  Differential regulation of telomere and centromere cohesion by the Scc3 homologues SA1 and SA2, respectively, in human cells , 2009, The Journal of cell biology.

[63]  R. Davuluri,et al.  Identification of Host-Chromosome Binding Sites and Candidate Gene Targets for Kaposi's Sarcoma-Associated Herpesvirus LANA , 2012, Journal of Virology.

[64]  G. Thomas,et al.  Segmental polymorphisms in the proterminal regions of a subset of human chromosomes. , 2002, Genome research.

[65]  H. Aburatani,et al.  Cohesin mediates transcriptional insulation by CCCTC-binding factor , 2008, Nature.

[66]  Y. Segev,et al.  Hypomethylation of subtelomeric regions in ICF syndrome is associated with abnormally short telomeres and enhanced transcription from telomeric regions. , 2008, Human molecular genetics.

[67]  David A. Orlando,et al.  Mediator and Cohesin Connect Gene Expression and Chromatin Architecture , 2010, Nature.

[68]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[69]  Wilhelm Palm,et al.  How shelterin protects mammalian telomeres. , 2008, Annual review of genetics.

[70]  D. Odom,et al.  CTCF and Cohesin: Linking Gene Regulatory Elements with Their Targets , 2013, Cell.

[71]  Nigel Carter,et al.  Spreading of mammalian DNA‐damage response factors studied by ChIP‐chip at damaged telomeres , 2007, The EMBO journal.

[72]  Raymond K. Auerbach,et al.  A User's Guide to the Encyclopedia of DNA Elements (ENCODE) , 2011, PLoS biology.

[73]  Susan Smith,et al.  SA1 binds directly to DNA through its unique AT-hook to promote sister chromatid cohesion at telomeres , 2013, Journal of Cell Science.

[74]  Weihua Zeng,et al.  Cohesin: a critical chromatin organizer in mammalian gene regulation. , 2011, Biochemistry and cell biology = Biochimie et biologie cellulaire.

[75]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[76]  P. de Knijff,et al.  Specific sequence variations within the 4q35 region are associated with facioscapulohumeral muscular dystrophy. , 2007, American journal of human genetics.

[77]  Judith Campisi,et al.  Senescent cells as a source of inflammatory factors for tumor progression , 2010, Cancer and Metastasis Reviews.

[78]  H. Riethman,et al.  Human subtelomeric duplicon structure and organization , 2007, Genome Biology.