Disease-Associated Short Tandem Repeats Co-localize with Chromatin Domain Boundaries

More than 25 inherited human disorders are caused by the unstable expansion of repetitive DNA sequences termed short tandem repeats (STRs). A fundamental unresolved question is why some STRs are susceptible to pathologic expansion, whereas thousands of repeat tracts across the human genome are relatively stable. Here, we discover that nearly all disease-associated STRs (daSTRs) are located at boundaries demarcating 3D chromatin domains. We identify a subset of boundaries with markedly higher CpG island density compared to the rest of the genome. daSTRs specifically localize to ultra-high-density CpG island boundaries, suggesting they might be hotspots for epigenetic misregulation or topological disruption linked to STR expansion. Fragile X syndrome patients exhibit severe boundary disruption in a manner that correlates with local loss of CTCF occupancy and the degree of FMR1 silencing. Our data uncover higher-order chromatin architecture as a new dimension in understanding repeat expansion disorders.

[1]  C. Nusbaum,et al.  Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. , 2006, Genome research.

[2]  Daniel S. Day,et al.  Activation of proto-oncogenes by disruption of chromosome neighborhoods , 2015, Science.

[3]  Harry T Orr,et al.  Trinucleotide repeat disorders. , 2007, Annual review of neuroscience.

[4]  Thomas G. Gilgenast,et al.  Local Genome Topology Can Exhibit an Incompletely Rewired 3D-Folding State during Somatic Cell Reprogramming. , 2016, Cell stem cell.

[5]  S. Tapscott,et al.  CTCF-binding sites flank CTG/CAG repeats and form a methylation-sensitive insulator at the DM1 locus , 2001, Nature Genetics.

[6]  Jing He,et al.  Allelic reprogramming of 3D chromatin architecture during early mammalian development , 2017, Nature.

[7]  S. Sunkin,et al.  CTCF Regulates Ataxin-7 Expression through Promotion of a Convergently Transcribed, Antisense Noncoding RNA , 2011, Neuron.

[8]  L. Ettwiller,et al.  Functional and topological characteristics of mammalian regulatory domains , 2014, Genome research.

[9]  Lee E. Edsall,et al.  Distinct epigenomic landscapes of pluripotent and lineage-committed human cells. , 2010, Cell stem cell.

[10]  Melissa J. Landrum,et al.  RefSeq: an update on mammalian reference sequences , 2013, Nucleic Acids Res..

[11]  O. Yanuka,et al.  Reversion of FMR1 Methylation and Silencing by Editing the Triplet Repeats in Fragile X iPSC-Derived Neurons. , 2015, Cell reports.

[12]  Jane S. Paulsen,et al.  Identification of Genetic Factors that Modify Clinical Onset of Huntington’s Disease , 2015, Cell.

[13]  R. Wells,et al.  DNA triplet repeat expansion and mismatch repair. , 2015, Annual review of biochemistry.

[14]  Jennifer E. Phillips-Cremins,et al.  Architectural Protein Subclasses Shape 3D Organization of Genomes during Lineage Commitment , 2013, Cell.

[15]  W. Vogel,et al.  The DMPK gene of severely affected myotonic dystrophy patients is hypermethylated proximal to the largely expanded CTG repeat. , 1998, American journal of human genetics.

[16]  C. Richards,et al.  Somatic heterogeneity of the CTG repeat in myotonic dystrophy is age and size dependent. , 1995, American journal of human genetics.

[17]  C. E. Pearson Slipping while sleeping? Trinucleotide repeat expansions in germ cells. , 2003, Trends in molecular medicine.

[18]  A. Visel,et al.  Disruptions of Topological Chromatin Domains Cause Pathogenic Rewiring of Gene-Enhancer Interactions , 2015, Cell.

[19]  K. Robertson DNA methylation and human disease , 2005, Nature Reviews Genetics.

[20]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[21]  S. Mirkin Expandable DNA repeats and human disease , 2007, Nature.

[22]  H. Hameister,et al.  Demethylation, reactivation, and destabilization of human fragile X full-mutation alleles in mouse embryocarcinoma cells. , 2001, American journal of human genetics.

[23]  Danielle S. Bassett,et al.  Detecting hierarchical 3-D genome domain reconfiguration with network modularity , 2016, bioRxiv.

[24]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[25]  Yaniv Erlich,et al.  Genome-wide profiling of heritable and de novo STR variations , 2016, Nature Methods.

[26]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[27]  S. Warren,et al.  Characterization of the full fragile X syndrome mutation in fetal gametes , 1997, Nature Genetics.

[28]  Daning Lu,et al.  Chromosome conformation elucidates regulatory relationships in developing human brain , 2016, Nature.

[29]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[30]  Yaniv Erlich,et al.  Profiling short tandem repeats from short reads. , 2013, Methods in molecular biology.

[31]  Peter A. Jones,et al.  Cancer genetics and epigenetics: two sides of the same coin? , 2012, Cancer cell.

[32]  David Haussler,et al.  ENCODE Data in the UCSC Genome Browser: year 5 update , 2012, Nucleic Acids Res..

[33]  J. Sedat,et al.  Spatial partitioning of the regulatory landscape of the X-inactivation centre , 2012, Nature.

[34]  Jian Yu,et al.  Studying human mutations by sperm typing: instability of CAG trinucleotide repeats in the human androgen receptor gene , 1994, Nature Genetics.

[35]  Jennifer E. Phillips-Cremins,et al.  YY1 and CTCF orchestrate a 3D chromatin looping switch during early neural lineage commitment , 2017, Genome research.

[36]  J. Taylor,et al.  Repeat expansion disease: progress and puzzles in disease pathogenesis , 2010, Nature Reviews Genetics.

[37]  W. Vogel,et al.  Unusual mutations in high functioning fragile X males: apparent instability of expanded unmethylated CGG repeats. , 1998, Journal of medical genetics.

[38]  S. Tavaré,et al.  Single sperm analysis of the trinucleotide repeats in the Huntington's disease gene: quantification of the mutation frequency spectrum. , 1995, Human molecular genetics.

[39]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[40]  Shawn M. Gillespie,et al.  Insulator dysfunction and oncogene activation in IDH mutant gliomas , 2015, Nature.

[41]  Stephen J. Tapscott,et al.  CTCF cis-Regulates Trinucleotide Repeat Instability in an Epigenetic Manner: A Novel Basis for Mutational Hot Spot Determination , 2008, PLoS genetics.

[42]  Danielle S Bassett,et al.  Detecting hierarchical genome folding with network modularity , 2018, Nature Methods.

[43]  A. Quinlan BEDTools: The Swiss‐Army Tool for Genome Feature Analysis , 2014, Current protocols in bioinformatics.

[44]  M. Hayden,et al.  Somatic and gonadal mosaicism of the Huntington disease gene CAG repeat in brain and sperm , 1994, Nature Genetics.

[45]  C. McMurray Mechanisms of trinucleotide repeat instability during human development , 2010, Nature Reviews Genetics.

[46]  D. Monckton,et al.  Cis-acting modifiers of expanded CAG/CTG triplet repeat expandability: associations with flanking GC content and proximity to CpG islands. , 1999, Human molecular genetics.

[47]  L. Mirny,et al.  Iterative Correction of Hi-C Data Reveals Hallmarks of Chromosome Organization , 2012, Nature Methods.

[48]  Anthony D. Schmitt,et al.  A Compendium of Chromatin Contact Maps Reveals Spatially Active Regions in the Human Genome. , 2016, Cell reports.

[49]  D. Chitayat,et al.  Tissue- and age-specific DNA replication patterns at the CTG/CAG-expanded human myotonic dystrophy type 1 locus , 2010, Nature Structural &Molecular Biology.

[50]  David L. Nelson,et al.  The Unstable Repeats—Three Evolving Faces of Neurological Disease , 2013, Neuron.

[51]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[52]  J. Dekker,et al.  Mapping networks of physical interactions between genomic elements using 5C technology , 2007, Nature Protocols.

[53]  Fidel Ramírez,et al.  deepTools2: a next generation web server for deep-sequencing data analysis , 2016, Nucleic Acids Res..

[54]  Jing Liang,et al.  Chromatin architecture reorganization during stem cell differentiation , 2015, Nature.

[55]  R. Young,et al.  Histone H3K27ac separates active from poised enhancers and predicts developmental state , 2010, Proceedings of the National Academy of Sciences.

[56]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[57]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[58]  Jill M Dowen,et al.  Control of Cell Identity Genes Occurs in Insulated Neighborhoods in Mammalian Chromosomes , 2014, Cell.

[59]  C. E. Pearson,et al.  Repeat instability: mechanisms of dynamic mutations , 2005, Nature Reviews Genetics.

[60]  Pedro P. Rocha,et al.  CTCF establishes discrete functional chromatin domains at the Hox clusters during differentiation , 2015, Science.

[61]  Yaniv Erlich,et al.  Abundant contribution of short tandem repeats to gene expression variation in humans , 2015, Nature Genetics.

[62]  Peter A. Jones,et al.  Epigenetics in cancer. , 2010, Carcinogenesis.

[63]  G. Felsenfeld,et al.  Critical DNA Binding Interactions of the Insulator Protein CTCF , 2007, Journal of Biological Chemistry.

[64]  B. Popovich,et al.  Hypomethylation of an expanded FMR1 allele is not associated with a global DNA methylation defect. , 1999, American journal of human genetics.

[65]  W. Vogel,et al.  Mitotic stability of fragile X mutations in differentiated cells indicates early post–conceptional trinucleotide repeat expansion , 1993, Nature Genetics.

[66]  J. Sutcliffe,et al.  Identification of a gene (FMR-1) containing a CGG repeat coincident with a breakpoint cluster region exhibiting length variation in fragile X syndrome , 1991, Cell.