Regulation potential of transcribed simple repeated sequences in developing neurons

Simple repeated sequences (SRSs), defined as tandem iterations of microsatellite- to satellite-sized DNA units, occupy a substantial part of the human genome. Some of these elements are known to be transcribed in the context of repeat expansion disorders. Mounting evidence suggests that the transcription of SRSs may also contribute to normal cellular functions. Here, we used genome-wide bioinformatics approaches to systematically examine SRS transcriptional activity in cells undergoing neuronal differentiation. We identified thousands of long noncoding RNAs containing >200-nucleotide-long SRSs (SRS-lncRNAs), with hundreds of these transcripts significantly upregulated in the neural lineage. We show that SRS-lncRNAs often originate from telomere-proximal regions and that they have a strong potential to form multivalent contacts with a wide range of RNA-binding proteins. Our analyses also uncovered a cluster of neurally upregulated SRS-lncRNAs encoded in a centromere-proximal part of chromosome 9, which underwent an evolutionarily recent segmental duplication. Using a newly established in vitro system for rapid neuronal differentiation of induced pluripotent stem cells, we demonstrate that at least some of the bioinformatically predicted SRS-lncRNAs, including those encoded in the segmentally duplicated part of chromosome 9, indeed increase their expression in developing neurons to readily detectable levels. These data suggest that many SRSs may be expressed in a cell type and developmental stage-specific manner, providing a valuable resource for further studies focused on the functional consequences of SRS-lncRNAs in the normal development of the human brain.

[1]  Peter K. Todd,et al.  Native functions of short tandem repeats , 2023, eLife.

[2]  Howard Y. Chang,et al.  Long non-coding RNAs: definitions, functions, challenges and recommendations , 2023, Nature Reviews Molecular Cell Biology.

[3]  I. Feliciello,et al.  Satellite DNAs in Health and Disease , 2022, Genes.

[4]  Jiuhong Kang,et al.  Cpmer: A new conserved eEF1A2-binding partner that regulates Eomes translation and cardiomyocyte differentiation , 2022, Stem cell reports.

[5]  B. McStay The p-Arms of Human Acrocentric Chromosomes Play by a Different Set of Rules. , 2022, Annual review of genomics and human genetics.

[6]  K. Sobczak,et al.  Partners in crime: Proteins implicated in RNA repeat expansion diseases , 2022, Wiley interdisciplinary reviews. RNA.

[7]  N. Altemose,et al.  A classical revival: Human satellite DNAs enter the genomics era. , 2022, Seminars in cell & developmental biology.

[8]  E. Makeyev,et al.  Analysis of RNA-containing compartments by hybridization and proximity labeling in cultured human cells , 2022, STAR protocols.

[9]  Y. Nagai,et al.  The molecular pathogenesis of repeat expansion diseases. , 2021, Biochemical Society transactions.

[10]  E. Makeyev,et al.  Hybridization-proximity labeling reveals spatially ordered interactions of nuclear RNA compartments , 2021, Molecular cell.

[11]  G. Trigiante,et al.  Emerging Roles of Repetitive and Repeat-Containing RNA in Nuclear and Chromatin Organization and Gene Expression , 2021, Frontiers in Cell and Developmental Biology.

[12]  Sara B. Linker,et al.  The role of retrotransposable elements in ageing and age-associated diseases , 2021, Nature.

[13]  W. Hevers,et al.  NGN2 induces diverse neuron types from human pluripotency , 2021, Stem cell reports.

[14]  J. Mendell,et al.  NORAD-induced Pumilio phase separation is required for genome stability , 2021, Nature.

[15]  K. Kitagawa,et al.  The Role of Human Centromeric RNA in Chromosome Stability , 2021, Frontiers in Molecular Biosciences.

[16]  Yafei Yin,et al.  Localization of RNAs in the nucleus: cis- and trans- regulation , 2021, RNA biology.

[17]  S. Bione,et al.  TERRA transcription destabilizes telomere integrity to initiate break-induced replication in human ALT cells , 2021, Nature Communications.

[18]  F. Johnson,et al.  TERRA G-quadruplex RNA interaction with TRF2 GAR domain is required for telomere integrity , 2021, Scientific Reports.

[19]  Gene W. Yeo,et al.  Repeat RNA expansion disorders of the nervous system: post-transcriptional mechanisms and therapeutic strategies , 2020, Critical reviews in biochemistry and molecular biology.

[20]  C. Sim,et al.  Transcript Assembly and Quantification by RNA-Seq Reveals Significant Differences in Gene Expression and Genetic Variants in Mosquitoes of the Culex pipiens (Diptera: Culicidae) Complex , 2020, Journal of Medical Entomology.

[21]  Ling-Ling Chen,et al.  Mechanisms of Long Noncoding RNA Nuclear Retention. , 2020, Trends in biochemical sciences.

[22]  U. Ala Competing Endogenous RNAs, Non-Coding RNAs and Diseases: An Intertwined Story , 2020, Cells.

[23]  J. Déjardin,et al.  Telomeric Chromatin and TERRA. , 2020, Journal of molecular biology.

[24]  T. Hirose,et al.  Short Tandem Repeat-Enriched Architectural RNAs in Nuclear Bodies: Functions and Associated Diseases , 2020, Non-coding RNA.

[25]  C. Rougeulle,et al.  X chromosome inactivation in human development , 2020, Development.

[26]  Howard Y. Chang,et al.  Structural modularity of the XIST ribonucleoprotein complex , 2019, Nature Communications.

[27]  U. Schmitz,et al.  The changing paradigm of intron retention: regulation, ramifications and recipes , 2019, Nucleic acids research.

[28]  T. Natsume,et al.  Two distinct nuclear stress bodies containing different sets of RNA-binding proteins are formed with HSATIII architectural noncoding RNAs upon thermal stress exposure. , 2019, Biochemical and biophysical research communications.

[29]  M. Swanson,et al.  Short Tandem Repeat Expansions and RNA-Mediated Pathogenesis in Myotonic Dystrophy , 2019, International journal of molecular sciences.

[30]  J. Iwakiri,et al.  LncRNA-dependent nuclear stress bodies promote intron retention through SR protein phosphorylation , 2019, bioRxiv.

[31]  G. Tartaglia,et al.  An Integrative Study of Protein-RNA Condensates Identifies Scaffolding RNAs and Reveals Players in Fragile X-Associated Tremor/Ataxia Syndrome , 2018, Cell reports.

[32]  E. Makeyev,et al.  A Short Tandem Repeat-Enriched RNA Assembles a Nuclear Compartment to Control Alternative Splicing and Promote Cell Survival , 2018, Molecular cell.

[33]  Alexander F. Palazzo,et al.  Sequence Determinants for Nuclear Retention and Cytoplasmic Export of mRNAs and lncRNAs , 2018, Front. Genet..

[34]  A. Isaacs,et al.  C9orf72-mediated ALS and FTD: multiple pathways to disease , 2018, Nature Reviews Neurology.

[35]  Jacob C. Ulirsch,et al.  The NORAD lncRNA assembles a topoisomerase complex critical for genome stability , 2018, Nature.

[36]  Shannon M. McNulty,et al.  Alpha satellite DNA biology: finding function in the recesses of the genome , 2018, Chromosome Research.

[37]  Michael D. Blower,et al.  Centromere Biology: Transcription Goes on Stage , 2018, Molecular and Cellular Biology.

[38]  Michael S. Fernandopulle,et al.  Transcription Factor–Mediated Differentiation of Human iPSCs into Neurons , 2018, Current protocols in cell biology.

[39]  I. Grummt,et al.  Dynamic regulation of nucleolar architecture. , 2018, Current opinion in cell biology.

[40]  M. Swanson,et al.  Intron retention induced by microsatellite expansions as a disease biomarker , 2018, Proceedings of the National Academy of Sciences.

[41]  N. Brockdorff,et al.  hnRNPK Recruits PCGF3/5-PRC1 to the Xist RNA B-Repeat to Establish Polycomb-Mediated Chromosomal Silencing , 2017, Molecular cell.

[42]  J. Doudna,et al.  Widespread Translational Remodeling during Human Neuronal Differentiation. , 2017, Cell reports.

[43]  Jeannie T. Lee,et al.  Repeat E anchors Xist RNA to the inactive X chromosomal compartment through CDKN1A-interacting protein (CIZ1) , 2017, Proceedings of the National Academy of Sciences.

[44]  P. Gao,et al.  Non-coding RNAs participate in the regulatory network of CLDN4 via ceRNA mediated miRNA evasion , 2017, Nature Communications.

[45]  Shannon M. McNulty,et al.  Human Centromeres Produce Chromosome-Specific and Array-Specific Alpha Satellite Transcripts that Are Complexed with CENP-A and CENP-C. , 2017, Developmental cell.

[46]  Shannon M. McNulty,et al.  RNA-dependent stabilization of SUV39H1 at constitutive heterochromatin , 2017, eLife.

[47]  Kevin R. Parker,et al.  ciRS-7 exonic sequence is embedded in a long non-coding RNA locus , 2017, bioRxiv.

[48]  P. Jolivet,et al.  Telomere Length Determines TERRA and R-Loop Regulation through the Cell Cycle , 2017, Cell.

[49]  N. Brockdorff,et al.  PCGF3/5–PRC1 initiates Polycomb recruitment in X chromosome inactivation , 2017, Science.

[50]  Eric T. Wang,et al.  Myotonic dystrophy: disease repeat range, penetrance, age of onset, and relationship between repeat size and phenotypes. , 2017, Current opinion in genetics & development.

[51]  N. Brockdorff,et al.  The nuclear matrix protein CIZ1 facilitates localization of Xist RNA to the inactive X-chromosome territory , 2017, Genes & development.

[52]  W. Krzyzosiak,et al.  Structural Characteristics of Simple RNA Repeats Associated with Disease and their Deleterious Protein Interactions , 2017, Front. Cell. Neurosci..

[53]  R. McCarthy,et al.  LncPRESS1 Is a p53-Regulated LncRNA that Safeguards Pluripotency by Disrupting SIRT6-Mediated De-acetylation of Histone H3K56. , 2016, Molecular cell.

[54]  J. Lawrence,et al.  SAF-A Requirement in Anchoring XIST RNA to Chromatin Varies in Transformed and Primary Cells. , 2016, Developmental cell.

[55]  N. Brockdorff,et al.  Control of Chromosomal Localization of Xist by hnRNP U Family Molecules. , 2016, Developmental cell.

[56]  S. Krobitsch,et al.  The bromodomain protein BRD4 regulates splicing during heat shock , 2016, Nucleic acids research.

[57]  Jeffrey T Leek,et al.  Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown , 2016, Nature Protocols.

[58]  Michael D. Blower Centromeric Transcription Regulates Aurora-B Localization and Activation. , 2016, Cell reports.

[59]  Lior Pachter,et al.  Near-optimal probabilistic RNA-seq quantification , 2016, Nature Biotechnology.

[60]  Tsung-Cheng Chang,et al.  Noncoding RNA NORAD Regulates Genomic Stability by Sequestering PUMILIO Proteins , 2016, Cell.

[61]  S. Itzkovitz,et al.  A conserved abundant cytoplasmic long noncoding RNA modulates repression by Pumilio proteins in human cells , 2015, Nature Communications.

[62]  Sebastien M. Weyn-Vanhentenryck,et al.  MBNL Sequestration by Toxic RNAs and RNA Misprocessing in the Myotonic Dystrophy Brain. , 2015, Cell reports.

[63]  William Stafford Noble,et al.  The MEME Suite , 2015, Nucleic Acids Res..

[64]  D. Gallie Faculty Opinions recommendation of The Xist lncRNA interacts directly with SHARP to silence transcription through HDAC3. , 2015 .

[65]  Qiangfeng Cliff Zhang,et al.  Systematic Discovery of Xist RNA Binding Proteins , 2015, Cell.

[66]  G. Meola,et al.  Myotonic dystrophies: An update on clinical aspects, genetic, pathology, and molecular pathomechanisms. , 2015, Biochimica et biophysica acta.

[67]  Chengyu Liu,et al.  Transcription Activator-Like Effector Nuclease (TALEN)-Mediated CLYBL Targeting Enables Enhanced Transgene Expression and One-Step Generation of Dual Reporter Human Induced Pluripotent Stem Cell (iPSC) and Neural Stem Cell (NSC) Lines , 2015, PloS one.

[68]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[69]  J. Rougemont,et al.  Functional characterization of the TERRA transcriptome at damaged telomeres , 2014, Nature Communications.

[70]  Y. Dalal,et al.  A long non-coding RNA is required for targeting centromeric protein A to the human centromere , 2014, eLife.

[71]  T. Tani,et al.  Involvement of satellite I noncoding RNA in regulation of chromosome segregation , 2014, Genes to cells : devoted to molecular & cellular mechanisms.

[72]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[73]  P. Chartrand,et al.  Telomeric noncoding RNA TERRA is induced by telomere shortening to nucleate telomerase molecules at short telomeres. , 2013, Molecular cell.

[74]  R. Nalavade,et al.  Mechanisms of RNA-induced toxicity in CAG repeat disorders , 2013, Cell Death and Disease.

[75]  Brendan J. Frey,et al.  A compendium of RNA-binding motifs for decoding gene regulation , 2013, Nature.

[76]  T. Südhof,et al.  Rapid Single-Step Induction of Functional Neurons from Human Pluripotent Stem Cells , 2013, Neuron.

[77]  Sebastian D. Mackowiak,et al.  Circular RNAs are a large class of animal RNAs with regulatory potency , 2013, Nature.

[78]  J. Kjems,et al.  Natural RNA circles function as efficient microRNA sponges , 2013, Nature.

[79]  P. Lieberman,et al.  Formation of telomeric repeat-containing RNA (TERRA) foci in highly proliferating mouse cerebellar neuronal progenitors and medulloblastoma , 2012, Journal of Cell Science.

[80]  A. Decottignies,et al.  Telomere length regulates TERRA levels through increased trimethylation of telomeric H3K9 and HP1α , 2012, Nature Structural &Molecular Biology.

[81]  Z. Q. Lim,et al.  Coordinated regulation of neuronal mRNA steady-state levels through developmentally controlled intron retention. , 2012, Genes & development.

[82]  A. Masuda,et al.  CUGBP1 and MBNL1 preferentially bind to 3′ UTRs and facilitate mRNA decay , 2012, Scientific Reports.

[83]  V. Rybin,et al.  The Xist RNA A-repeat comprises a novel AUCG tetraloop fold and a platform for multimerization. , 2011, RNA.

[84]  M. Kyba,et al.  Inducible Cassette Exchange: A Rapid and Efficient System Enabling Conditional Gene Expression in Embryonic Stem and Primary Cells , 2011, Stem cells.

[85]  Sandy Chang,et al.  TERRA and hnRNPA1 Orchestrate an RPA-to-POT1 Switch on Telomeric Single-Stranded DNA , 2010, Nature.

[86]  N. Brockdorff,et al.  The matrix protein hnRNP U is required for chromosomal localization of Xist RNA. , 2010, Developmental cell.

[87]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[88]  S. Richard,et al.  Sam68 sequestration and partial loss of function are associated with splicing alterations in FXTAS patients , 2010, The EMBO journal.

[89]  H. Riethman,et al.  TERRA RNA binding to TRF2 facilitates heterochromatin formation and ORC recruitment at telomeres. , 2009, Molecular cell.

[90]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[91]  G. Biamonti,et al.  Cellular stress and RNA splicing. , 2009, Trends in biochemical sciences.

[92]  H. Kazazian,et al.  Retrotransposons Revisited: The Restraint and Rehabilitation of Parasites , 2008, Cell.

[93]  R. Hannan,et al.  Centromere RNA is a key component for the assembly of nucleoproteins at the nucleolus and centromere. , 2007, Genome research.

[94]  M. Nei,et al.  Concerted and birth-and-death evolution of multigene families. , 2005, Annual review of genetics.

[95]  M. Swanson,et al.  Myotonic dystrophy type 1 is associated with nuclear foci of mutant RNA, sequestration of muscleblind proteins and deregulated alternative splicing in neurons. , 2004, Human molecular genetics.

[96]  Caroline Jolly,et al.  A key role for stress-induced satellite III transcripts in the relocalization of splicing factors into nuclear stress granules , 2004, Journal of Cell Science.

[97]  M. Vigneron,et al.  Stress-induced transcription of satellite III repeats , 2004, The Journal of cell biology.

[98]  D. Haussler,et al.  Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[99]  M. Adams,et al.  Recent Segmental Duplications in the Human Genome , 2002, Science.

[100]  S. Riva,et al.  Stress-induced nuclear bodies are sites of accumulation of pre-mRNA processing factors. , 2001, Molecular biology of the cell.

[101]  M. Swanson,et al.  Muscleblind localizes to nuclear foci of aberrant RNA in myotonic dystrophy types 1 and 2. , 2001, Human molecular genetics.

[102]  S. Naylor,et al.  Myotonic Dystrophy Type 2 Caused by a CCTG Expansion in Intron 1 of ZNF9 , 2001, Science.

[103]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[104]  B. Byrne,et al.  Recruitment of human muscleblind proteins to (CUG)n expansions associated with myotonic dystrophy , 2000, The EMBO journal.

[105]  J. Sulston,et al.  Genomic sequence and transcriptional profile of the boundary between pericentromeric satellites and genes on human chromosome arm 10q. , 2000, Human molecular genetics.

[106]  S. Riva,et al.  A novel hnRNP protein (HAP/SAF-B) enters a subset of hnRNP complexes and relocates in nuclear granules in response to heat shock. , 1999, Journal of cell science.

[107]  笹嶋 唯博,et al.  Nuclear Matrix Protein , 1997 .

[108]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[109]  E. Nanba,et al.  [Myotonic dystrophy]. , 2005, Nihon rinsho. Japanese journal of clinical medicine.

[110]  N. Archidiacono,et al.  Human paralogs of KIAA0187 were created through independent pericentromeric-directed and chromosome-specific duplication mechanisms. , 2002, Genome research.

[111]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.