MeCP2 is a microsatellite binding protein that protects CA repeats from nucleosome invasion

MeCP2 binds hydroxymethylated CA repeats Despites of decades of research on the Rett syndrome protein MeCP2, its function remains unclear. Ibrahim et al. show that MeCP2 is a hydroxymethylated cytosine-adenosine (CA) repeat-binding protein that modulates chromatin architecture at a distance from the transcription start site (see the Perspective by Zhou and Zoghbi). MeCP2 accumulates and spreads around modified CA repeats and competes for nucleosome occupancy. Loss of MeCP2 results in a widespread increase in nucleosome density inside lamina-associated domains and transcriptional dysregulation of genes enriched in CA repeats. These results shed light on the underlying molecular mechanism of Rett syndrome, a severe disease associated with mutations in MeCP2. Science, abd5581, this issue p. eabd5581; see also abj5027, p. 1390 The Rett syndrome protein MeCP2 is a DNA microsatellite CA repeat–binding protein that regulates chromatin architecture. INTRODUCTION Rett syndrome is a severe neurodevelopmental disorder that is mainly caused by mutations in the methyl-CpG-binding protein 2 gene (MeCP2). Initially, MeCP2 was identified as an essential brain protein that binds to methylated CpG (mCG) via its methyl-binding domain (MBD) and acts as transcriptional repressor. However, during early brain development, the postnatal accumulation of MeCP2 parallels the genome-wide high-level accumulation of hydroxymethylcytosine (hmC) and methylated CpA (mCA), suggesting that MeCP2 may also recognize and bind to DNA sequences that contain these modified nucleotides. The ability of MeCP2 to recognize both mCA and hmC as well as mCG has led to conflicting conclusions regarding its function in transcriptional regulation, because these cytosine modifications are associated with either repression (mCA and mCG) or activation (hmC) of transcription. The unambiguous identification of the MeCP2 target sequence(s) would help to clarify this issue. RATIONALE CA repeats (CAn) represent ~1% of the mouse genome and belong to the microsatellite family. They are widely distributed throughout the genome and have been shown to affect transcription of nearby genes. Our recent data reveal that CAn are methylated (mCAn) or hydroxymethylated (hmCAn) in various cell types. In a search for proteins that could specifically recognize and bind these CA repeats, we identified MeCP2 as a specific reader of CA repeats. We hypothesized that the methylation status of CAn is essential for the recognition and binding of MeCP2, possibly through recognition of the modified nucleotides in CA repeats with distinct affinities, relevant for its neuronal function in transcriptional regulation. RESULTS Here we show that within the MBD family, MeCP2 is the only protein that specifically recognizes and binds to CA repeats, with much stronger affinity than mCG and mCA. MeCP2 selectively recognizes CA repeat DNA in a strand-specific manner and requires at least five consecutive CA dinucleotides to optimally bind DNA. While MeCP2 can bind in vitro to modified and nonmodified CA repeats, it exhibits impressive selectivity toward hydroxymethylated CA repeats, which are modified by DNA (cytosine-5)-methyltransferase 3A. The modified cytosine, only when located within a CA repeat, serves as a nucleation point for both MeCP2 accumulation and spreading around the repeat, which, in turn, correlates with nucleosome exclusion. In addition, loss of MeCP2 results in widespread increase in nucleosome density within lamina-associated domains (LADs) and transcriptional dysregulation of CA repeat–enriched genes located outside LADs. We have also dissected the molecular basis of the MeCP2 hydroxymethylated CA repeat recognition by solving the crystal structure of MeCP2 in complex with hmCAn. The CA repeat creates a well-defined DNA shape, with a considerably modified geometry, including a widened major groove and negative roll parameters, located precisely at the modification site. We show that the molecular recognition of the hydroxymethylated CA repeat specifically occurs through Arg133, a key MeCP2 residue whose mutation causes Rett syndrome. CONCLUSION Our work identifies MeCP2 as a hydroxymethylated CA repeat DNA binding protein that targets the 5hmC-CA-rich sequence, which are specifically located on one strand. Our data provide insights into the origin of Rett syndrome at the molecular level and suggest that this neurodevelopmental disorder could be viewed as a chromatin disease, originating from the inability of mutant MeCP2 to bind and protect the CA repeats from nucleosome invasion. Our results open a previously unexplored area of research focused on understanding the role of specific protein binding to microsatellites and other repeats in neurological diseases of unknown etiology. The Rett syndrome protein MeCP2 is a microsatellite CA repeat binding protein regulating chromatin architecture. MeCP2 is a microsatellite binding protein that specifically recognizes hydroxymethylated CA repeats. Depletion of MeCP2 alters the chromatin organization of CA repeats and LADs and results in nucleosome accumulation on CA repeats and genome-wide transcriptional dysregulation. WT, wild-type; KO, knockout. The Rett syndrome protein MeCP2 was described as a methyl-CpG-binding protein, but its exact function remains unknown. Here we show that mouse MeCP2 is a microsatellite binding protein that specifically recognizes hydroxymethylated CA repeats. Depletion of MeCP2 alters chromatin organization of CA repeats and lamina-associated domains and results in nucleosome accumulation on CA repeats and genome-wide transcriptional dysregulation. The structure of MeCP2 in complex with a hydroxymethylated CA repeat reveals a characteristic DNA shape, with considerably modified geometry at the 5-hydroxymethylcytosine, which is recognized specifically by Arg133, a key residue whose mutation causes Rett syndrome. Our work identifies MeCP2 as a microsatellite DNA binding protein that targets the 5hmC-modified CA-rich strand and maintains genome regions nucleosome-free, suggesting a role for MeCP2 dysfunction in Rett syndrome.

[1]  Harrison W. Gabel,et al.  DNMT3A Haploinsufficiency Results in Behavioral Deficits and Global Epigenomic Dysregulation Shared across Neurodevelopmental Disorders , 2020, bioRxiv.

[2]  A. Bird,et al.  Neuronal non-CG methylation is an essential target for MeCP2 function , 2020, bioRxiv.

[3]  Zhaolan Zhou,et al.  Genomic insights into MeCP2 function: A role for the maintenance of chromatin architecture , 2019, Current Opinion in Neurobiology.

[4]  Melissa Gymrek,et al.  The impact of short tandem repeat variation on gene expression , 2019, Nature Genetics.

[5]  Ana M. M. Oliveira,et al.  MeCP2: A Critical Regulator of Chromatin in Neurodevelopment and Adult Brain Function , 2019, International journal of molecular sciences.

[6]  M. Gill,et al.  Transcriptomic Studies in Mouse Models of Rett Syndrome: A Review , 2019, Neuroscience.

[7]  B. van Steensel,et al.  Promoter-Intrinsic and Local Chromatin Features Determine Gene Repression in LADs , 2019, Cell.

[8]  Harrison W. Gabel,et al.  Early-Life Gene Expression in Neurons Modulates Lasting Epigenetic States , 2017, Cell.

[9]  N. Heintz,et al.  5-hydroxymethylcytosine accumulation in postmitotic neurons results in functional demethylation of expressed genes , 2017, Proceedings of the National Academy of Sciences.

[10]  Bas van Steensel,et al.  Lamina-Associated Domains: Links with Chromosome Architecture, Heterochromatin, and Gene Repression , 2017, Cell.

[11]  A. Bird,et al.  MeCP2 recognizes cytosine methylated tri-nucleotide and di-nucleotide sequences to tune transcription in the mammalian brain , 2017, PLoS genetics.

[12]  S. Gras,et al.  Combinatorial DNA methylation codes at repetitive elements. , 2017, Genome research.

[13]  M. Ozturk,et al.  Histone H3.3 regulates mitotic progression in mouse embryonic fibroblasts. , 2017, Biochemistry and cell biology = Biochimie et biologie cellulaire.

[14]  Harrison W. Gabel,et al.  DNA methylation in the gene body influences MeCP2-mediated gene repression , 2016, Proceedings of the National Academy of Sciences.

[15]  Fidel Ramírez,et al.  deepTools2: a next generation web server for deep-sequencing data analysis , 2016, Nucleic Acids Res..

[16]  D. Breitsprecher,et al.  An Automated Microscale Thermophoresis Screening Approach for Fragment-Based Lead Discovery , 2015, Journal of biomolecular screening.

[17]  Yaniv Erlich,et al.  Abundant contribution of short tandem repeats to gene expression variation in humans , 2015, Nature Genetics.

[18]  Harrison W. Gabel,et al.  Reading the unique DNA methylation landscape of the brain: Non-CpG methylation, hydroxymethylation, and MeCP2 , 2015, Proceedings of the National Academy of Sciences.

[19]  Harrison W. Gabel,et al.  Disruption of DNA methylation-dependent long gene repression in Rett syndrome , 2015, Nature.

[20]  A. Bird,et al.  Rett syndrome: a complex disorder with simple roots , 2015, Nature Reviews Genetics.

[21]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[22]  J. Greally,et al.  Unusual Characteristics of the DNA Binding Domain of Epigenetic Regulatory Protein MeCP2 Determine Its Binding Specificity , 2014, Biochemistry.

[23]  T. Mak,et al.  ANP32E is a histone chaperone that removes H2A.Z from chromatin , 2014, Nature.

[24]  Zhaolan Zhou,et al.  Loss of MeCP2 function is associated with distinct gene expression changes in the striatum , 2013, Neurobiology of Disease.

[25]  P. Tam,et al.  Chromatin context and ncRNA highlight targets of MeCP2 in brain , 2013, RNA biology.

[26]  Matthew D. Schultz,et al.  Global Epigenomic Reconfiguration During Mammalian Brain Development , 2013, Science.

[27]  A. Bird,et al.  Rett syndrome mutations abolish the interaction of MeCP2 with the NCoR/SMRT co-repressor , 2013, Nature Neuroscience.

[28]  Yi Zhang,et al.  Genome-wide Analysis Reveals TET- and TDG-Dependent 5-Methylcytosine Oxidation Dynamics , 2013, Cell.

[29]  F. Lienert,et al.  Methylation-Dependent and -Independent Genomic Targeting Principles of the MBD Protein Family , 2013, Cell.

[30]  A. H. Smits,et al.  Dynamic Readers for 5-(Hydroxy)Methylcytosine and Its Oxidized Derivatives , 2013, Cell.

[31]  Angela D. Wilkins,et al.  An AT-Hook Domain in MeCP2 Determines the Clinical Course of Rett Syndrome and Related Disorders , 2013, Cell.

[32]  N. Heintz,et al.  MeCP2 binds to 5hmc enriched within active genes and accessible chromatin in the nervous system , 2012, Cell.

[33]  Rodney C. Samaco,et al.  Crh and Oprm1 mediate anxiety-related behavior and social approach in a mouse model of MECP2 duplication syndrome , 2011, Nature Genetics.

[34]  Harrison W. Gabel,et al.  Genome-Wide Activity-Dependent MeCP2 Phosphorylation Regulates Nervous System Development and Function , 2011, Neuron.

[35]  Christophe Blanchet,et al.  CURVES+ web server for analyzing and visualizing the helical, backbone and groove parameters of nucleic acid structures , 2011, Nucleic Acids Res..

[36]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[37]  Felix Krueger,et al.  Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications , 2011, Bioinform..

[38]  Clemens Vonrhein,et al.  Data processing and analysis with the autoPROC toolbox , 2011, Acta crystallographica. Section D, Biological crystallography.

[39]  Tao Ye,et al.  seqMINER: an integrated ChIP-seq data interpretation platform , 2010, Nucleic acids research.

[40]  Fred H. Gage,et al.  A Model for Neural Development and Treatment of Rett Syndrome Using Human Induced Pluripotent Stem Cells , 2010, Cell.

[41]  Dieter Braun,et al.  Protein-binding assays in biological liquids using microscale thermophoresis. , 2010, Nature communications.

[42]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[43]  P. Flicek,et al.  Molecular maps of the reorganization of genome-nuclear lamina interactions during differentiation. , 2010, Molecular cell.

[44]  P. Emsley,et al.  Features and development of Coot , 2010, Acta crystallographica. Section D, Biological crystallography.

[45]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[46]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[47]  Lee E. Edsall,et al.  Human DNA methylomes at base resolution show widespread epigenomic differences , 2009, Nature.

[48]  Christina Thaller,et al.  Mouse models of MeCP2 disorders share gene expression changes in the cerebellum and hypothalamus , 2009, Human molecular genetics.

[49]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[50]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[51]  Stephen T. C. Wong,et al.  MeCP2, a Key Contributor to Neurological Disease, Activates and Represses Transcription , 2008, Science.

[52]  A. Bird,et al.  MeCP2 binding to DNA depends upon hydration at methyl-CpG. , 2008, Molecular cell.

[53]  Randy J. Read,et al.  Phaser crystallographic software , 2007, Journal of applied crystallography.

[54]  Valerie H Adams,et al.  Intrinsic Disorder and Autonomous Domain Function in the Multifunctional Nuclear Protein, MeCP2* , 2007, Journal of Biological Chemistry.

[55]  L. Pasquier,et al.  Spectrum and distribution of MECP2 mutations in 424 Rett syndrome patients: a molecular update. , 2006, European journal of medical genetics.

[56]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[57]  Alok J. Saldanha,et al.  Java Treeview - extensible visualization of microarray data , 2004, Bioinform..

[58]  W. Olson,et al.  3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. , 2003, Nucleic acids research.

[59]  Albrecht Bindereif,et al.  Novel functional role of CA repeats and hnRNP L in RNA stability. , 2003, RNA.

[60]  Randy J Read,et al.  Electronic Reprint Biological Crystallography Phenix: Building New Software for Automated Crystallographic Structure Determination Biological Crystallography Phenix: Building New Software for Automated Crystallographic Structure Determination , 2022 .

[61]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[62]  R. Jaenisch,et al.  Deficiency of methyl-CpG binding protein-2 in CNS neurons results in a Rett-like phenotype in mice , 2001, Nature Genetics.

[63]  N. Gabellini,et al.  A polymorphic GT repeat from the human cardiac Na+Ca2+ exchanger intron 2 activates splicing. , 2001, European journal of biochemistry.

[64]  A. Bird,et al.  A mouse Mecp2-null mutation causes neurological symptoms that mimic Rett syndrome , 2001, Nature Genetics.

[65]  A. Wolffe,et al.  Functional consequences of Rett syndrome mutations on human MeCP2. , 2000, Nucleic acids research.

[66]  E. Ballestar,et al.  Effects of Rett syndrome mutations of the methyl-CpG binding domain of the transcriptional repressor MeCP2 on selectivity for association with methylated DNA. , 2000, Biochemistry.

[67]  H. Zoghbi,et al.  Influence of mutation type and X chromosome inactivation on Rett syndrome phenotypes , 2000, Annals of neurology.

[68]  H. Zoghbi,et al.  Rett syndrome is caused by mutations in X-linked MECP2, encoding methyl-CpG-binding protein 2 , 1999, Nature Genetics.

[69]  P N Barlow,et al.  The solution structure of the domain from MeCP2 that binds to methylated DNA. , 1999, Journal of molecular biology.

[70]  K. Zänker,et al.  Modulation of Epidermal Growth Factor Receptor Gene Transcription by a Polymorphic Dinucleotide Repeat in Intron 1* , 1999, The Journal of Biological Chemistry.

[71]  A. Bird,et al.  Characterization of MeCP2, a vertebrate DNA binding protein with affinity for methylated DNA. , 1992, Nucleic acids research.

[72]  A. Bird,et al.  Purification, sequence, and cellular localization of a novel chromosomal protein that binds to Methylated DNA , 1992, Cell.

[73]  Y. Benjamini,et al.  More powerful procedures for multiple significance testing. , 1990, Statistics in medicine.

[74]  L. Naylor,et al.  d(TG)n.d(CA)n sequences upstream of the rat prolactin gene form Z-DNA and inhibit gene transcription. , 1990, Nucleic acids research.

[75]  Jean Aicardi,et al.  A progressive syndrome of autism, dementia, ataxia, and loss of purposeful hand use in girls: Rett's syndrome: Report of 35 cases , 1983, Annals of neurology.

[76]  A Rett,et al.  [On a unusual brain atrophy syndrome in hyperammonemia in childhood]. , 1966, Wiener medizinische Wochenschrift.