R-loop formation is a distinctive characteristic of unmethylated human CpG island promoters.

CpG islands (CGIs) function as promoters for approximately 60% of human genes. Most of these elements remain protected from CpG methylation, a prevalent epigenetic modification associated with transcriptional silencing. Here, we report that methylation-resistant CGI promoters are characterized by significant strand asymmetry in the distribution of guanines and cytosines (GC skew) immediately downstream from their transcription start sites. Using innovative genomics methodologies, we show that transcription through regions of GC skew leads to the formation of long R loop structures. Furthermore, we show that GC skew and R loop formation potential is correlated with and predictive of the unmethylated state of CGIs. Finally, we provide evidence that R loop formation protects from DNMT3B1, the primary de novo DNA methyltransferase in early development. Altogether, these results suggest that protection from DNA methylation is a built-in characteristic of the DNA sequence of CGI promoters that is revealed by the cotranscriptional formation of R loop structures.

[1]  M. Lieber,et al.  Downstream boundary of chromosomal R-loops at murine switch regions: implications for the mechanism of class switch recombination. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Laurent Duret,et al.  Genome-wide studies highlight indirect links between human replication origins and gene regulation , 2008, Proceedings of the National Academy of Sciences.

[3]  M. Lieber,et al.  The DNA methyltransferase-like protein DNMT3L stimulates de novo methylation by Dnmt3a , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[4]  C. Walsh,et al.  Cytosine methylation and the ecology of intragenomic parasites. , 1997, Trends in genetics : TIG.

[5]  M. Lieber,et al.  R-loops at immunoglobulin class switch regions in the chromosomes of stimulated B cells , 2003, Nature Immunology.

[6]  S. Boguslawski,et al.  Characterization of monoclonal antibody to DNA.RNA and its application to immunodetection of hybrids. , 1986, Journal of immunological methods.

[7]  Ramón Díaz-Uriarte,et al.  Transcription Initiation Activity Sets Replication Origin Efficiency in Mammalian Cells , 2009, PLoS genetics.

[8]  D. Tollervey,et al.  Loss of Topoisomerase I leads to R-loop-mediated transcriptional blocks during ribosomal RNA synthesis. , 2010, Genes & development.

[9]  T. Brown,et al.  Native R-loops Persist throughout the Mouse Mitochondrial DNA Genome , 2008, Journal of Biological Chemistry.

[10]  D. Crothers,et al.  Stability and properties of double and triple helices: dramatic effects of RNA or DNA backbone composition. , 1992, Science.

[11]  A. Bird DNA methylation patterns and epigenetic memory. , 2002, Genes & development.

[12]  P. Green,et al.  Transcription-associated mutational asymmetry in mammalian evolution , 2003, Nature Genetics.

[13]  S. Crooke,et al.  Investigating the Structure of Human RNase H1 by Site-directed Mutagenesis* , 2001, The Journal of Biological Chemistry.

[14]  B. Thiers Induction of Pluripotent Stem Cells from Adult Human Fibroblasts by Defined Factors , 2008 .

[15]  Thomas Lengauer,et al.  CpG Island Methylation in Human Lymphocytes Is Highly Correlated with DNA Sequence, Repeats, and Predicted DNA Structure , 2006, PLoS genetics.

[16]  W D Wilson,et al.  Sequence specific thermodynamic and structural properties for DNA.RNA duplexes. , 1994, Biochemistry.

[17]  M. Goodman,et al.  Processive AID-catalysed cytosine deamination on single-stranded DNA simulates somatic hypermutation , 2003, Nature.

[18]  Eva K. Lee,et al.  Predicting aberrant CpG island methylation , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Francisco Antequera,et al.  Initiation of DNA replication at CpG islands in mammalian chromosomes , 1998, The EMBO journal.

[20]  M. Lieber,et al.  Conformational Variants of Duplex DNA Correlated with Cytosine-rich Chromosomal Fragile Sites* , 2009, Journal of Biological Chemistry.

[21]  M. Lieber,et al.  Fine-Structure Analysis of Activation-Induced Deaminase Accessibility to Class Switch Region R-Loops , 2005, Molecular and Cellular Biology.

[22]  S Nicolay,et al.  Transcription‐coupled TA and GC strand asymmetries in the human genome , 2003, FEBS letters.

[23]  Gene W. Yeo,et al.  Divergent Transcription from Active Promoters , 2008, Science.

[24]  Peter A. Jones,et al.  The fundamental role of epigenetic events in cancer , 2002, Nature Reviews Genetics.

[25]  P. Polak,et al.  Transcription induces strand-specific mutations at the 5' end of human genes. , 2008, Genome research.

[26]  T. Boon,et al.  Promoter-Dependent Mechanism Leading to Selective Hypomethylation within the 5′ Region of Gene MAGE-A1 in Tumor Cells , 2004, Molecular and Cellular Biology.

[27]  Yves Moreau,et al.  Comprehensive analysis of the base composition around the transcription start site in Metazoa , 2004, BMC Genomics.

[28]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[29]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[30]  J. Griffith,et al.  The presence of RNA in a double helix inhibits its interaction with histone protein. , 1980, Nucleic acids research.

[31]  Lee E. Edsall,et al.  Human DNA methylomes at base resolution show widespread epigenomic differences , 2009, Nature.

[32]  P. Molloy,et al.  Recombinant mammalian DNA methyltransferase activity on model transcriptional gene silencing short RNA-DNA heteroduplex substrates. , 2010, The Biochemical journal.

[33]  Xiaodong Wu,et al.  Treatment of Early Non-Small Cell Lung Cancer, Stage IA, by Image-Guided Robotic Stereotactic Radioablation—CyberKnife , 2007, Cancer journal.

[34]  Jeannie T. Lee,et al.  X chromosome dosage compensation: how mammals keep the balance. , 2008, Annual review of genetics.

[35]  Michael Weber,et al.  Dynamic regulation of DNA methylation during mammalian development. , 2009, Epigenomics.

[36]  M. Pellegrini,et al.  Genome-wide erasure of DNA methylation in mouse primordial germ cells is affected by AID deficiency , 2010, Nature.

[37]  W. Lam,et al.  Chromosome-wide and promoter-specific analyses identify sites of differential DNA methylation in normal and transformed human cells , 2005, Nature Genetics.

[38]  J. Manley,et al.  Cotranscriptional processes and their influence on genome stability. , 2006, Genes & development.

[39]  A. Aguilera,et al.  Impairment of transcription elongation by R-loops in vitro. , 2007, Biochemical and biophysical research communications.

[40]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[41]  Chia-Lin Wei,et al.  Dynamic changes in the human methylome during differentiation. , 2010, Genome research.

[42]  F. Alt,et al.  Transcription-targeted DNA deamination by the AID antibody diversification enzyme , 2003, Nature.

[43]  R. Crouch,et al.  Failure to produce mitochondrial DNA results in embryonic lethality in Rnaseh1 null mice. , 2003, Molecular cell.

[44]  Israel Steinfeld,et al.  Developmental programming of CpG island methylation profiles in the human genome , 2009, Nature Structural &Molecular Biology.

[45]  Rosa Luna,et al.  Genome Instability and Transcription Elongation Impairment in Human Cells Depleted of THO/TREX , 2011, PLoS genetics.

[46]  Leighton J. Core,et al.  Nascent RNA Sequencing Reveals Widespread Pausing and Divergent Initiation at Human Promoters , 2008, Science.

[47]  M. Pfaffl,et al.  A new mathematical model for relative quantification in real-time RT-PCR. , 2001, Nucleic acids research.

[48]  F. Alt,et al.  Transcription-induced Cleavage of Immunoglobulin Switch Regions by Nucleotide Excision Repair Nucleases in Vitro* , 2000, The Journal of Biological Chemistry.

[49]  M. Nussenzweig,et al.  Deep-sequencing identification of the genomic targets of the cytidine deaminase AID and its cofactor RPA in B lymphocytes , 2011, Nature Immunology.

[50]  C. Allis,et al.  DNMT3L connects unmethylated lysine 4 of histone H3 to de novo methylation of DNA , 2007, Nature.

[51]  A. Bird,et al.  Sp1 sites in the mouse aprt gene promoter are required to prevent methylation of the CpG island. , 1994, Genes & development.

[52]  Robert S Illingworth,et al.  CpG islands – ‘A rough guide’ , 2009, FEBS letters.

[53]  Adrian Bird,et al.  Alternative chromatin structure at CpG islands , 1990, Cell.

[54]  G. Felsenfeld,et al.  Silencing of transgene transcription precedes methylation of promoter DNA and histone H3 lysine 9 , 2004, The EMBO journal.

[55]  J. Walter,et al.  Maternal methylation imprints on human chromosome 15 are established during or after fertilization , 2001, Nature Genetics.

[56]  M. Frommer,et al.  CpG islands in vertebrate genomes. , 1987, Journal of molecular biology.

[57]  F. Pauler,et al.  Silencing and transcriptional properties of the imprinted Airn ncRNA are independent of the endogenous promoter , 2008 .

[58]  B. Gómez-González,et al.  Genome instability: a mechanistic view of its causes and consequences , 2008, Nature Reviews Genetics.

[59]  T. Ushijima,et al.  The presence of RNA polymerase II, active or stalled, predicts epigenetic fate of promoter CpG islands. , 2009, Genome research.

[60]  M. Lieber,et al.  Mechanism of R-Loop Formation at Immunoglobulin Class Switch Sequences , 2007, Molecular and Cellular Biology.

[61]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[62]  J. Herman,et al.  Cancer as a manifestation of aberrant chromatin structure. , 2007, Cancer journal.

[63]  J. Herman,et al.  Histone modifications and silencing prior to DNA methylation of a tumor suppressor gene. , 2003, Cancer cell.

[64]  Robert S. Illingworth,et al.  CpG islands influence chromatin structure via the CpG-binding protein Cfp1 , 2010, Nature.

[65]  M. Lieber,et al.  G Clustering Is Important for the Initiation of Transcription-Induced R-Loops In Vitro, whereas High G Density without Clustering Is Sufficient Thereafter , 2009, Molecular and Cellular Biology.

[66]  F. Pauler,et al.  An in vitro ES cell imprinting model shows that imprinted expression of the Igf2r gene arises from an allele-specific expression bias , 2009, Development.

[67]  R. Wollman,et al.  A genome-wide siRNA screen reveals diverse cellular processes and pathways that mediate genome stability. , 2009, Molecular cell.

[68]  E. Canaani,et al.  A Motif within SET-Domain Proteins Binds Single-Stranded Nucleic Acids and Transcribed and Supercoiled DNAs and Can Interfere with Assembly of Nucleosomes , 2005, Molecular and Cellular Biology.

[69]  Aixia Zhang,et al.  An antibody-based microarray assay for small RNA detection , 2006 .