RNA structure inference through chemical mapping after accidental or intentional mutations

Significance The intricate structures of RNA molecules are crucial to their biological functions but have been difficult to accurately characterize. Multidimensional chemical-mapping methods improve accuracy but have so far involved painstaking experiments and reliance on secondary-structure prediction software. A methodology called M2-seq now lifts these limitations. Mechanistic studies clarify the origin of serendipitous M2-seq–like signals that were recently discovered but not correctly explained and also provide mutational strategies that enable robust M2-seq for new RNA transcripts. The method detects dozens of Watson–Crick helices across diverse RNA folds in vitro and within frog egg extract, with a low false-positive rate (<5%). M2-seq opens a route to unbiased discovery of RNA structures in vitro and beyond. Despite the critical roles RNA structures play in regulating gene expression, sequencing-based methods for experimentally determining RNA base pairs have remained inaccurate. Here, we describe a multidimensional chemical-mapping method called “mutate-and-map read out through next-generation sequencing” (M2-seq) that takes advantage of sparsely mutated nucleotides to induce structural perturbations at partner nucleotides and then detects these events through dimethyl sulfate (DMS) probing and mutational profiling. In special cases, fortuitous errors introduced during DNA template preparation and RNA transcription are sufficient to give M2-seq helix signatures; these signals were previously overlooked or mistaken for correlated double-DMS events. When mutations are enhanced through error-prone PCR, in vitro M2-seq experimentally resolves 33 of 68 helices in diverse structured RNAs including ribozyme domains, riboswitch aptamers, and viral RNA domains with a single false positive. These inferences do not require energy minimization algorithms and can be made by either direct visual inspection or by a neural-network–inspired algorithm called M2-net. Measurements on the P4–P6 domain of the Tetrahymena group I ribozyme embedded in Xenopus egg extract demonstrate the ability of M2-seq to detect RNA helices in a complex biological environment.

[1]  Siqi Tian,et al.  High-throughput mutate-map-rescue evaluates SHAPE-directed RNA structure and uncovers excited states. , 2014, RNA.

[2]  Kevin M Weeks,et al.  Selective 2′-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile and accurate RNA structure analysis , 2015, Nature Protocols.

[3]  A. A. Bogdanov,et al.  Chemical Methods for the Structural Study of the Ribosome: Judgment Day , 2001, Molecular Biology.

[4]  Rhiju Das,et al.  The mutate-and-map protocol for inferring base pairs in structured RNA. , 2013, Methods in molecular biology.

[5]  Sean R Eddy,et al.  Computational analysis of conserved RNA secondary structure in transcriptomes and genomes. , 2014, Annual review of biophysics.

[6]  Feng Ding,et al.  RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures , 2015, RNA.

[7]  M. Rodnina,et al.  Mechanism of tRNA Translocation on the Ribosome , 2001, Molecular Biology.

[8]  G. Church,et al.  Large-scale de novo DNA synthesis: technologies and applications , 2014, Nature Methods.

[9]  Brenda L. Bass,et al.  An unwinding activity that covalently modifies its double-stranded RNA substrate , 1988, Cell.

[10]  Jill P. Mesirov,et al.  RNA Duplex Map in Living Cells Reveals Higher-Order Transcriptome Structure , 2016, Cell.

[11]  Rhiju Das,et al.  A mutate-and-map strategy for inferring base pairs in structured nucleic acids: proof of concept on a DNA/RNA helix. , 2010, Biochemistry.

[12]  Rhiju Das,et al.  RNA structure through multidimensional chemical mapping , 2016, bioRxiv.

[13]  Pablo Cordero,et al.  Primerize: automated primer assembly for transcribing non-coding RNA domains , 2015, Nucleic Acids Res..

[14]  C. J. Fuller,et al.  A cell-free system for functional centromere and kinetochore assembly , 2012, Nature Protocols.

[15]  D. Mathews,et al.  Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots , 2013, Proceedings of the National Academy of Sciences.

[16]  Rhiju Das,et al.  A two-dimensional mutate-and-map strategy for non-coding RNA structure. , 2011, Nature chemistry.

[17]  Trevor J Pugh,et al.  Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation , 2013, Nucleic acids research.

[18]  A. Murray,et al.  The use of Xenopus egg extracts to study mitotic spindle assembly and function in vitro. , 1999, Methods in cell biology.

[19]  N. Pace,et al.  Solution structure of RNase P RNA. , 2011, RNA.

[20]  Rhiju Das,et al.  A mutate-and-map strategy accurately infers the base pairs of a 35-nucleotide model RNA. , 2011, RNA.

[21]  Rhiju Das,et al.  Understanding the errors of SHAPE-directed RNA structure modeling. , 2011, Biochemistry.

[22]  C. Kundrot,et al.  Crystal Structure of a Group I Ribozyme Domain: Principles of RNA Packing , 1996, Science.

[23]  S. Balasubramanian,et al.  Use of fluorescence resonance energy transfer to investigate the conformation of DNA substrates bound to the Klenow fragment. , 1998, Biochemistry.

[24]  P. Kwok,et al.  A Hybrid Approach for de novo Human Genome Sequence Assembly and Phasing , 2016, Nature Methods.

[25]  Siqi Tian,et al.  RNA structure through multidimensional chemical mapping , 2016, bioRxiv.

[26]  J. E. Smith,et al.  A double-stranded RNA unwinding activity introduces structural alterations by means of adenosine to inosine conversions in mammalian cells and Xenopus eggs. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Haplotype-phased synthetic long reads from short-read sequencing , 2015 .

[28]  Steven Busan,et al.  RNA motif discovery by SHAPE and mutational profiling (SHAPE-MaP) , 2014, Nature Methods.

[29]  Rhiju Das,et al.  Ultraviolet Shadowing of RNA Can Cause Significant Chemical Damage in Seconds , 2012, Scientific Reports.

[30]  Hugh E. Olsen,et al.  The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community , 2016, Genome Biology.

[31]  Timothy A. Whitehead,et al.  Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing , 2015, bioRxiv.

[32]  S. Epstein,et al.  Chemical Mutagenesis , 1971, Nature.

[33]  J. Kieft,et al.  Zika virus produces noncoding RNAs using a multi-pseudoknot structure that confounds a cellular exonuclease , 2016, Science.

[34]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[35]  M. Sektas,et al.  RNA editing by T7 RNA polymerase bypasses InDel mutations causing unexpected phenotypic changes , 2015, Nucleic acids research.

[36]  T. Cech,et al.  Sequence-specific endoribonuclease activity of the Tetrahymena ribozyme: enhanced cleavage of certain oligonucleotide substrates that form mismatched ribozyme-substrate complexes. , 1988, Biochemistry.

[37]  Kristen K. Dang,et al.  Comparison of SIV and HIV-1 Genomic RNA Structures Reveals Impact of Sequence Evolution on Conserved and Non-Conserved Structural Motifs , 2013, PLoS pathogens.

[38]  David S. Wilson,et al.  Random mutagenesis by PCR. , 2001, Current protocols in molecular biology.

[39]  Rhiju Das,et al.  Massively parallel RNA chemical mapping with a reduced bias MAP-seq protocol. , 2013, Methods in molecular biology.

[40]  K. Weeks,et al.  Direct identification of base-paired RNA nucleotides by correlated chemical probing , 2017, RNA.

[41]  Rhiju Das,et al.  Quantitative dimethyl sulfate mapping for automated RNA secondary structure inference. , 2012, Biochemistry.

[42]  P. Sergiev,et al.  [Study of ribosome structure using the biochemical methods: judgment day]. , 2001, Molekuliarnaia biologiia.

[43]  Siqi Tian,et al.  High-throughput mutate-map-rescue evaluates SHAPE-directed RNA structure and uncovers excited states , 2014, bioRxiv.

[44]  C. J. Fuller,et al.  CENP-C recruits M18BP1 to centromeres to promote CENP-A chromatin assembly , 2011, The Journal of cell biology.

[45]  D. Mathews,et al.  Accurate SHAPE-directed RNA structure determination , 2009, Proceedings of the National Academy of Sciences.

[46]  J. Gorodkin,et al.  Full-length RNA structure prediction of the HIV-1 genome reveals a conserved core domain , 2015, Nucleic acids research.

[47]  D. Mathews,et al.  Bridging the gap between in vitro and in vivo RNA folding , 2016, Quarterly Reviews of Biophysics.

[48]  Pablo Cordero,et al.  Rich RNA Structure Landscapes Revealed by Mutate-and-Map Analysis , 2015, PLoS Comput. Biol..

[49]  E. Kandel,et al.  Control of Memory Formation Through Regulated Expression of a CaMKII Transgene , 1996, Science.

[50]  Nikolay V. Dokholyan,et al.  Single-molecule correlated chemical probing of RNA , 2014, Proceedings of the National Academy of Sciences.

[51]  Katarzyna J Purzycka,et al.  RNA-Puzzles Round III: 3D RNA structure prediction of five riboswitches and one ribozyme. , 2017, RNA.

[52]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[53]  Kristen K. Dang,et al.  Architecture and Secondary Structure of an Entire HIV-1 RNA Genome , 2009, Nature.