Automated Recognition of RNA Structure Motifs by Their SHAPE Data Signatures

High-throughput structure profiling (SP) experiments that provide information at nucleotide resolution are revolutionizing our ability to study RNA structures. Of particular interest are RNA elements whose underlying structures are necessary for their biological functions. We previously introduced patteRNA, an algorithm for rapidly mining SP data for patterns characteristic of such motifs. That work provided a proof of concept for motif detection and showed that the algorithm can distinguish structures displaying pronounced conformational changes. Here, we describe several improvements to patteRNA, along with routines that automate its use. We then consider more elaborate biological situations, starting with the comparison or integration of results from searches for distinct motifs and across datasets. To facilitate such analyses, we characterize patteRNA’s outputs and describe a normalization framework that regularizes results. We then demonstrate that our algorithm successfully discerns between highly similar structural variants of the human immunodeficiency virus type 1 (HIV-1) Rev response element (RRE) and readily identifies its exact location in whole-genome structure profiles of HIV-1. This work highlights the breadth of information that can be gleaned from SP data and broadens the utility of data-driven methods as tools for the detection of novel RNA elements.
