Principles for understanding the accuracy of SHAPE-directed RNA structure modeling.

Accurate RNA structure modeling is an important, incompletely solved, challenge. Single-nucleotide resolution SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) yields an experimental measurement of local nucleotide flexibility that can be incorporated as pseudo-free energy change constraints to direct secondary structure predictions. Prior work from our laboratory has emphasized both the overall accuracy of this approach and the need for nuanced interpretation of modeled structures. Recent studies by Das and colleagues [Kladwang, W., et al. (2011) Biochemistry 50, 8049; Nat. Chem. 3, 954], focused on analyzing six small RNAs, yielded poorer RNA secondary structure predictions than expected on the basis of prior benchmarking efforts. To understand the features that led to these divergent results, we re-examined four RNAs yielding the poorest results in this recent work: tRNA(Phe), the adenine and cyclic-di-GMP riboswitches, and 5S rRNA. Most of the errors reported by Das and colleagues reflected nonstandard experiment and data processing choices, and selective scoring rules. For two RNAs, tRNA(Phe) and the adenine riboswitch, secondary structure predictions are nearly perfect if no experimental information is included but were rendered inaccurate by the SHAPE data of Das and colleagues. When best practices were used, single-sequence SHAPE-directed secondary structure modeling recovered ~93% of individual base pairs and >90% of helices in the four RNAs, essentially indistinguishable from the results of the mutate-and-map approach with the exception of a single helix in the 5S rRNA. The field of experimentally directed RNA secondary structure prediction is entering a phase focused on the most difficult prediction challenges. We outline five constructive principles for guiding this field forward.

[1]  Jerrold R. Griggs,et al.  Algorithms for Loop Matchings , 1978 .

[2]  J. Hutton,et al.  Thermal stability and renaturation of DNA in dimethyl sulfoxide solutions: Acceleration of the renaturation rate , 1980, Biopolymers.

[3]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[4]  A. S. Krasilnikov,et al.  Crystal structure of the specificity domain of ribonuclease P , 2003, Nature.

[5]  A. Serganov,et al.  Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mRNAs. , 2004, Chemistry & biology.

[6]  D. Turner,et al.  Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[7]  K. Weeks,et al.  RNA structure analysis at single nucleotide resolution by selective 2'-hydroxyl acylation and primer extension (SHAPE). , 2005, Journal of the American Chemical Society.

[8]  K. Weeks,et al.  Selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE): quantitative RNA structure analysis at single nucleotide resolution , 2006, Nature Protocols.

[9]  D. Turner,et al.  A set of nearest neighbor parameters for predicting the enthalpy change of RNA secondary structure formation , 2006, Nucleic acids research.

[10]  K. Weeks,et al.  A fast-acting reagent for accurate analysis of RNA secondary and tertiary structure by SHAPE chemistry. , 2007, Journal of the American Chemical Society.

[11]  Gabriele Varani,et al.  Strong correlation between SHAPE chemistry and the generalized NMR order parameter (S2) in RNA. , 2008, Journal of the American Chemical Society.

[12]  Morgan C. Giddings,et al.  ShapeFinder: a software system for high-throughput quantitative analysis of nucleic acid reactivity information resolved by capillary electrophoresis. , 2008, RNA.

[13]  RNA Ligases , 2008, Current protocols in molecular biology.

[14]  K. Weeks,et al.  Slow conformational dynamics at C2'-endo nucleotides in RNA. , 2008, Journal of the American Chemical Society.

[15]  Morgan C. Giddings,et al.  Influence of nucleotide identity on ribose 2'-hydroxyl reactivity in RNA. , 2009, RNA.

[16]  Mijeong Kang,et al.  Structural Insights into riboswitch control of the biosynthesis of queuosine, a modified nucleotide found in the anticodon of tRNA. , 2009, Molecular cell.

[17]  D. Mathews,et al.  Accurate SHAPE-directed RNA structure determination , 2009, Proceedings of the National Academy of Sciences.

[18]  David H. Mathews,et al.  RNAstructure: software for RNA secondary structure prediction and analysis , 2010, BMC Bioinformatics.

[19]  Kathryn D. Smith,et al.  Structural basis of ligand binding by a c-di-GMP riboswitch , 2009, Nature Structural &Molecular Biology.

[20]  K. Weeks,et al.  C2′-endo nucleotides as molecular timers suggested by the folding of an RNA domain , 2009, Proceedings of the National Academy of Sciences.

[21]  K. Weeks,et al.  High-throughput SHAPE and hydroxyl radical analysis of RNA structure and ribonucleoprotein assembly. , 2009, Methods in enzymology.

[22]  A. Ferré-D’Amaré,et al.  Recognition of the bacterial second messenger cyclic diguanylate by its cognate riboswitch , 2009, Nature Structural &Molecular Biology.

[23]  J Andrew Berglund,et al.  Role of RNA structure in regulating pre-mRNA splicing. , 2010, Trends in biochemical sciences.

[24]  K. Weeks,et al.  SHAPE-directed RNA secondary structure prediction. , 2010, Methods.

[25]  M. Rodnina,et al.  The crystal structure of unmodified tRNAPhe from Escherichia coli , 2010, Nucleic acids research.

[26]  Kathryn D. Smith,et al.  Structural and biochemical determinants of ligand binding by the c-di-GMP riboswitch . , 2010, Biochemistry.

[27]  David H. Mathews,et al.  NNDB: the nearest neighbor parameter database for predicting stability of nucleic acid secondary structure , 2009, Nucleic Acids Res..

[28]  Andrea L Edwards,et al.  Structural basis for recognition of S-adenosylhomocysteine by riboswitches. , 2010, RNA.

[29]  K. Weeks Advances in RNA structure analysis by chemical probing. , 2010, Current opinion in structural biology.

[30]  Rhiju Das,et al.  Understanding the errors of SHAPE-directed RNA structure modeling. , 2011, Biochemistry.

[31]  T. Hermann,et al.  Structure of an RNA dimer of a regulatory element from human thymidylate synthase mRNA. , 2011, Acta crystallographica. Section D, Biological crystallography.

[32]  Cole Trapnell,et al.  Modeling and automation of sequencing-based characterization of RNA structure , 2011, Proceedings of the National Academy of Sciences.

[33]  Craig L. Zirbel,et al.  Sharing and archiving nucleic acid structure mapping data. , 2011, RNA.

[34]  Rhiju Das,et al.  A two-dimensional mutate-and-map strategy for non-coding RNA structure. , 2011, Nature chemistry.

[35]  J. Feigon,et al.  Comparison of Solution and Crystal Structures of PreQ1 Riboswitch Reveals Calcium-Induced Changes in Conformation and Dynamics , 2011, Journal of the American Chemical Society.

[36]  Rhiju Das,et al.  A mutate-and-map strategy accurately infers the base pairs of a 35-nucleotide model RNA. , 2011, RNA.

[37]  K. Weeks,et al.  Exploring RNA structural codes with SHAPE chemistry. , 2011, Accounts of chemical research.

[38]  Seunghyun Park,et al.  HiTRACE: high-throughput robust analysis for capillary electrophoresis , 2011, Bioinform..

[39]  K. Weeks,et al.  The mechanisms of RNA SHAPE chemistry. , 2012, Journal of the American Chemical Society.

[40]  K. Weeks,et al.  SHAPE-directed discovery of potent shRNA inhibitors of HIV-1. , 2012, Molecular therapy : the journal of the American Society of Gene Therapy.

[41]  Rhiju Das,et al.  Quantitative dimethyl sulfate mapping for automated RNA secondary structure inference. , 2012, Biochemistry.

[42]  K. Weeks,et al.  QuShape: rapid, accurate, and best-practices quantification of nucleic acid probing information, resolved by capillary electrophoresis. , 2013, RNA.

[43]  Kevin M Weeks,et al.  Selective 2′-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile and accurate RNA structure analysis , 2015, Nature Protocols.