Evaluation of the information content of RNA structure mapping data for secondary structure prediction.

Structure mapping experiments (using probes such as dimethyl sulfate [DMS], kethoxal, and T1 and V1 RNases) are used to determine the secondary structures of RNA molecules. The process is iterative, combining the results of several probes with constrained minimum free-energy calculations to produce a model of the structure. We aim to evaluate whether particular probes provide more structural information, and specifically, how noise in the data affects the predictions. Our approach involves generating "decoy" RNA structures (using the sFold Boltzmann sampling procedure) and evaluating whether we are able to identify the correct structure from this ensemble of structures. We show that with perfect information, we are always able to identify the optimal structure for five RNAs of known structure. We then collected orthogonal structure mapping data (DMS and RNase T1 digest) under several solution conditions using our high-throughput capillary automated footprinting analysis (CAFA) technique on two group I introns of known structure. Analysis of these data reveals the error rates in the data under optimal (low salt) and suboptimal solution conditions (high MgCl(2)). We show that despite these errors, our computational approach is less sensitive to experimental noise than traditional constraint-based structure prediction algorithms. Finally, we propose a novel approach for visualizing the interaction of chemical and enzymatic mapping data with RNA structure. We project the data onto the first two dimensions of a multidimensional scaling of the sFold-generated decoy structures. We are able to directly visualize the structural information content of structure mapping data and reconcile multiple data sets.

[1]  W. Gilbert,et al.  Mapping adenines, guanines, and pyrimidines in RNA. , 1977, Nucleic acids research.

[2]  R. E. Lockard,et al.  Mapping tRNA structure in solution using double-strand-specific ribonuclease V1 from cobra venom. , 1981, Nucleic acids research.

[3]  RNA structure analysis using T2 ribonuclease: detection of pH and metal ion induced conformational changes in yeast tRNAPhe. , 1984, Nucleic acids research.

[4]  J. Ebel,et al.  Probing the structure of RNAs in solution. , 1987, Nucleic acids research.

[5]  T. Cech,et al.  Defining the inside and outside of a catalytic RNA molecule. , 1989, Science.

[6]  O. Uhlenbeck,et al.  Keeping RNA happy. , 1995, RNA.

[7]  C. Kundrot,et al.  Crystal Structure of a Group I Ribozyme Domain: Principles of RNA Packing , 1996, Science.

[8]  Thomas F. Coleman,et al.  An Interior Trust Region Approach for Nonlinear Minimization Subject to Bounds , 1993, SIAM J. Optim..

[9]  D. Turner,et al.  Secondary structure model of the RNA recognized by the reverse transcriptase from the R2 retrotransposable element. , 1997, RNA.

[10]  S. Beaucage,et al.  Current Protocols in Nucleic Acid Chemistry , 1999 .

[11]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[12]  V. Dolnik,et al.  DNA sequencing by capillary electrophoresis (review). , 1999, Journal of biochemical and biophysical methods.

[13]  P. Romby,et al.  Probing RNA structure and RNA-ligand complexes with chemical probes. , 2000, Methods in enzymology.

[14]  Nan Yu,et al.  The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs , 2002, BMC Bioinformatics.

[15]  References to commonly used techniques. , 2001, Current protocols in nucleic acid chemistry.

[16]  F. Major,et al.  RNA canonical and non-canonical base pairing types: a recognition method and complete repertoire. , 2002, Nucleic acids research.

[17]  Eric Westhof,et al.  The non-Watson-Crick base pairs and their associated isostericity matrices. , 2002, Nucleic acids research.

[18]  X. Zhuang,et al.  Exploration of the transition state for tertiary structure formation between an RNA helix and a large structured RNA. , 2003, Journal of molecular biology.

[19]  Ignacio Tinoco,et al.  Identifying Kinetic Barriers to Mechanical Unfolding of the T. thermophila Ribozyme , 2003, Science.

[20]  Zukang Feng,et al.  The Nucleic Acid Database. , 2002, Acta crystallographica. Section D, Biological crystallography.

[21]  D. Mathews Using an RNA secondary structure partition function to determine confidence in base pairs predicted by free energy minimization. , 2004, RNA.

[22]  Keiji Takamoto,et al.  Semi-automated, single-band peak-fitting analysis of hydroxyl radical nucleic acid footprint autoradiograms for the quantitative analysis of transitions. , 2004, Nucleic acids research.

[23]  M. Chance,et al.  Monovalent ion-mediated folding of the Tetrahymena thermophila ribozyme. , 2004, Journal of molecular biology.

[24]  D. Turner,et al.  Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Ye Ding,et al.  Sfold web server for statistical folding and rational design of nucleic acids , 2004, Nucleic Acids Res..

[26]  C. Lawrence,et al.  RNA secondary structure prediction by centroids in a Boltzmann weighted ensemble. , 2005, RNA.

[27]  M. Brenowitz,et al.  Perturbation of the hierarchical folding of a large RNA by the destabilization of its Scaffold's tertiary structure. , 2005, Journal of molecular biology.

[28]  X. Zhuang Single-molecule RNA science. , 2005, Annual review of biophysics and biomolecular structure.

[29]  R. Altman,et al.  SAFA: semi-automated footprinting analysis software for high-throughput quantification of nucleic acid footprinting experiments. , 2005, RNA.

[30]  Kevin M Weeks,et al.  RNA SHAPE chemistry reveals nonhierarchical interactions dominate equilibrium structural transitions in tRNA(Asp) transcripts. , 2005, Journal of the American Chemical Society.

[31]  B. Golden,et al.  Crystal structure of a phage Twort group I ribozyme–product complex , 2005, Nature Structural &Molecular Biology.

[32]  Mike P. Liang,et al.  Local kinetic measures of macromolecular structure reveal partitioning among multiple parallel pathways from the earliest steps in the folding of a large RNA molecule. , 2006, Journal of molecular biology.

[33]  Robert Giegerich,et al.  Beyond Mfold: Recent advances in RNA bioinformatics , 2006, Journal of Biotechnology.

[34]  Peter F. Stadler,et al.  Local RNA base pairing probabilities in large sequences , 2006, Bioinform..

[35]  R. Lease,et al.  Hydroxyl radical footprinting in vivo: mapping macromolecular structures with synchrotron radiation , 2006, Nucleic acids research.

[36]  Peter Clote,et al.  Computing the Partition Function and Sampling for Saturated Secondary Structures of RNA, with Respect to the Turner Energy Model , 2007, J. Comput. Biol..

[37]  R. Russell,et al.  DMS footprinting of structured RNAs and RNA–protein complexes , 2007, Nature Protocols.

[38]  Quentin Vicens,et al.  Local RNA structural changes induced by crystallization are revealed by SHAPE. , 2007, RNA.

[39]  George M Weinstock,et al.  ENCODE: more genomic empowerment. , 2007, Genome research.

[40]  Magdalena A. Jonikas,et al.  Distinct contribution of electrostatics, initial conformational ensemble, and macromolecular stability in RNA folding , 2007, Proceedings of the National Academy of Sciences.

[41]  A. Laederach,et al.  Energy barriers, pathways, and dynamics during folding of large, multidomain RNAs. , 2008, Current opinion in chemical biology.

[42]  Morgan C. Giddings,et al.  ShapeFinder: a software system for high-throughput quantitative analysis of nucleic acid reactivity information resolved by capillary electrophoresis. , 2008, RNA.

[43]  M. Brenowitz,et al.  Monitoring structural changes in nucleic acids with single residue spatial and millisecond time resolution by quantitative hydroxyl radical footprinting , 2008, Nature Protocols.

[44]  David H. Mathews,et al.  NMR-Assisted Prediction of RNA Secondary Structure: Identification of a Probable Pseudoknot in the Coding Region of an R2 Retrotransposon , 2008, Journal of the American Chemical Society.

[45]  Morgan C. Giddings,et al.  High-Throughput SHAPE Analysis Reveals Structures in HIV-1 Genomic RNA Strongly Conserved across Distinct Biological States , 2008, PLoS biology.

[46]  N. Morton,et al.  Into the post-HapMap era. , 2008, Advances in genetics.

[47]  R. Altman,et al.  High-throughput single-nucleotide structural mapping by capillary automated footprinting analysis , 2008, Nucleic acids research.

[48]  Morgan C. Giddings,et al.  Influence of nucleotide identity on ribose 2'-hydroxyl reactivity in RNA. , 2009, RNA.

[49]  D. Mathews,et al.  Accurate SHAPE-directed RNA structure determination , 2009, Proceedings of the National Academy of Sciences.

[50]  Yann Ponty,et al.  VARNA: Interactive drawing and editing of the RNA secondary structure , 2009, Bioinform..