Describing the Structural Diversity within an RNA's Ensemble

RNA is usually classified as either structured or unstructured; however, neither category is adequate in describing the diversity of secondary structures expected in biological systems We describe this diversity within the ensemble of structures by using two different metrics: the average Shannon entropy and the ensemble defect. The average Shannon entropy is a measure of the structural diversity calculated from the base pair probability matrix. The ensemble defect, a tool in identifying optimal sequences for a given structure, is a measure of the average number of structural differences between a target structure and all the structures that make up the ensemble, scaled to the length of the sequence. In this paper, we show examples and discuss various uses of these metrics in both structured and unstructured RNA. By exploring how these two metrics describe RNA as an ensemble of different structures, as would be found in biological systems, it will push the field beyond the standard “structured” and “unstructured” categorization.

[1]  Kevin M Weeks,et al.  RNA SHAPE chemistry reveals nonhierarchical interactions dominate equilibrium structural transitions in tRNA(Asp) transcripts. , 2005, Journal of the American Chemical Society.

[2]  W. Dawson,et al.  A new entropy model for RNA: part I. A critique of the standard Jacobson-Stockmayer model applied to multiple cross links , 2012 .

[3]  Vincent Moulton,et al.  A comparison of RNA folding measures , 2005, BMC Bioinformatics.

[4]  Wei-Min Liu,et al.  A data mining approach to discover unusual folding regions in genome sequences , 2002, Knowl. Based Syst..

[5]  Elena Rivas,et al.  Secondary structure alone is generally not statistically significant for the detection of noncoding RNAs , 2000, Bioinform..

[6]  David H Mathews,et al.  Revolutions in RNA secondary structure prediction. , 2006, Journal of molecular biology.

[7]  Ensemble of secondary structures for encapsidated satellite tobacco mosaic virus RNA consistent with chemical probing and crystallography constraints. , 2011, Biophysical journal.

[8]  David Haussler,et al.  The UCSC Genome Browser Database: 2008 update , 2007, Nucleic Acids Res..

[9]  M. Gottesman,et al.  Sensitive measurement of single-nucleotide polymorphism-induced changes of RNA conformation: application to disease studies , 2012, Nucleic acids research.

[10]  Ye Ding,et al.  Structure clustering features on the Sfold Web server , 2005, Bioinform..

[11]  Peter F. Stadler,et al.  Partition function and base pairing probabilities of RNA heterodimers , 2006, Algorithms for Molecular Biology.

[12]  K. Mathee,et al.  A dynamic and intricate regulatory network determines Pseudomonas aeruginosa virulence , 2012, Nucleic acids research.

[13]  R. Gutell,et al.  The accuracy of ribosomal RNA comparative structure models. , 2002, Current opinion in structural biology.

[14]  Kevin Truong,et al.  Identification and characterization of subfamily-specific signatures in a large protein superfamily by a hidden Markov model approach , 2002, BMC Bioinformatics.

[15]  David H Mathews,et al.  Prediction of RNA secondary structure by free energy minimization. , 2006, Current opinion in structural biology.

[16]  Niles A. Pierce,et al.  Nucleic acid sequence design via efficient ensemble defect optimization , 2011, J. Comput. Chem..

[17]  Mike P. Liang,et al.  Local kinetic measures of macromolecular structure reveal partitioning among multiple parallel pathways from the earliest steps in the folding of a large RNA molecule. , 2006, Journal of molecular biology.

[18]  D. Mathews,et al.  Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots , 2013, Proceedings of the National Academy of Sciences.

[19]  P. Hraber,et al.  Estimating the Contributions of Selection and Self-Organization in RNA Secondary Structure , 1999, Journal of Molecular Evolution.

[20]  S. Woodson Recent insights on RNA folding mechanisms from catalytic RNA , 2000, Cellular and Molecular Life Sciences CMLS.

[21]  X. Chen,et al.  Evolutionary origins and directed evolution of RNA. , 2009, The international journal of biochemistry & cell biology.

[22]  D. Herschlag,et al.  Unwinding RNA's secrets: advances in the biology, physics, and modeling of complex RNAs. , 2008, Current opinion in structural biology.

[23]  C. Lawrence,et al.  A statistical sampling algorithm for RNA secondary structure prediction. , 2003, Nucleic acids research.

[24]  Russell L. Malmberg,et al.  Stable stem enabled shannon entropies distinguish non-coding RNAs from random backgrounds , 2011, ICCABS.

[25]  Howard Y. Chang,et al.  Genome-wide measurement of RNA secondary structure in yeast , 2010, Nature.

[26]  D. Turner,et al.  Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Joshua S. Martin,et al.  Evolutionary Evidence for Alternative Structure in RNA Sequence Co-variation , 2013, PLoS Comput. Biol..

[28]  Rhiju Das,et al.  The fastest global events in RNA folding: electrostatic relaxation and tertiary collapse of the Tetrahymena ribozyme. , 2003, Journal of molecular biology.

[29]  P. Clote,et al.  Computing the Partition Function for Kinetically Trapped RNA Secondary Structures , 2011, PloS one.

[30]  David W. Digby,et al.  mRNAs have greater negative folding free energies than shuffled or codon choice randomized sequences. , 1999, Nucleic acids research.

[31]  D. Bartel,et al.  One sequence, two ribozymes: implications for the emergence of new ribozyme folds. , 2000, Science.

[32]  Alain Laederach,et al.  Structural effects of linkage disequilibrium on the transcriptome. , 2012, RNA.

[33]  P. Stadler,et al.  Design of multistable RNA molecules. , 2001, RNA.

[34]  Sean R. Eddy,et al.  Rfam: annotating non-coding RNAs in complete genomes , 2004, Nucleic Acids Res..

[35]  Harald Schwalbe,et al.  Three-state mechanism couples ligand and temperature sensing in riboswitches , 2013, Nature.

[36]  Magdalena A. Jonikas,et al.  Structural inference of native and partially folded RNA by high-throughput contact mapping , 2008, Proceedings of the National Academy of Sciences.

[37]  Mark C. W. van Rossum,et al.  Optimal Learning Rules for Discrete Synapses , 2008, PLoS Comput. Biol..

[38]  Magdalena A. Jonikas,et al.  Understanding the role of three-dimensional topology in determining the folding intermediates of group I introns. , 2013, Biophysical journal.

[39]  D. Mathews Using an RNA secondary structure partition function to determine confidence in base pairs predicted by free energy minimization. , 2004, RNA.

[40]  T. Earnest,et al.  Crystal Structure of the Ribosome at 5.5 Å Resolution , 2001, Science.

[41]  E. Andersen Prediction and design of DNA and RNA structures. , 2010, New biotechnology.

[42]  Manju Bansal,et al.  A novel method for prokaryotic promoter prediction based on DNA stability , 2005, BMC Bioinformatics.

[43]  Walter N. Moss,et al.  Folding and finding RNA secondary structure. , 2010, Cold Spring Harbor perspectives in biology.

[44]  Michiaki Hamada,et al.  Fighting against uncertainty: an essential issue in bioinformatics , 2013, Briefings Bioinform..

[45]  M. Huynen,et al.  Assessing the reliability of RNA folding using statistical mechanics. , 1997, Journal of molecular biology.

[46]  H E Stanley,et al.  Statistical properties of DNA sequences. , 1995, Physica A.

[47]  R. Breaker,et al.  Riboswitches as versatile gene control elements. , 2005, Current opinion in structural biology.

[48]  J. Lemay,et al.  Crossing the borders: Poly(A)-binding proteins working on both sides of the fence , 2010, RNA biology.

[49]  David N. Rider,et al.  Evaluation of Candidate Stromal Epithelial Cross-Talk Genes Identifies Association between Risk of Serous Ovarian Cancer and TERT, a Cancer Susceptibility “Hot-Spot” , 2010, PLoS genetics.

[50]  S. Sell,et al.  Identification of intergenic trans-regulatory RNAs containing a disease-linked SNP sequence and targeting cell cycle progression/differentiation pathways in multiple common human disorders , 2009, Cell cycle.

[51]  Matthias Bethge,et al.  Temporal Adaptation Enhances Efficient Contrast Gain Control on Natural Images , 2012, PLoS Comput. Biol..

[52]  Peter F. Stadler,et al.  New journal: Algorithms for Molecular Biology , 2006, Algorithms for Molecular Biology.

[53]  C. Vonrhein,et al.  Structure of the 30S ribosomal subunit , 2000, Nature.

[54]  H. Al‐Hashimi,et al.  RNA dynamics: it is about time. , 2008, Current opinion in structural biology.

[55]  Mohsen Khorshid,et al.  PAR-CliP - A Method to Identify Transcriptome-wide the Binding Sites of RNA Binding Proteins , 2010, Journal of visualized experiments : JoVE.

[56]  Yusuke Nakamura,et al.  A genome-wide association identified the common genetic variants influence disease severity in β0-thalassemia/hemoglobin E , 2009, Human Genetics.

[57]  P. Clote,et al.  Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency. , 2005, RNA.

[58]  M. Cazzola,et al.  Microelectronic DNA chip for hereditary hyperferritinemia cataract syndrome, a model for large‐scale analysis of disorders of iron metabolism , 2006, Human mutation.

[59]  Alain Laederach,et al.  Evaluating our ability to predict the structural disruption of RNA by SNPs , 2012, BMC Genomics.

[60]  Robert M. Dirks,et al.  Paradigms for computational nucleic acid design. , 2004, Nucleic acids research.

[61]  A. Laederach,et al.  Evaluation of the information content of RNA structure mapping data for secondary structure prediction. , 2010, RNA.

[62]  Ye Ding Statistical and Bayesian approaches to RNA secondary structure prediction. , 2006, RNA.

[63]  Donald C. Chang,et al.  The microRNA: overview of the RNA gene that modulates gene functions. , 2006, Methods in molecular biology.

[64]  R. Altman,et al.  SAFA: semi-automated footprinting analysis software for high-throughput quantification of nucleic acid footprinting experiments. , 2005, RNA.

[65]  Nan Yu,et al.  The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs , 2002, BMC Bioinformatics.

[66]  J. Doudna,et al.  Ribozyme structures and mechanisms. , 2000, Annual review of biochemistry.

[67]  Alain Laederach,et al.  Disease-Associated Mutations That Alter the RNA Structural Ensemble , 2010, PLoS genetics.

[68]  Lee Ann McCue,et al.  Measuring Global Credibility with Application to Local Sequence Alignment , 2008, PLoS Comput. Biol..

[69]  D. Mathews,et al.  Accurate SHAPE-directed RNA structure determination , 2009, Proceedings of the National Academy of Sciences.

[70]  Zasha Weinberg,et al.  R2R - software to speed the depiction of aesthetic consensus RNA secondary structures , 2011, BMC Bioinformatics.

[71]  J. Holton,et al.  Structures of the Bacterial Ribosome at 3.5 Å Resolution , 2005, Science.

[72]  G. Glinsky SNP-guided microRNA maps (MirMaps) of 16 common human disorders identify a clinically accessible therapy reversing transcriptional aberrations of nuclear import and inflammasome pathways , 2008, Cell cycle.

[73]  Peter F. Stadler,et al.  The RNAsnp web server: predicting SNP effects on local RNA secondary structure , 2013, Nucleic Acids Res..

[74]  D. Sornette,et al.  The US Stock Market Leads the Federal Funds Rate and Treasury Bond Yields , 2011, PloS one.

[75]  Joshua S. Martin,et al.  Exhaustive Enumeration of Kinetic Model Topologies for the Analysis of Time-Resolved RNA Folding , 2009, Algorithms.

[76]  Kaizhong Zhang,et al.  RNA molecules with structure dependent functions are uniquely folded. , 2002, Nucleic acids research.

[77]  Sean R. Eddy,et al.  Rfam: an RNA family database , 2003, Nucleic Acids Res..

[78]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[79]  K. Weeks,et al.  SHAPE-directed RNA secondary structure prediction. , 2010, Methods.

[80]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[81]  D. Mathews Using OligoWalk to identify efficient siRNA sequences. , 2010, Methods in molecular biology.

[82]  G. Glinsky Disease phenocode analysis identifies SNP-guided microRNA maps (MirMaps) associated with human "master" disease genes , 2008, Cell cycle.

[83]  Peter F. Stadler,et al.  Memory Efficient Folding Algorithms for Circular RNA Secondary Structures , 2006, German Conference on Bioinformatics.

[84]  Harry F Noller,et al.  RNA Structure: Reading the Ribosome , 2005, Science.

[85]  T. Rana,et al.  Illuminating the silence: understanding the structure and function of small RNAs , 2007, Nature Reviews Molecular Cell Biology.

[86]  Peter Clote,et al.  Computing the Partition Function and Sampling for Saturated Secondary Structures of RNA, with Respect to the Turner Energy Model , 2007, J. Comput. Biol..

[87]  R. Batey,et al.  Ligand-dependent folding of the three-way junction in the purine riboswitch. , 2008, RNA.

[88]  A. Ferré-D’Amaré,et al.  Crystal structures of the thi-box riboswitch bound to thiamine pyrophosphate analogs reveal adaptive RNA-small molecule recognition. , 2006, Structure.

[89]  David H Mathews,et al.  Predicting helical coaxial stacking in RNA multibranch loops. , 2007, RNA.