Computational approaches to 3D modeling of RNA

Recent discoveries have revealed the versatility of RNA and its importance in a variety of cellular functions. Because the structural features of RNA are central to its biological function, there is much interest in predicting RNA structure, either in free form or in interaction with ligands such as proteins, metabolites and other molecules. In recent years, an increasing number of researchers have developed novel algorithms for predicting RNA secondary and tertiary structures. In this review, we describe current experimental and computational advances and discuss recent ideas that are transforming the traditional view of RNA folding. To evaluate the most recent RNA 3D folding algorithms, we present a comparative study that tests available 3D structure prediction algorithms on a data set of 43 RNA structures of various lengths and motifs. We find that the algorithms vary widely in prediction quality across RNA lengths and topologies; most predictions have very large root mean square deviations from the experimental structure. We conclude by outlining some suggestions for future RNA folding research.
