Predicting coaxial helical stacking in RNA junctions

RNA junctions are important structural elements that form when three or more helices come together in space in the tertiary structures of RNA molecules. Determining their structural configuration is important for predicting RNA 3D structure. We introduce a computational method to predict, at the secondary structure level, the coaxial helical stacking arrangement in junctions, as well as classify the junction topology. Our approach uses a data mining approach known as random forests, which relies on a set of decision trees trained using length, sequence and other variables specified for any given junction. The resulting protocol predicts coaxial stacking within three- and four-way junctions with an accuracy of 81% and 77%, respectively; the accuracy increases to 83% and 87%, respectively, when knowledge from the junction family type is included. Coaxial stacking predictions for the five to ten-way junctions are less accurate (60%) due to sparse data available for training. Additionally, our application predicts the junction family with an accuracy of 85% for three-way junctions and 74% for four-way junctions. Comparisons with other methods, as well applications to unsolved RNAs, are also presented. The web server Junction-Explorer to predict junction topologies is freely available at: http://bioinformatics.njit.edu/junction.

[1]  S. Hanlon The importance of London dispersion forces in the maintenance of the deoxyribonucleic acid helix. , 1966, Biochemical and biophysical research communications.

[2]  M. Levitt Detailed Molecular Model for Transfer Ribonucleic Acid , 1969, Nature.

[3]  N. Seeman,et al.  The general structure of transfer RNA molecules. , 1974, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Stacking properties of a highly hydrophobic dinucleotide sequence, N6, N6-dimethyladenylyl(3' leads to 5')N6, N6-dimethyladenosine, occurring in 16--18-S ribosomal RNA. , 1980, European journal of biochemistry.

[5]  Y. Inoue,et al.  Stacking Properties of a Highly Hydrophobic Dinucleotide Sequence, N6,N6‐Dimeethyladenyly(3′→5′) N6,N6‐dimethyladenosine, Occurring in 16–18‐S Ribosomal RNA , 1980 .

[6]  T. Cullen,et al.  Global existence of solutions for the relativistic Boltzmann equation on the flat Robertson-Walker space-time for arbitrarily large intial data , 2005, gr-qc/0507035.

[7]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[8]  R. Garrett,et al.  Evolutionary relationships amongst archaebacteria. A comparative study of 23 S ribosomal RNAs of a sulphur-dependent extreme thermophile, an extreme halophile and a thermophilic methanogen. , 1987, Journal of molecular biology.

[9]  R. Nussinov,et al.  Origin of DNA helical structure and its sequence dependence. , 1988, Biochemistry.

[10]  A. E. Walter,et al.  Sequence dependence of stability for coaxial stacking of RNA helixes with Watson-Crick base paired interfaces. , 1994, Biochemistry.

[11]  A. E. Walter,et al.  Coaxial stacking of helixes enhances binding of oligoribonucleotides and improves predictions of RNA folding. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[12]  R. Collins,et al.  A secondary-structure model for the self-cleaving region of Neurospora VS RNA. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[13]  A. Pardi,et al.  GNRA tetraloops make a U-turn. , 1995, RNA.

[14]  N. Seeman,et al.  A nomenclature of junctions and branchpoints in nucleic acids. , 1995, Nucleic acids research.

[15]  B. Stoddard,et al.  Capturing the Structure of a Catalytic RNA Intermediate: The Hammerhead Ribozyme , 1996, Science.

[16]  A. E. Walter,et al.  Thermodynamics of coaxially stacked helixes with GA and CC mismatches. , 1996, Biochemistry.

[17]  J. Šponer,et al.  Nature of Nucleic Acid−Base Stacking: Nonempirical ab Initio and Empirical Potential Characterization of 10 Stacked Base Dimers. Comparison of Stacked and H-Bonded Base Pairs , 1996 .

[18]  D. Lilley,et al.  Global structure of four-way RNA junctions studied using fluorescence resonance energy transfer. , 1998, RNA.

[19]  D. Lilley,et al.  Folding of branched RNA species , 1998, Biopolymers.

[20]  P. Hagerman,et al.  Protein and Mg(2+)-induced conformational changes in the S15 binding site of 16 S ribosomal RNA. , 1998, Journal of molecular biology.

[21]  E Westhof,et al.  Group I-like ribozymes with a novel core organization perform obligate sequential hydrolytic cleavages at two processing sites. , 1998, RNA.

[22]  M. Honda,et al.  A Phylogenetically Conserved Stem-Loop Structure at the 5′ Border of the Internal Ribosome Entry Site of Hepatitis C Virus Is Required for Cap-Independent Viral Translation , 1999, Journal of Virology.

[23]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[24]  F. Schluenzen,et al.  Structure of Functionally Activated Small Ribosomal Subunit , 2000 .

[25]  T. Steitz,et al.  The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. , 2000, Science.

[26]  F. Schluenzen,et al.  Structure of Functionally Activated Small Ribosomal Subunit at 3.3 Å Resolution , 2000, Cell.

[27]  C. Vonrhein,et al.  Structure of the 30S ribosomal subunit , 2000, Nature.

[28]  A Yonath,et al.  Structure of functionally activated small ribosomal subunit at 3.3 angstroms resolution. , 2000, Cell.

[29]  D. Lilley,et al.  Structures of helical junctions in nucleic acids , 2000, Quarterly Reviews of Biophysics.

[30]  D. Lilley,et al.  Structure, folding and activity of the VS ribozyme: importance of the 2‐3‐6 helical junction , 2001, The EMBO journal.

[31]  Thomas A. Steitz,et al.  RNA tertiary interactions in the large ribosomal subunit: The A-minor motif , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[32]  S C Harvey,et al.  AA.AG@helix.ends: A:A and A:G base-pairs at the ends of 16 S and 23 S rRNA helices. , 2001, Journal of molecular biology.

[33]  K. Zhou,et al.  Crystal structure of an RNA tertiary domain essential to HCV IRES-mediated translation initiation , 2002, Nature Structural Biology.

[34]  P. Stadler,et al.  Secondary structure prediction for aligned RNA sequences. , 2002, Journal of molecular biology.

[35]  H. Urlaub,et al.  Hierarchical, clustered protein interactions with U4/U6 snRNA: a biochemical role for U4/U6 proteins , 2002, The EMBO journal.

[36]  H. Nielsen,et al.  DiGIR1 and NaGIR1: naturally occurring group I-like ribozymes with unique core organization and evolved biological role. , 2002, Biochimie.

[37]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[38]  T. Steitz,et al.  The roles of ribosomal proteins in the structure assembly, and evolution of the large ribosomal subunit. , 2004, Journal of molecular biology.

[39]  Taekjip Ha,et al.  Conformational flexibility of four-way junctions in RNA. , 2004, Journal of molecular biology.

[40]  R. Montange,et al.  Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine , 2004, Nature.

[41]  Anke Mulder,et al.  Cryo-EM Visualization of a Viral Internal Ribosome Entry Site Bound to Human Ribosomes The IRES Functions as an RNA-Based Translation Factor , 2004, Cell.

[42]  D. Lilley The Varkud satellite ribozyme. , 2004, RNA.

[43]  D. Turner,et al.  Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[44]  N. Pace,et al.  Crystal structure of a bacterial ribonuclease P RNA. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[45]  J. Steyaert,et al.  Influence of the π–π interaction on the hydrogen bonding capacity of stacked DNA/RNA bases , 2005, Nucleic acids research.

[46]  D. P. Aalberts,et al.  Asymmetry in RNA pseudoknots: observation and theory , 2005, Nucleic acids research.

[47]  David H Mathews,et al.  RNA Secondary Structure Analysis Using RNAstructure , 2006, Current protocols in bioinformatics.

[48]  E. Westhof,et al.  Topology of three-way junctions in folded RNAs. , 2006, RNA.

[49]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[50]  Stephen Neidle,et al.  Principles of nucleic acid structure , 2007 .

[51]  David H Mathews,et al.  Predicting helical coaxial stacking in RNA multibranch loops. , 2007, RNA.

[52]  S. Holbrook Structural principles from large RNAs. , 2008, Annual review of biophysics.

[53]  D. Lilley,et al.  The complete VS ribozyme in solution studied by small-angle X-ray scattering. , 2008, Structure.

[54]  Eckart Bindewald,et al.  RNAJunction: a database of RNA junctions and kissing loops for three-dimensional structural analysis and nanodesign , 2007, Nucleic Acids Res..

[55]  Anne Condon,et al.  RNA STRAND: The RNA Secondary Structure and Statistical Analysis Database , 2008, BMC Bioinformatics.

[56]  J. Maizel,et al.  RNA2D3D: A program for Generating, Viewing, and Comparing 3-Dimensional Models of RNA , 2008, Journal of biomolecular structure & dynamics.

[57]  T. Schlick,et al.  Annotation of tertiary interactions in RNA structures reveals variations and correlations. , 2008, RNA.

[58]  Anna Marie Pyle,et al.  Crystal Structure of a Self-Spliced Group II Intron , 2008, Science.

[59]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[60]  T. Schlick,et al.  Analysis of four-way junctions in RNA structures. , 2009, Journal of molecular biology.

[61]  Eric Westhof,et al.  The Dynamic Landscapes of RNA Architecture , 2009, Cell.

[62]  Stefanie A. Mortimer,et al.  Time-resolved RNA SHAPE chemistry: quantitative RNA structure analysis in one-second snapshots and at single-nucleotide resolution , 2009, Nature Protocols.

[63]  M. de la Peña,et al.  Three-way RNA junctions with remote tertiary contacts: a recurrent and highly versatile fold. , 2009, RNA.

[64]  T. Schlick,et al.  Tertiary motifs revealed in analyses of higher-order RNA junctions. , 2009, Journal of molecular biology.

[65]  R. Breaker,et al.  Comparative genomics reveals 104 candidate structured RNAs from bacteria, archaea, and their metagenomes , 2010, Genome Biology.

[66]  Magdalena A. Jonikas,et al.  Coarse-grained modeling of large RNA molecules with knowledge-based potentials and structural filters. , 2009, RNA.

[67]  Zasha Weinberg,et al.  Identification of candidate structured RNAs in the marine organism 'Candidatus Pelagibacter ubique' , 2009, BMC Genomics.

[68]  Kathryn D. Smith,et al.  Structural and biochemical determinants of ligand binding by the c-di-GMP riboswitch . , 2010, Biochemistry.

[69]  J. Vogel,et al.  Identification of regulatory RNAs in Bacillus subtilis , 2010, Nucleic acids research.

[70]  Christian Laing,et al.  Computational approaches to 3D modeling of RNA , 2010, Journal of physics. Condensed matter : an Institute of Physics journal.

[71]  Eric Westhof,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[72]  Nagarajan Nandagopal,et al.  A two-length-scale polymer theory for RNA loop free energies and helix stacking. , 2010, RNA.

[73]  D. Lilley,et al.  Structure of the three-way helical junction of the hepatitis C virus IRES element. , 2010, RNA.

[74]  K. Réblová,et al.  Molecular dynamics simulations suggest that RNA three-way junctions can act as flexible RNA structural elements in the ribosome , 2010, Nucleic acids research.