Clustering to identify RNA conformations constrained by secondary structure

RNA often folds hierarchically, so that its sequence defines its secondary structure (helical base-paired regions connected by single-stranded junctions), which subsequently defines its tertiary fold. To preserve base-pairing and chain connectivity, the three-dimensional conformations that RNA can explore are strongly confined compared to when secondary structure constraints are not enforced. Using three examples, we studied how secondary structure confines and dictates an RNA’s preferred conformations. We made use of Macromolecular Conformations by SYMbolic programming (MC-Sym) fragment assembly to generate RNA conformations constrained by secondary structure. Then, to understand the correlations between different helix placements and orientations, we robustly clustered all RNA conformations by employing unique methods to remove outliers and estimate the best number of conformational clusters. We observed that the preferred conformation (as judged by largest cluster size) for each type of RNA junction molecule tested is consistent with its biological function. Further, the improved quality of models in our pruned datasets facilitates subsequent discrimination using scoring functions based either on statistical analysis (knowledge based) or experimental data.

[1]  Ana L. N. Fred,et al.  Data clustering using evidence accumulation , 2002, Object recognition supported by user interaction for service robots.

[2]  I. Tinoco,et al.  How RNA folds. , 1999, Journal of molecular biology.

[3]  Christian Laing,et al.  Computational approaches to 3D modeling of RNA , 2010, Journal of physics. Condensed matter : an Institute of Physics journal.

[4]  N. Walter,et al.  Stability of hairpin ribozyme tertiary structure is governed by the interdomain junction , 1999, Nature Structural Biology.

[5]  D. Herschlag,et al.  The ligand-free state of the TPP riboswitch: a partially folded RNA structure. , 2010, Journal of molecular biology.

[6]  Xiang-Jun Lu,et al.  3DNA: a versatile, integrated software system for the analysis, rebuilding and visualization of three-dimensional nucleic-acid structures , 2008, Nature Protocols.

[7]  Taekjip Ha,et al.  A four-way junction accelerates hairpin ribozyme folding via a discrete intermediate , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  A. Serganov,et al.  Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mRNAs. , 2004, Chemistry & biology.

[9]  Junmei Wang,et al.  How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? , 2000, J. Comput. Chem..

[10]  D. Herschlag,et al.  Unwinding RNA's secrets: advances in the biology, physics, and modeling of complex RNAs. , 2008, Current opinion in structural biology.

[11]  R. Batey,et al.  Ligand-dependent folding of the three-way junction in the purine riboswitch. , 2008, RNA.

[12]  Russ B Altman,et al.  Turning limited experimental information into 3D models of RNA. , 2010, RNA.

[13]  E. Westhof,et al.  Topology of three-way junctions in folded RNAs. , 2006, RNA.

[14]  D. Lilley,et al.  The folding of the hairpin ribozyme: dependence on the loops and the junction. , 2000, RNA.

[15]  Batey,et al.  Tertiary Motifs in RNA Structure and Folding. , 1999, Angewandte Chemie.

[16]  Feng Ding,et al.  On the significance of an RNA tertiary structure prediction. , 2010, RNA.

[17]  D. Baker,et al.  Atomic accuracy in predicting and designing non-canonical RNA structure , 2010, Nature Methods.

[18]  R. Micura,et al.  Ligand‐Induced Folding of the Adenosine Deaminase A‐Riboswitch and Implications on Riboswitch Translational Control , 2007, Chembiochem : a European journal of chemical biology.

[19]  D. Baker,et al.  Clustering of low-energy conformations near the native structures of small proteins. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[20]  V. Pande,et al.  Do conformational biases of simple helical junctions influence RNA folding stability and specificity? , 2009, RNA.

[21]  B. Ganem RNA world , 1987, Nature.

[22]  Eric Westhof,et al.  New metrics for comparing and assessing discrepancies between RNA 3D structures and models. , 2009, RNA.

[23]  E. Westhof,et al.  Hierarchy and dynamics of RNA folding. , 1997, Annual review of biophysics and biomolecular structure.

[24]  D. Lilley,et al.  Folding of the hairpin ribozyme in its natural conformation achieves close physical proximity of the loops. , 1998, Molecular cell.

[25]  T. Schlick,et al.  Tertiary motifs revealed in analyses of higher-order RNA junctions. , 2009, Journal of molecular biology.

[26]  Vijay S Pande,et al.  Progress and challenges in the automated construction of Markov state models for full protein systems. , 2009, The Journal of chemical physics.

[27]  W. Olson,et al.  3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. , 2003, Nucleic acids research.

[28]  Ana L. N. Fred,et al.  Evidence Accumulation Clustering Based on the K-Means Algorithm , 2002, SSPR/SPR.

[29]  Michael Levitt,et al.  Growth of novel protein structural data , 2007, Proceedings of the National Academy of Sciences.

[30]  Magdalena A. Jonikas,et al.  Coarse-grained modeling of large RNA molecules with knowledge-based potentials and structural filters. , 2009, RNA.

[31]  Tao Pan,et al.  RNA folding during transcription. , 2006, Annual review of biophysics and biomolecular structure.

[32]  A. Serganov,et al.  Structural basis for gene regulation by a thiamine pyrophosphate-sensing riboswitch , 2006, Nature.

[33]  Ana L. N. Fred,et al.  Cluster Ensemble Methods: from Single Clusterings to Combined Solutions , 2008 .

[34]  Eckart Bindewald,et al.  RNAJunction: a database of RNA junctions and kissing loops for three-dimensional structural analysis and nanodesign , 2007, Nucleic Acids Res..

[35]  F. Ding,et al.  Ab initio RNA folding by discrete molecular dynamics: from structure prediction to folding mechanisms. , 2008, RNA.

[36]  Harald Schwalbe,et al.  Metal-ion binding and metal-ion induced folding of the adenine-sensing riboswitch aptamer domain , 2007, Nucleic acids research.

[37]  T. Schlick,et al.  Analysis of four-way junctions in RNA structures. , 2009, Journal of molecular biology.

[38]  D. Lilley,et al.  Folding of the adenine riboswitch. , 2006, Chemistry & biology.

[39]  D. Baker,et al.  Automated de novo prediction of native-like RNA tertiary structures , 2007, Proceedings of the National Academy of Sciences.

[40]  R. Batey,et al.  Thermodynamic and kinetic characterization of ligand binding to the purine riboswitch aptamer domain. , 2006, Journal of molecular biology.

[41]  D. Case,et al.  Theory and applications of the generalized born solvation model in macromolecular simulations , 2000, Biopolymers.

[42]  R. Montange,et al.  Riboswitches: emerging themes in RNA structure and function. , 2008, Annual review of biophysics.

[43]  H. Al‐Hashimi,et al.  Topology Links RNA Secondary Structure with Global Conformation, Dynamics, and Adaptation , 2010, Science.

[44]  I. Kuntz,et al.  Tertiary Structure Prediction , 1989 .

[45]  David A. Case,et al.  Modeling Unusual Nucleic Acid Structures , 1998 .

[46]  F. Major,et al.  The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data , 2008, Nature.

[47]  N. Ban,et al.  Structure of the Eukaryotic Thiamine Pyrophosphate Riboswitch with Its Regulatory Ligand , 2006, Science.