The role of nucleobase interactions in RNA structure and dynamics

The intricate network of interactions observed in RNA three-dimensional structures is often described in terms of a multitude of geometrical properties, including helical parameters, base pairing/stacking, hydrogen bonding and backbone conformation. We show that a simple molecular representation consisting in one oriented bead per nucleotide can account for the fundamental structural properties of RNA. In this framework, canonical Watson-Crick, non-Watson-Crick base-pairing and base-stacking interactions can be unambiguously identified within a well-defined interaction shell. We validate this representation by performing two independent, complementary tests. First, we use it to construct a sequence-independent, knowledge-based scoring function for RNA structural prediction, which compares favorably to fully atomistic, state-of-the-art techniques. Second, we define a metric to measure deviation between RNA structures that directly reports on the differences in the base–base interaction network. The effectiveness of this metric is tested with respect to the ability to discriminate between structurally and kinetically distant RNA conformations, performing better compared to standard techniques. Taken together, our results suggest that this minimalist, nucleobase-centric representation captures the main interactions that are relevant for describing RNA structure and dynamics.

[1]  T. Darden,et al.  Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems , 1993 .

[2]  A. Serganov,et al.  A Decade of Riboswitches , 2013, Cell.

[3]  Feng Ding,et al.  RNA-Puzzles: a CASP-like evaluation of RNA three-dimensional structure prediction. , 2012, RNA.

[4]  Alberto Apostolico,et al.  Finding 3D motifs in ribosomal RNA structures , 2009, Nucleic acids research.

[5]  T. Cech,et al.  Self-splicing RNA: Autoexcision and autocyclization of the ribosomal RNA intervening sequence of tetrahymena , 1982, Cell.

[6]  Haixu Tang,et al.  RNAMotifScan: automatic identification of RNA structural motifs using secondary structural alignment , 2010, Nucleic acids research.

[7]  W. B. Arendall,et al.  RNA backbone is rotameric , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  E. Westhof,et al.  Geometric nomenclature and classification of RNA base pairs. , 2001, RNA.

[9]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[10]  A. Serganov,et al.  The crystal structure of UUCG tetraloop. , 2000, Journal of molecular biology.

[11]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[12]  Shi-Jie Chen,et al.  Physics-based de novo prediction of RNA 3D structures. , 2011, The journal of physical chemistry. B.

[13]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[14]  Thomas P. Minka The ‘summation hack’ as an outlier model , 2003 .

[15]  Massimiliano Bonomi,et al.  PLUMED 2: New feathers for an old bird , 2013, Comput. Phys. Commun..

[16]  Stephen Neidle,et al.  Principles of nucleic acid structure , 2007 .

[17]  Anton I. Petrov,et al.  Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas , 2013, RNA.

[18]  Giovanni Bussi,et al.  Ligand-induced stabilization of the aptamer terminal helix in the add adenine riboswitch , 2013, RNA.

[19]  François Major,et al.  Automated extraction and classification of RNA tertiary structure cyclic motifs , 2006, Nucleic acids research.

[20]  Kevin E. Riley,et al.  Nature and magnitude of aromatic stacking of nucleic acid bases. , 2008, Physical chemistry chemical physics : PCCP.

[21]  V. Hornak,et al.  Comparison of multiple Amber force fields and development of improved protein backbone parameters , 2006, Proteins.

[22]  G. N. Ramachandran,et al.  Stereochemistry of polypeptide chain configurations. , 1963, Journal of molecular biology.

[23]  P. Gendron,et al.  Quantitative analysis of nucleic acid three-dimensional structures. , 2001, Journal of molecular biology.

[24]  F. Major,et al.  The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data , 2008, Nature.

[25]  Amedeo Caflisch,et al.  Distribution of Reciprocal of Interatomic Distances: a Fast Structural Metric , 2022 .

[26]  Jerzy Leszczynski,et al.  Electronic properties, hydrogen bonding, stacking, and cation binding of DNA and RNA bases , 2001, Biopolymers.

[27]  Kanti V. Mardia,et al.  A Probabilistic Model of RNA Conformational Space , 2009, PLoS Comput. Biol..

[28]  P. Moore,et al.  Structural motifs in RNA. , 1999, Annual review of biochemistry.

[29]  F. Ding,et al.  Ab initio RNA folding by discrete molecular dynamics: from structure prediction to folding mechanisms. , 2008, RNA.

[30]  D Thirumalai,et al.  Coarse-grained model for predicting RNA folding thermodynamics. , 2013, The journal of physical chemistry. B.

[31]  Alessandro Laio,et al.  A simple and efficient statistical potential for scoring ensembles of protein , 2012 .

[32]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[33]  Michal Otyepka,et al.  Performance of Molecular Mechanics Force Fields for RNA Simulations: Stability of UUCG and GNRA Hairpins. , 2010, Journal of chemical theory and computation.

[34]  Eric Westhof,et al.  New metrics for comparing and assessing discrepancies between RNA 3D structures and models. , 2009, RNA.

[35]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[36]  H. Scheraga,et al.  Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. , 1976, Macromolecules.

[37]  A. Serganov,et al.  Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mRNAs. , 2004, Chemistry & biology.

[38]  Marc A. Martí-Renom,et al.  All-atom knowledge-based potential for RNA structure prediction and assessment , 2011, Bioinform..

[39]  T. Steitz,et al.  The roles of ribosomal proteins in the structure assembly, and evolution of the large ribosomal subunit. , 2004, Journal of molecular biology.

[40]  F. Fogolari,et al.  Modeling of polypeptide chains as C alpha chains, C alpha chains with C beta, and C alpha chains with ellipsoidal lateral chains. , 1996, Biophysical journal.

[41]  Alessandro Laio,et al.  Which similarity measure is better for analyzing protein structures in a molecular dynamics trajectory? , 2011, Physical chemistry chemical physics : PCCP.

[42]  Biing-Hwang Juang,et al.  Minimum classification error rate methods for speech recognition , 1997, IEEE Trans. Speech Audio Process..

[43]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997 .

[44]  John D. Westbrook,et al.  Tools for the automatic identification and classification of RNA base pairs , 2003, Nucleic Acids Res..

[45]  A. Pyle,et al.  Stepping through an RNA structure: A novel approach to conformational analysis. , 1998, Journal of molecular biology.

[46]  Angel E García,et al.  High-resolution reversible folding of hyperstable RNA tetraloops using molecular dynamics simulations , 2013, Proceedings of the National Academy of Sciences.

[47]  I. Tinoco,et al.  How RNA folds. , 1999, Journal of molecular biology.

[48]  M. Parrinello,et al.  The unfolded ensemble and folding mechanism of the C-terminal GB1 beta-hairpin. , 2008, Journal of the American Chemical Society.

[49]  Vijay S Pande,et al.  Simple few-state models reveal hidden complexity in protein folding , 2012, Proceedings of the National Academy of Sciences.

[50]  D. Baker,et al.  Atomic accuracy in predicting and designing non-canonical RNA structure , 2010, Nature Methods.

[51]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[52]  J. Šponer,et al.  Refinement of the AMBER Force Field for Nucleic Acids: Improving the Description of α/γ Conformers , 2007 .

[53]  Craig L. Zirbel,et al.  FR3D: finding local and composite recurrent structural motifs in RNA 3D structures , 2007, Journal of mathematical biology.

[54]  Adelene Y. L. Sim,et al.  Fully differentiable coarse-grained and all-atom knowledge-based potentials for RNA structure evaluation. , 2011, RNA.

[55]  M. Sundaralingam,et al.  Stereochemistry of nucleic acids and their constituents. X. solid‐slate base‐slacking patterns in nucleic acid constituents and polynucleotides , 1971, Biopolymers.

[56]  Michal Otyepka,et al.  Computer Folding of RNA Tetraloops? Are We There Yet? , 2013, Journal of chemical theory and computation.

[57]  Alessandro Laio,et al.  A simple and efficient statistical potential for scoring ensembles of protein structures , 2012, Scientific Reports.

[58]  G. Sapiro,et al.  Statistical analysis of RNA backbone , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[59]  Peter M. Kasson,et al.  GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit , 2013, Bioinform..

[60]  I. Tinoco Nucleic Acid Structures, Energetics, and Dynamics , 1996 .

[61]  D. Baker,et al.  Automated de novo prediction of native-like RNA tertiary structures , 2007, Proceedings of the National Academy of Sciences.

[62]  Yann Ponty,et al.  VARNA: Interactive drawing and editing of the RNA secondary structure , 2009, Bioinform..