On the significance of an RNA tertiary structure prediction.

Tertiary structure prediction is important for understanding structure-function relationships for RNAs whose structures are unknown and for characterizing RNA states recalcitrant to direct analysis. However, it is unknown what root-mean-square deviation (RMSD) corresponds to a statistically significant RNA tertiary structure prediction. We use discrete molecular dynamics to generate RNA-like folds for structures up to 161 nucleotides (nt) that have complex tertiary interactions and then determine the distribution of RMSDs between these decoys. These distributions are Gaussian-like. The mean RMSD increases with RNA length and is smaller if secondary structure constraints are imposed while generating decoys. The compactness of RNA molecules with true tertiary folds is intermediate between that of closely packed spheres and that of a freely jointed chain. We use this scaling relationship to define an expression relating RMSD to the confidence that a structure prediction is better than expected by chance. This confidence is the prediction significance and corresponds to a P-value. For a 100-nt RNA, the RMSD of predicted structures should be within 25 Å of the accepted structure to reach the P ≤ 0.01 level if the secondary structure is predicted de novo, and within 14 Å if secondary structure information is used as a constraint. This significance approach should be useful for evaluating diverse RNA structure prediction and molecular modeling algorithms.
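As a minimal sketch of how a Gaussian-like decoy distribution can be turned into a prediction significance, the Python below treats the P-value as the probability that a random decoy has an RMSD at or below that of the prediction. This is an illustration under stated assumptions, not the expression derived in the paper: the function names, the plain normal approximation, and the decoy RMSD values are all hypothetical, and the length-dependent scaling of the mean RMSD is not modeled.

```python
from statistics import NormalDist


def fit_decoys(decoy_rmsds):
    """Fit a normal distribution to decoy RMSDs; return (mean, sd) in angstroms."""
    dist = NormalDist.from_samples(decoy_rmsds)
    return dist.mean, dist.stdev


def prediction_p_value(rmsd, decoy_mean, decoy_sd):
    """One-sided P-value: probability that a random decoy has RMSD <= rmsd."""
    return NormalDist(decoy_mean, decoy_sd).cdf(rmsd)


def rmsd_cutoff(p_level, decoy_mean, decoy_sd):
    """Largest RMSD that still reaches the requested significance level."""
    return NormalDist(decoy_mean, decoy_sd).inv_cdf(p_level)


# Hypothetical decoy RMSDs in angstroms; these are NOT data from the paper.
decoys = [28.0, 31.5, 30.2, 33.1, 29.4, 32.0, 30.8, 27.9, 31.1, 34.0]
mu, sd = fit_decoys(decoys)

p = prediction_p_value(25.0, mu, sd)   # significance of a 25 angstrom prediction
cutoff = rmsd_cutoff(0.01, mu, sd)     # RMSD needed to reach P <= 0.01

print(f"decoy mean = {mu:.2f}, sd = {sd:.2f}")
print(f"P(RMSD <= 25.0) = {p:.4g}")
print(f"RMSD cutoff for P <= 0.01: {cutoff:.2f}")
```

With real decoys, the mean and standard deviation would come from the simulated ensemble for an RNA of the relevant length, generated with or without secondary structure constraints to match how the prediction was made.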
