Prediction of geometrically feasible three-dimensional structures of pseudoknotted RNA through free energy estimation.

Accurate free energy estimation is essential for RNA structure prediction. The widely used Turner's energy model works well for nested structures. For pseudoknotted RNAs, however, there is no effective rule for estimation of loop entropy and free energy. In this work we present a new free energy estimation method, termed the pseudoknot predictor in three-dimensional space (pk3D), which goes beyond Turner's model. Our approach treats nested and pseudoknotted structures alike in one unifying physical framework, regardless of how complex the RNA structures are. We first test the ability of pk3D in selecting native structures from a large number of decoys for a set of 43 pseudoknotted RNA molecules, with lengths ranging from 23 to 113. We find that pk3D performs slightly better than the Dirks and Pierce extension of Turner's rule. We then test pk3D for blind secondary structure prediction, and find that pk3D gives the best sensitivity and comparable positive predictive value (related to specificity) in predicting pseudoknotted RNA secondary structures, when compared with other methods. A unique strength of pk3D is that it also generates spatial arrangement of structural elements of the RNA molecule. Comparison of three-dimensional structures predicted by pk3D with the native structure measured by nuclear magnetic resonance or X-ray experiments shows that the predicted spatial arrangement of stems and loops is often similar to that found in the native structure. These close-to-native structures can be used as starting points for further refinement to derive accurate three-dimensional structures of RNA molecules, including those with pseudoknots.

[1]  C. Pleij,et al.  An APL-programmed genetic algorithm for the prediction of RNA secondary structure. , 1995, Journal of theoretical biology.

[2]  Serafim Batzoglou,et al.  CONTRAfold: RNA secondary structure prediction without physics-based models , 2006, ISMB.

[3]  D. P. Aalberts,et al.  Asymmetry in RNA pseudoknots: observation and theory , 2005, Nucleic acids research.

[4]  H. Hoos,et al.  HotKnots: heuristic prediction of RNA secondary structures including pseudoknots. , 2005, RNA.

[5]  Rong Chen,et al.  Generating properly weighted ensemble of conformations of proteins from sparse or indirect distance constraints. , 2008, The Journal of chemical physics.

[6]  I. Hofacker,et al.  Beyond energy minimization: approaches to the kinetic folding of RNA , 2008 .

[7]  Shi-Jie Chen,et al.  Predicting structures and stabilities for H-type pseudoknots with interhelix loops. , 2009, RNA.

[8]  E. Siggia,et al.  Modeling RNA folding paths with pseudoknots: application to hepatitis delta virus ribozyme. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Shi-jie Chen,et al.  Folding thermodynamics of pseudoknotted chain conformations. , 2006, The Journal of chemical physics.

[10]  Jie Liang,et al.  Discrete state model and accurate estimation of loop entropy of RNA secondary structures. , 2008, The Journal of chemical physics.

[11]  Yanga Byun,et al.  PseudoViewer: web application and web service for visualizing RNA pseudoknots and secondary structures , 2006, Nucleic Acids Res..

[12]  Song Cao,et al.  Predicting RNA pseudoknot folding thermodynamics , 2006, Nucleic acids research.

[13]  E Rivas,et al.  A dynamic programming algorithm for RNA structure prediction including pseudoknots. , 1998, Journal of molecular biology.

[14]  I. Tinoco,et al.  Estimation of Secondary Structure in Ribonucleic Acids , 1971, Nature.

[15]  Jennifer A. Doudna,et al.  A conformational switch controls hepatitis delta virus ribozyme catalysis , 2004, Nature.

[16]  Christian N. S. Pedersen,et al.  RNA Pseudoknot Prediction in Energy-Based Models , 2000, J. Comput. Biol..

[17]  Liming Cai,et al.  Rapid ab initio prediction of RNA pseudoknots via graph tree decomposition , 2007, Journal of mathematical biology.

[18]  C. Pleij,et al.  An approximation of loop free energy values of RNA H-pseudoknots. , 1999, RNA.

[19]  Michael Zuker,et al.  Mfold web server for nucleic acid folding and hybridization prediction , 2003, Nucleic Acids Res..

[20]  D. Turner,et al.  Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson-Crick base pairs. , 1998, Biochemistry.

[21]  Robert Giegerich,et al.  pknotsRG: RNA pseudoknot folding including near-optimal structures and sliding windows , 2007, Nucleic Acids Res..

[22]  B. DasGupta,et al.  Order independent structural alignment of circularly permuted proteins , 2004, The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[23]  David H Mathews,et al.  Predicting helical coaxial stacking in RNA multibranch loops. , 2007, RNA.

[24]  Reuven Bar-Yehuda,et al.  Scheduling split intervals , 2002, SODA '02.

[25]  Kevin P. Murphy,et al.  Efficient parameter estimation for RNA secondary structure prediction , 2007, ISMB/ECCB.

[26]  T. Schlick,et al.  Exploring the repertoire of RNA secondary motifs using graph theory; implications for RNA design. , 2003, Nucleic acids research.

[27]  Jie Liang,et al.  Monte Carlo sampling of near‐native structures of proteins with applications , 2006, Proteins.

[28]  Bhaskar DasGupta,et al.  Topology Independent Protein Structural Alignment , 2007, WABI.

[29]  Zhiyong Wang,et al.  FlexStem: improving predictions of RNA secondary structures with pseudoknots by reducing the search space , 2008, Bioinform..

[30]  Shi-jie Chen RNA folding: conformational statistics, folding kinetics, and ion electrostatics. , 2008, Annual review of biophysics.

[31]  D. Turner,et al.  Predicting thermodynamic properties of RNA. , 1995, Methods in enzymology.

[32]  Niles A. Pierce,et al.  A partition function algorithm for nucleic acid secondary structure including pseudoknots , 2003, J. Comput. Chem..

[33]  Rong Chen,et al.  Statistical geometry of lattice chain polymers with voids of defined shapes: sampling with strong constraints. , 2008, The Journal of chemical physics.

[34]  Robert Giegerich,et al.  A comprehensive comparison of comparative RNA structure prediction approaches , 2004, BMC Bioinformatics.

[35]  Weixiong Zhang,et al.  An Iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots , 2004, Bioinform..

[36]  F. Major,et al.  The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data , 2008, Nature.

[37]  C. Mészáros Fast Cholesky factorization for interior point methods of linear programming , 1996 .

[38]  Tatsuya Akutsu,et al.  Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots , 2000, Discret. Appl. Math..

[39]  M. Summers,et al.  NMR structure of the 101-nucleotide core encapsidation signal of the Moloney murine leukemia virus. , 2004, Journal of molecular biology.

[41]  J. Ng,et al.  PseudoBase: a database with RNA pseudoknots , 2000, Nucleic Acids Res..

[42]  Shi-jie Chen,et al.  Statistical thermodynamics for chain molecules with simple RNA tertiary contacts. , 2005, The Journal of chemical physics.

[43]  D. Turner,et al.  Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Philippe Dumas,et al.  Crystal structure of the S15–rRNA complex , 2000, Nature Structural Biology.

[45]  Humberto González Díaz,et al.  2D‐RNA‐coupling numbers: A new computational chemistry approach to link secondary structure topology with biological function , 2007, J. Comput. Chem..

[46]  E. Westhof,et al.  Hierarchy and dynamics of RNA folding. , 1997, Annual review of biophysics and biomolecular structure.

[47]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[48]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[49]  Wojciech Kasprzak,et al.  Bridging the gap in RNA structure prediction. , 2007, Current opinion in structural biology.

[50]  Jie Liang,et al.  Origin of scaling behavior of protein packing density: A sequential Monte Carlo study of compact long chain polymers , 2003, cond-mat/0301085.

[51]  B. Ganem RNA world , 1987, Nature.