A grammar-based approach to RNA pseudoknotted structure prediction for aligned sequences

We propose a grammar-based method for predicting RNA secondary structures that include pseudoknots. The method relies on comparative sequence analysis: the prediction algorithm takes a multiple alignment of RNA sequences as input. It uses a stochastic multiple context-free grammar (SMCFG), which can precisely express a wide class of pseudoknots. The probability parameters of the SMCFG can be computed directly from the aligned sequences. Experimental results show that the proposed method achieves fairly high prediction performance.
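
To illustrate what computing probability parameters directly from aligned sequences could look like, the following minimal sketch estimates the joint emission probabilities of a candidate base pair from two alignment columns. It uses relative frequencies with a pseudocount. The function name `pair_emission_probs`, the pseudocount value, and the toy alignment are assumptions made for illustration; the abstract does not specify the actual estimation procedure.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGU"


def pair_emission_probs(alignment, i, j, pseudocount=1.0):
    """Estimate P(x, y) for bases observed together at columns i and j.

    Hypothetical helper: relative frequencies with additive smoothing,
    not necessarily the estimator used in the paper.
    """
    counts = Counter()
    for row in alignment:
        x, y = row[i], row[j]
        # Skip rows with a gap or ambiguity code in either column.
        if x in ALPHABET and y in ALPHABET:
            counts[(x, y)] += 1
    total = sum(counts.values()) + pseudocount * len(ALPHABET) ** 2
    return {
        (x, y): (counts[(x, y)] + pseudocount) / total
        for x, y in product(ALPHABET, repeat=2)
    }


# Toy alignment (made up): columns 0 and 4 covary as G-C / C-G pairs.
alignment = [
    "GGCAC",
    "GACUC",
    "CGCCG",
    "G-CAC",
]
probs = pair_emission_probs(alignment, 0, 4)
print(probs[("G", "C")], probs[("C", "G")])
```

In this toy input the pair (G, C) receives the highest probability because it occurs in three of the four rows, while the pseudocount keeps unseen pairs from getting probability zero.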
