An RNA secondary structure prediction method based on minimum and suboptimal free energy structures.

The function of an RNA-molecule is mainly determined by its tertiary structures. And its secondary structure is an important determinant of its tertiary structure. The comparative methods usually give better results than the single-sequence methods. Based on minimum and suboptimal free energy structures, the paper presents a novel method for predicting conserved secondary structure of a group of related RNAs. In the method, the information from the known RNA structures is used as training data in a SVM (Support Vector Machine) classifier. Our method has been tested on the benchmark dataset given by Puton et al. The results show that the average sensitivity of our method is higher than that of other comparative methods such as CentroidAlifold, MXScrana, RNAalifold, and TurboFold.

[1]  M. Huynen,et al.  Automatic detection of conserved RNA structure elements in complete RNA virus genomes. , 1998, Nucleic acids research.

[2]  Jerrold R. Griggs,et al.  Algorithms for Loop Matchings , 1978 .

[3]  Gaurav Sharma,et al.  TurboFold: Iterative probabilistic estimation of secondary structures for multiple RNA sequences , 2011, BMC Bioinformatics.

[4]  Bjarne Knudsen,et al.  Multithreaded comparative RNA secondary structure prediction using stochastic context-free grammars , 2011, BMC Bioinformatics.

[5]  D. Turner,et al.  Improved predictions of secondary structures for RNA. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Yasuo Tabei,et al.  A local multiple alignment method for detection of non-coding RNA sequences , 2009, Bioinform..

[7]  Robert Giegerich,et al.  A comprehensive comparison of comparative RNA structure prediction approaches , 2004, BMC Bioinformatics.

[8]  Zhang Tao-tao A Review of RNA Secondary Structure Prediction Algorithms , 2008 .

[9]  Kiyoshi Asai,et al.  Improving the accuracy of predicting secondary structure for aligned RNA sequences , 2010, Nucleic Acids Res..

[10]  M. Zuker On finding all suboptimal foldings of an RNA molecule. , 1989, Science.

[11]  Rolf Backofen,et al.  MARNA: A server for multiple alignment of RNAs , 2003, German Conference on Bioinformatics.

[12]  Hélène Touzet,et al.  CARNAC: folding families of related RNAs , 2004, Nucleic Acids Res..

[13]  P. Stadler,et al.  Secondary structure prediction for aligned RNA sequences. , 2002, Journal of molecular biology.

[14]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[15]  Jianfeng Xu,et al.  Feature selection for SVM via optimization of kernel polarization with Gaussian ARD kernels , 2010, Expert Syst. Appl..

[16]  Markus E. Nebel,et al.  Analysis of the Free Energy in a Stochastic RNA Secondary Structure Model , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  Herbert H. Tsang,et al.  SARNA-Predict: Accuracy Improvement of RNA Secondary Structure Prediction Using Permutation-Based Simulated Annealing , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[18]  I. Tinoco,et al.  Estimation of Secondary Structure in Ribonucleic Acids , 1971, Nature.

[19]  Yang Liu,et al.  Predicting RNA secondary structure based on the class information and Hopfield network , 2009, Comput. Biol. Medicine.

[20]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[21]  Cangzhi Jia,et al.  A high-accuracy protein structural class prediction algorithm using predicted secondary structural information. , 2010, Journal of theoretical biology.

[22]  K. Chou,et al.  iLoc-Euk: A Multi-Label Classifier for Predicting the Subcellular Localization of Singleplex and Multiplex Eukaryotic Proteins , 2011, PloS one.

[23]  J. Bujnicki,et al.  CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction , 2014, Nucleic acids research.

[24]  D. Turner,et al.  Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.