RNALfoldz: Efficient Prediction of Thermodynamically Stable, Local Secondary Structures

The search for local RNA secondary structures and the annotation of unusually stable folding regions in genomic sequences are two well motivated bioinformatic problems. In this contribution we introduce RNALfoldz an efficient solution two tackle both tasks. It is an extension of the RNALfold algorithm augmented by support vector regression for efficient calculation of a structure’s thermodynamic stability. We demonstrate the applicability of this approach on the genome of E. coli and investigate a potential strategy to determine z-score cutoffs given a predefined false discovery rate.

[1]  Peter F Stadler,et al.  Fast and reliable prediction of noncoding RNAs , 2005, Proc. Natl. Acad. Sci. USA.

[2]  Elena Rivas,et al.  Secondary structure alone is generally not statistically significant for the detection of noncoding RNAs , 2000, Bioinform..

[3]  Vincent Moulton,et al.  A comparison of RNA folding measures , 2005, BMC Bioinformatics.

[4]  Peter F. Stadler,et al.  SnoReport: computational identification of snoRNAs with unknown targets , 2008, Bioinform..

[5]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[6]  S. Le,et al.  A method for assessing the statistical significance of RNA folding. , 1989, Journal of theoretical biology.

[7]  Peng Jiang,et al.  MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features , 2007, Nucleic Acids Res..

[8]  Robert Giegerich,et al.  Abstract shapes of RNA. , 2004, Nucleic acids research.

[9]  Bruce A. Shapiro,et al.  A computational procedure for assessing the significance of RNA secondary structure , 1990, Comput. Appl. Biosci..

[10]  Z. Deng,et al.  sRNAscanner: A Computational Tool for Intergenic Small RNA Detection in Bacterial Genomes , 2010, PloS one.

[11]  Peter F. Stadler,et al.  Prediction of locally stable RNA secondary structures for genome-wide surveys , 2004, Bioinform..

[12]  Wei-Min Liu,et al.  A data mining approach to discover unusual folding regions in genome sequences , 2002, Knowl. Based Syst..

[13]  Michal Ziv-Ukelson,et al.  RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry , 2009, BMC Bioinformatics.

[14]  Michael Zuker,et al.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information , 1981, Nucleic Acids Res..

[15]  P. Clote,et al.  Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency. , 2005, RNA.

[16]  Guohui Lin,et al.  Rnall: an Efficient Algorithm for Predicting Rna Local Secondary Structural Landscape in Genomes , 2006, J. Bioinform. Comput. Biol..