Efficient calculation of exact probability distributions of integer features on RNA secondary structures

BackgroundAlthough the needs for analyses of secondary structures of RNAs are increasing, prediction of the secondary structures of RNAs are not always reliable. Because an RNA may have a complicated energy landscape, comprehensive representations of the whole ensemble of the secondary structures, such as the probability distributions of various features of RNA secondary structures are required.ResultsA general method to efficiently compute the distribution of any integer scalar/vector function on the secondary structure is proposed. We also show two concrete algorithms, for Hamming distance from a reference structure and for 5ʹ − 3ʹ distance, which can be constructed by following our general method. These practical applications of this method show the effectiveness of the proposed method.ConclusionsThe proposed method provides a clear and comprehensive procedure to construct algorithms for distributions of various integer features. In addition, distributions of integer vectors, that is a combination of different integer scores, can be also described by applying our 2D expanding technique.

[1]  J. McCaskill The equilibrium partition function and base pair binding probabilities for RNA secondary structure , 1990, Biopolymers.

[2]  D. Turner,et al.  Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[3]  C. Lawrence,et al.  RNA secondary structure prediction by centroids in a Boltzmann weighted ensemble. , 2005, RNA.

[4]  S. Wolin,et al.  The La protein functions redundantly with tRNA modification enzymes to ensure tRNA structural stability. , 2006, RNA.

[5]  A. Serganov,et al.  Structural basis for gene regulation by a thiamine pyrophosphate-sensing riboswitch , 2006, Nature.

[6]  Peter Clote,et al.  RNAbor: a web server for RNA structural neighbors , 2007, Nucleic Acids Res..

[7]  C. Lawrence,et al.  Centroid estimation in discrete high-dimensional spaces with applications in biology , 2008, Proceedings of the National Academy of Sciences.

[8]  Kiyoshi Asai,et al.  Prediction of RNA secondary structure using generalized centroid estimators , 2009, Bioinform..

[9]  Ronny Lorenz,et al.  2D Projections of RNA Folding Landscapes , 2009, GCB.

[10]  Lee Aaron Newberg,et al.  Exact Calculation of Distributions on Integers, with Application to Sequence Alignment , 2009, J. Comput. Biol..

[11]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[12]  David H. Mathews,et al.  NNDB: the nearest neighbor parameter database for predicting stability of nucleic acid secondary structure , 2009, Nucleic Acids Res..

[13]  William M. Gelbart,et al.  The ends of a large RNA molecule are necessarily close , 2010, Nucleic Acids Res..

[14]  Peter Clote,et al.  Using the Fast Fourier Transform to Accelerate the Computational Search for RNA Conformational Switches , 2012, PloS one.

[15]  Christian M. Reidys,et al.  The 5′-3′ Distance of RNA Secondary Structures , 2012, J. Comput. Biol..

[16]  F. Tuorto,et al.  RNA cytosine methylation by Dnmt2 and NSun2 promotes tRNA stability and protein synthesis , 2012, Nature Structural &Molecular Biology.

[17]  Peter Clote,et al.  Expected Distance between Terminal Nucleotides of Rna Secondary Structures , 2011 .

[18]  J. Bujnicki,et al.  CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction , 2014, Nucleic acids research.

[19]  Peter Clote,et al.  Abstract: Using the Fast Fourier Transform to Accelerate the Computational Search for RNA Conformational Switches , 2013, RECOMB.

[20]  S. Pfeffer,et al.  Importance of the RNA secondary structure for the relative accumulation of clustered viral microRNAs , 2014, Nucleic acids research.

[21]  P. Clote,et al.  RNA folding pathways and kinetics using 2D energy landscapes , 2015, Journal of mathematical biology.