A method for rapid similarity analysis of RNA secondary structures

Background: Owing to the rapid expansion of RNA structure databases in recent years, efficient methods for structure comparison are in demand for function prediction and evolutionary analysis. The similarity of RNA secondary structures is usually evaluated with tree models and dynamic programming algorithms. We present here a new method for the similarity analysis of RNA secondary structures.

Results: Three sets of real data were used as input for the example applications. Set I includes the structures from 5S rRNAs. Set II includes the secondary structures from RNase P and RNase MRP. Set III includes the structures from 16S rRNAs. Our method derives reasonable phylogenetic trees for all three data sets. Moreover, our program runs faster than some existing ones.

Conclusion: The Lempel-Ziv algorithm efficiently extracts the information on repeated patterns encoded in RNA secondary structures, which makes our method an alternative for analyzing the similarity of RNA secondary structures. This method will also be useful to researchers who are interested in evolutionary analysis.
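
To make the role of the Lempel-Ziv algorithm concrete, the sketch below counts the components of the Lempel-Ziv (1976) exhaustive history of a string and uses that count in a simple normalized distance. It is an illustration only, not the authors' implementation: the abstract does not say how structures are encoded or which distance is used, so the dot-bracket input strings, the function names lz_complexity and lz_distance, and the distance formula max(c(SQ) - c(S), c(QS) - c(Q)) / max(c(S), c(Q)) are all assumptions.

```python
def lz_complexity(s: str) -> int:
    """Number of components in the Lempel-Ziv (1976) exhaustive history of s."""
    i, count, n = 0, 0, len(s)
    while i < n:
        length = 1
        # Grow the component while s[i:i+length] can be copied from a start
        # position before i (the copy may overlap the component itself).
        while i + length <= n and s[i:i + length] in s[:i + length - 1]:
            length += 1
        count += 1   # one component: the copied part plus one new symbol
        i += length  # (the final component may end without a new symbol)
    return count


def lz_distance(s: str, q: str) -> float:
    """Assumed normalized distance: how many new components each string
    adds when appended to the other, relative to the larger complexity."""
    cs, cq = lz_complexity(s), lz_complexity(q)
    if max(cs, cq) == 0:
        return 0.0  # both strings empty
    extra = max(lz_complexity(s + q) - cs, lz_complexity(q + s) - cq)
    return extra / max(cs, cq)


if __name__ == "__main__":
    # Hypothetical dot-bracket encodings of two small hairpin structures.
    a = "((((...))))..((...))"
    b = "(((....)))...((...))"
    print(lz_complexity(a), lz_complexity(b), lz_distance(a, b))
```

A matrix of such pairwise distances could then be passed to a standard tree-building method to obtain a phylogenetic tree. The substring search used here is quadratic or worse in the string length, which is acceptable for a sketch but not tuned for large data sets.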
