Use of statistical measures for analyzing RNA secondary structures

With more and more RNA secondary structures accumulated, the need for comparing different RNA secondary structures often arises in function prediction and evolutionary analysis. Numerous efficient algorithms were developed for comparing different RNA secondary structures, but challenges remain. In this article, a new statistical measure extending the notion of relative entropy based on the proposed stochastic model is evaluated for RNA secondary structures. The results obtained from several experiments on real datasets have shown the effectiveness of the proposed approach. Moreover, the time complexity of our method is favorable by comparing with that of the existing methods which solve the similar problem. © 2008 Wiley Periodicals, Inc. J Comput Chem, 2008

[1]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[2]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[3]  L. Goddard Information Theory , 1962, Nature.

[4]  G G Brownlee,et al.  The sequence of 5 s ribosomal ribonucleic acid. , 1968, Journal of molecular biology.

[5]  S. Weissman,et al.  The primary structure of Bacillus subtilis and Bacillus stearothermophilus 5 S ribonucleic acids. , 1976, The Journal of biological chemistry.

[6]  G. Fox,et al.  Nucleotide sequence of Clostridium pasteurianum S5 rRNA , 1976, FEBS letters.

[7]  C R Woese,et al.  Phylogenetic analysis of the mycoplasmas. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Osawa,et al.  The nucleotide sequence of 5S rRNA from Mycoplasma capricolum. , 1981, Nucleic acids research.

[9]  R. T. Walker,et al.  The nucleotide sequence of the 5S rRNA from Spiroplasma species BC3 and Mycoplasma mycoides sp. capri PG3. , 1982, Nucleic acids research.

[10]  J Andersen,et al.  Unusual structural features of the 5S ribosomal RNA from Streptococcus cremoris. , 1983, Nucleic acids research.

[11]  K. Komagata,et al.  Taxonomic Significance of Cellular Fatty Acid Composition in Some Coryneform Bacteria , 1983 .

[12]  R. T. Walker,et al.  Construction of the mycoplasma evolutionary tree from 5S rRNA sequence data. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[13]  S. Osawa,et al.  Phylogenetic analysis of the coryneform bacteria by 5S rRNA sequences , 1987, Journal of bacteriology.

[14]  R. Nussinov,et al.  Tree graphs of RNA secondary structures and their comparisons. , 1989, Computers and biomedical research, an international journal.

[15]  D. Dairaghi,et al.  Secondary structure of RNase MRP RNA as predicted by phylogenetic comparison , 1993, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[16]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[17]  D. Engelke,et al.  An RNase P RNA subunit mutation affects ribosomal RNA processing. , 1996, Nucleic acids research.

[18]  Vincent Moulton,et al.  Use of RNA Secondary Structure for Studying the Evolution of RNase P and RNase MRP , 2000, Journal of Molecular Evolution.

[19]  Tiee-Jian Wu,et al.  Statistical Measures of DNA Sequence Dissimilarity under Markov Chain Models of Base Composition , 2001, Biometrics.

[20]  Laurent Tichit,et al.  RNA secondary structure comparison: exact analysis of the Zhang-Shasha tree edit algorithm , 2003, Theor. Comput. Sci..

[21]  Tian-ming Wang,et al.  A 3D Graphical Representation of RNA Secondary Structures , 2004, Journal of biomolecular structure & dynamics.

[22]  Tuan D. Pham,et al.  A probabilistic measure for alignment-free sequence comparison , 2004, Bioinform..

[23]  Kequan Ding,et al.  On A Six-Dimensional Representation of RNA Secondary Structures , 2005, Journal of biomolecular structure & dynamics.

[24]  Yu-Hua Yao,et al.  A class of 2D graphical representations of RNA secondary structures and the analysis of similarity based on them , 2005, J. Comput. Chem..

[25]  Yu-hua Yao,et al.  A 2D graphical representation of RNA secondary structures and the analysis of similarity/dissimilarity based on it , 2005 .

[26]  Sequence characterization of 5S ribosomal RNA from eight gram positive procaryotes , 1976, Journal of Molecular Evolution.

[27]  Wen Zhu,et al.  A condensed 3D graphical representation of RNA secondary structures , 2005 .

[28]  Na Liu,et al.  A method for rapid similarity analysis of RNA secondary structures , 2006, BMC Bioinformatics.

[29]  Xiao-Qing Liu,et al.  Numerical characterization of DNA sequences based on the k‐step Markov chain transition probability , 2006, J. Comput. Chem..

[30]  Tianming Wang,et al.  On Graphical and Numerical Representation of Protein Sequences , 2006, Journal of biomolecular structure & dynamics.

[31]  Tianming Wang,et al.  Analysis of protein sequences and their secondary structures based on transition matrices , 2007 .