Finding the common structure shared by two homologous RNAs

MOTIVATION: CARNAC is a new method for pairwise folding of RNA sequences. The program combines local similarity, stem energy, and covariations to predict the folding common to both sequences. It can handle all RNA types, and has also been adapted to align a new homologous sequence against a structured reference sequence.

RESULTS: Using different data sets, we show that CARNAC provides a good partial prediction for a wide range of sequences (16S ssu rRNA, RNase P RNA, viruses) from only two sequences. Given a whole family of sequences, we also show that CARNAC can be used to detect whether the sequences actually share the same structure.

AVAILABILITY: CARNAC is available at http://www.lifl.fr/~perrique/rna/.
