Bi-alignments as Models of Incongruent Evolution of RNA Sequence and Secondary Structure

RNA molecules may be subject to independent selection pressures on sequence and structure. This may lead to the preservation of structural features without maintaining the exact position on the conserved sequence. Structurally analogous base pairs thus are no longer formed by homologous bases, and homologous nucleotides do not preserve their structural context. Therefore, the evolution of sequence and structure is incongruent. We model this phenomenon by introducing bi-alignments, defined as a pair of alignments, one modeling sequence homology; the other, structural homology, together with an alignment of the two alignments that models the relative shifts between conserved sequence and conserved structure. Bi-alignments therefore form a special class of four-way alignments. A preliminary survey of the Rfam database suggests that incongruent evolution is not a very rare phenomenon among structured ncRNAs and RNA elements.

[1]  Rolf Backofen,et al.  Inferring Noncoding RNA Families and Classes by Means of Genome-Scale Structure-Based Clustering , 2007, PLoS Comput. Biol..

[2]  D. Sankoff Simultaneous Solution of the RNA Folding, Alignment and Protosequence Problems , 1985 .

[3]  D. Lipman,et al.  The multiple sequence alignment problem in biology , 1988 .

[4]  Robert Giegerich,et al.  Pure multiple RNA secondary structure alignments: a progressive profile approach , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[5]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[6]  Robert D. Finn,et al.  Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families , 2017, Nucleic Acids Res..

[7]  Peter F. Stadler,et al.  Product Grammars for Alignment and Folding , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  P. Schuster,et al.  RNA multi-structure landscapes , 1993, European Biophysics Journal.

[9]  S. Altschul,et al.  A tool for multiple sequence alignment. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Peter F. Stadler,et al.  Bi-Alignments as Models of Incongruent Evolution of RNA Sequence and Structure , 2019, bioRxiv.

[11]  Sebastian Will,et al.  RNAalifold: improved consensus structure prediction for RNA alignments , 2008, BMC Bioinformatics.

[12]  Deniz Dalli,et al.  StrAl: progressive alignment of non-coding RNA using base pairing probability vectors in quadratic time , 2006, Bioinform..

[13]  Zasha Weinberg,et al.  CMfinder - a covariance model based RNA motif finding algorithm , 2006, Bioinform..

[14]  João Meidanis,et al.  Introduction to computational molecular biology , 1997 .