A Common Framework for Linear and Cyclic Multiple Sequence Alignment Problems

Circularized RNAs have received considerable attention in the last few years following the discovery that they are not only a rather common phenomenon in the transcriptomes of Eukarya and Archaea but may also have key regulatory functions. This calls for the adaptation of basic tools of sequence analysis to accommodate cyclic sequences. Here we discuss a common formal framework for linear and circular alignments as partitions that preserve (cyclic) order. We focus on the similarities and differences between the linear and the cyclic case and describe a prototypical integer linear programming (ILP) formulation.
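
To make the notion of order preservation concrete, the following is a minimal sketch and not the formulation used in the paper. It assumes a simplified pairwise setting in which an alignment is reduced to its match columns, given as pairs `(i, j)` of positions in two sequences. The function names `preserves_linear_order` and `preserves_cyclic_order` are hypothetical and chosen for illustration only.

```python
def preserves_linear_order(matches):
    """True if the matched pairs (i, j) are strictly increasing in both
    coordinates, i.e. no two match columns cross."""
    ordered = sorted(matches)
    return all(a[0] < b[0] and a[1] < b[1]
               for a, b in zip(ordered, ordered[1:]))


def preserves_cyclic_order(matches):
    """True if the matched pairs respect the cyclic order of both sequences.

    Assumption of this sketch: after sorting by the first coordinate, the
    second coordinates must form a rotation of an increasing sequence, which
    is the case when at most one descent occurs on the way around the circle.
    """
    ordered = sorted(matches)
    k = len(ordered)
    # Each position may take part in at most one match column.
    if len({i for i, _ in ordered}) < k or len({j for _, j in ordered}) < k:
        return False
    js = [j for _, j in ordered]
    # Count descents cyclically, including the wrap-around step.
    descents = sum(1 for t in range(k) if js[t] > js[(t + 1) % k])
    return descents <= 1


# A rotated set of matches is valid on circles but not on linear sequences.
rotated = [(0, 2), (1, 3), (2, 0)]
print(preserves_linear_order(rotated), preserves_cyclic_order(rotated))
```

In this simplified view the linear case is the special case without wrap-around: every set of matches that preserves the linear order also preserves the cyclic order, but not the other way around.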
