Structural properties and tractability results for linear synteny

Abstract The syntenic distance between two species is the minimum number of fusions, fissions, and translocations required to transform one genome into the other. The linear syntenic distance, a restricted form of this model, has been shown to be close to the syntenic distance. Both models are computationally difficult to compute and have resisted efficient approximation algorithms with non-trivial performance guarantees. In this paper, we prove that many useful properties of syntenic distance carry over to linear syntenic distance. We also give a reduction from the general linear synteny problem to the question of whether a given instance can be solved using the maximum possible number of translocations. Our main contribution is an algorithm exactly computing linear syntenic distance in nested instances of the problem. This is the first polynomial time algorithm exactly solving linear synteny for a non-trivial class of instances. It is based on a novel connection between the syntenic distance and a scheduling problem that has been studied in the operations research literature.

[1]  David Sankoff,et al.  Original Synteny , 1996, CPM.

[2]  Pavel A. Pevzner,et al.  Transforming cabbage into turnip: polynomial algorithm for sorting signed permutations by reversals , 1995, JACM.

[3]  Vineet Bafna,et al.  Genome rearrangements and sorting by reversals , 1993, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.

[4]  Henrik Eriksson,et al.  Sorting a bridge hand , 2001, Discret. Math..

[5]  Alberto Caprara,et al.  Sorting Permutations by Reversals and Eulerian Cycle Decompositions , 1999, SIAM J. Discret. Math..

[6]  D. Sankoff,et al.  Synteny conservation and chromosome rearrangements during mammalian evolution. , 1997, Genetics.

[7]  David Liben-Nowell On the Structure of Syntenic Distance , 2001, J. Comput. Biol..

[8]  David Sankoff,et al.  Exact and approximation algorithms for sorting by reversals, with application to genome rearrangement , 1995, Algorithmica.

[9]  P. Berman,et al.  On Some Tighter Inapproximability Results , 1998, Electron. Colloquium Comput. Complex..

[10]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[11]  Niklas Eriksen,et al.  (1+epsilon)-Approximation of sorting by reversals and transpositions , 2001, Theor. Comput. Sci..

[12]  Vineet Bafna,et al.  Sorting by Transpositions , 1998, SIAM J. Discret. Math..

[13]  Clyde L. Monma,et al.  A general algorithm for optimal job sequencing with series-parallel precedence constraints , 1977 .

[14]  Shietung Peng,et al.  A 2-Approximation Algorithm for Genome Rearrangements by Reversals and Transpositions , 1999, Theor. Comput. Sci..

[15]  Tao Jiang,et al.  On the Complexity and Approximation of Syntenic Distance , 1998, Discret. Appl. Math..

[16]  Hussein M. Abdel-Wahab,et al.  Scheduling to Minimize Maximum Cumulative Cost Subject to Series-Parallel Precedence Constraints , 1978, Oper. Res..

[17]  Piotr Berman,et al.  Fast Sorting by Reversal , 1996, CPM.

[18]  Pavel A. Pevzner,et al.  Transforming Cabbage into Turnip: Polynomial Algorithm for Sorting Signed Permutations by Reversals , 1999, J. ACM.

[19]  Tao Jiang,et al.  On the complexity and approximation of syntenic distance , 1997, RECOMB '97.

[20]  David Sankoff,et al.  Conserved Synteny As a Measure of Genomic Distance , 1996, Discret. Appl. Math..

[21]  Niklas Eriksen (1+epsilon)-Approximation of Sorting by Reversals and Transpositions , 2001, WABI.

[22]  David A. Christie,et al.  A 3/2-approximation algorithm for sorting by reversals , 1998, SODA '98.

[23]  Marek Karpinski,et al.  1.375-Approximation Algorithm for Sorting by Reversals , 2002, ESA.