On the greedy algorithm for the Shortest Common Superstring problem with reversals

Abstract We study a variation of the classical Shortest Common Superstring (SCS) problem in which a shortest superstring of a finite set of strings S is sought containing as a factor every string of S or its reversal. We call this problem Shortest Common Superstring with Reversals (SCS-R). This problem has been introduced by Jiang et al. [9] , who designed a greedy-like algorithm with length approximation ratio 4. In this paper, we show that a natural adaptation of the classical greedy algorithm for SCS has (optimal) compression ratio 1 2 , i.e., the sum of the overlaps in the output string is at least half the sum of the overlaps in an optimal solution. We also provide a linear-time implementation of our algorithm.

[1]  Marcin Mucha,et al.  Lyndon Words and Short Superstrings , 2012, SODA.

[2]  Tao Jiang,et al.  Approximating Shortest Superstrings with Constraints , 1994, Theor. Comput. Sci..

[3]  Tao Jiang,et al.  A Note on Shortest Superstrings with Flipping , 1992, Inf. Process. Lett..

[4]  Haim Kaplan,et al.  The greedy algorithm for shortest superstrings , 2005, Inf. Process. Lett..

[5]  Leonidas S. Pitsoulis,et al.  The Shortest Superstring Problem , 2014 .

[6]  John Gallant String compression algorithms , 1982 .

[7]  Jonathan S. Turner,et al.  Approximation Algorithms for the Shortest Common Superstring Problem , 1989, Inf. Comput..

[8]  Wojciech Rytter,et al.  Algorithms for Three Versions of the Shortest Common Superstring Problem , 2010, CPM.

[9]  Eugene W. Myers,et al.  Combinatorial algorithms for DNA sequence assembly , 1995, Algorithmica.

[10]  Esko Ukkonen,et al.  A Greedy Approximation Algorithm for Constructing Shortest Common Superstrings , 1988, Theor. Comput. Sci..

[11]  Moshe Lewenstein,et al.  Restricted Common Superstring and Restricted Common Supersequence , 2011, CPM.

[12]  Esko Ukkonen A linear-time algorithm for finding approximate shortest common superstrings , 2005, Algorithmica.

[13]  Tao Jiang,et al.  Linear approximation of shortest superstrings , 1994, JACM.

[14]  Moshe Lewenstein,et al.  On Shortest Common Superstring and Swap Permutations , 2010, SPIRE.

[15]  David Maier,et al.  On Finding Minimal Length Superstrings , 1980, J. Comput. Syst. Sci..

[16]  Peter Kulchyski and , 2015 .

[17]  Bin Ma Why greed works for shortest common superstring problem , 2009, Theor. Comput. Sci..