On the Approximation Ratio of the Group-Merge Algorithm for the Shortest Common Superstring Problem

The shortest common superstring problem (SCS) is one of the fundamental optimization problems in the area of data compression and DNA sequencing. The SCS is known to be APX-hard. This paper focuses on the analysis of the approximation ratio of two greedy-based approximation algorithms for it, namely the naive Greedy algorithm and the Group-Merge algorithm. The main results of this paper are: I. We disprove the claim that the input instances of Jiang and Li {JL96} show that the Group-Merge algorithm does not provide any constant approximation for the SCS. We even prove that the Group-Merge algorithm always finds optimal solutions for these instances (except in one trivial case). II. We show that the Greedy algorithm and the Group-Merge algorithm are incomparable according to the approximation ratio.

[1]  Elizabeth Sweedyk,et al.  A 2½-Approximation Algorithm for Shortest Superstring , 1999, SIAM J. Comput..

[2]  David Maier,et al.  On Finding Minimal Length Superstrings , 1980, J. Comput. Syst. Sci..

[3]  Ming Li,et al.  Towards a DNA sequencing theory (learning a string) , 1990, Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.

[4]  Tao Jiang,et al.  Linear approximation of shortest superstrings , 1994, JACM.

[5]  Esko Ukkonen,et al.  A Greedy Approximation Algorithm for Constructing Shortest Common Superstrings , 1988, Theor. Comput. Sci..