The shortest common superstring problem (SCS) is one of the fundamental optimization problems in the area of data compression and DNA sequencing. The SCS is known to be APX-hard. This paper focuses on the analysis of the approximation ratio of two greedy-based approximation algorithms for it, namely the naive Greedy algorithm and the Group-Merge algorithm. The main results of this paper are: I. We disprove the claim that the input instances of Jiang and Li {JL96} show that the Group-Merge algorithm does not provide any constant approximation for the SCS. We even prove that the Group-Merge algorithm always finds optimal solutions for these instances (except in one trivial case). II. We show that the Greedy algorithm and the Group-Merge algorithm are incomparable according to the approximation ratio.
[1]
Elizabeth Sweedyk,et al.
A 2½-Approximation Algorithm for Shortest Superstring
,
1999,
SIAM J. Comput..
[2]
David Maier,et al.
On Finding Minimal Length Superstrings
,
1980,
J. Comput. Syst. Sci..
[3]
Ming Li,et al.
Towards a DNA sequencing theory (learning a string)
,
1990,
Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.
[4]
Tao Jiang,et al.
Linear approximation of shortest superstrings
,
1994,
JACM.
[5]
Esko Ukkonen,et al.
A Greedy Approximation Algorithm for Constructing Shortest Common Superstrings
,
1988,
Theor. Comput. Sci..