A string cover C of a set of strings S is a set of substrings from S such that every string in S can be written as a concatenation of the strings in C. Given costs assigned to each substring from S, the Minimum String Cover (MSC) problem asks for a cover of minimum total cost. This NP-hard problem has so far only been approached from a purely theoretical perspective. A previous integer linear programming (ILP) formulation was designed for a special case, in which each string in S must be generated by a (small) constant number of substrings. If this restriction is removed, the ILP has an exponential number of variables, for which we show the pricing problem to be NP-hard. We propose an alternative flow-based ILP formulation of polynomial size, whose structure is particularly favorable for a Lagrangian relaxation approach. By making use of the strong bounds obtained through a repeated shortest path computation in a branch-and-bound manner, we show for the first time that non-trivial MSC instances can be solved to provable optimality in reasonable time. We also provide and solve real-world instances derived from the classic text "Alice in Wonderland". On almost all instances, our Lagrangian relaxation approach outperforms a CPLEX-based implementation by an order of magnitude. Our software is available under the terms of the GNU general public license.
[1]
Richard M. Karp,et al.
The traveling-salesman problem and minimum spanning trees: Part II
,
1971,
Math. Program..
[2]
Jean Néraud.
Elementariness of a finite set of words is co-NP-complete
,
1990,
RAIRO Theor. Informatics Appl..
[3]
John N. Tsitsiklis,et al.
Introduction to linear optimization
,
1997,
Athena scientific optimization and computation series.
[4]
Laurence A. Wolsey,et al.
Integer and Combinatorial Optimization
,
1988
.
[5]
Michael R. Fellows,et al.
Parameterized complexity analysis in computational biology
,
1995,
Comput. Appl. Biosci..
[6]
Alexander Schrijver,et al.
Theory of linear and integer programming
,
1986,
Wiley-Interscience series in discrete mathematics and optimization.
[7]
Dror Rawitz,et al.
The Minimum Substring Cover problem
,
2008,
Inf. Comput..