A GRASP algorithm for the Closest String Problem using a probability-based heuristic

The Closest String Problem (CSP) is the problem of finding a string whose Hamming distance from the members of a given set of strings of the same length is minimal. It has applications, among others, in bioinformatics and in coding theory. Several approximation and (meta)heuristic algorithms have been proposed for the problem to achieve 'good' but not necessarily optimal solutions within a reasonable time. In this paper, a new algorithm for the problem is proposed, based on a Greedy Randomized Adaptive Search Procedure (GRASP) and a novel probabilistic heuristic function. The algorithm is compared with three recently proposed algorithms for CSP, outperforming all of them by achieving solutions of higher quality within a few seconds in most of the experimental cases.

[1]  Mauricio G. C. Resende,et al.  Grasp: An Annotated Bibliography , 2002 .

[2]  Jens Stoye,et al.  Computation of Median Gene Clusters , 2008, RECOMB.

[3]  Celso C. Ribeiro,et al.  Greedy Randomized Adaptive Search Procedures , 2003, Handbook of Metaheuristics.

[4]  Panos M. Pardalos,et al.  A parallel multistart algorithm for the closest string problem , 2008, Comput. Oper. Res..

[5]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[6]  Seyed Rasoul Mousavi,et al.  A Memetic Algorithm for closest string problem and farthest string problem , 2010, 2010 18th Iranian Conference on Electrical Engineering.

[7]  Simone Faro,et al.  Ant-CSP: An Ant Colony Optimization Algorithm for the Closest String Problem , 2009, SOFSEM.

[8]  Jing-Chao Chen,et al.  Iterative Rounding for the Closest String Problem , 2007, ArXiv.

[9]  Bin Ma,et al.  On the closest string and substring problems , 2002, JACM.

[10]  Holger Mauch,et al.  Genetic algorithm approach for the closest string problem , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[11]  Xiaolan Liu,et al.  A Compounded Genetic and Simulated Annealing Algorithm for the Closest String Problem , 2008, 2008 2nd International Conference on Bioinformatics and Biomedical Engineering.

[12]  A. Litman,et al.  On covering problems of codes , 1997, Theory of Computing Systems.

[13]  室 章治郎 Michael R.Garey/David S.Johnson 著, "COMPUTERS AND INTRACTABILITY A guide to the Theory of NP-Completeness", FREEMAN, A5判変形判, 338+xii, \5,217, 1979 , 1980 .

[14]  Bin Ma,et al.  More Efficient Algorithms for Closest String and Substring Problems , 2008, SIAM J. Comput..

[15]  Bin Ma,et al.  Finding similar regions in many strings , 1999, STOC '99.

[16]  Marco Dorigo,et al.  Ant colony optimization theory: A survey , 2005, Theor. Comput. Sci..

[17]  M. Resende,et al.  A probabilistic heuristic for a computationally difficult set covering problem , 1989 .

[18]  Mauricio G. C. Resende,et al.  An Annotated Bibliography of Grasp Part I: Algorithms , 2022 .

[19]  Sayyed R Mousavi,et al.  A Hybridization of Constructive Beam Search with Local Search for Far From Most Strings Problem , 2010 .

[20]  Ming Li,et al.  Some string problems in computational biology , 2000 .

[21]  A. Macario,et al.  Gene Probes for Bacteria , 1990 .

[22]  Rolf Niedermeier,et al.  Fixed-Parameter Algorithms for CLOSEST STRING and Related Problems , 2003, Algorithmica.

[23]  Jens Stoye,et al.  Swiftly Computing Center Strings , 2010, WABI.

[24]  P. Pardalos,et al.  Optimization techniques for string selection and comparison problems in genomics , 2005, IEEE Engineering in Medicine and Biology Magazine.

[25]  Rolf Niedermeier,et al.  On Exact and Approximation Algorithms for Distinguishing Substring Selection , 2003, FCT.

[26]  Jianer Chen,et al.  An improved lower bound on approximation algorithms for the Closest Substring problem , 2008, Inf. Process. Lett..

[27]  P Festa,et al.  On some optimization problems in molecular biology. , 2007, Mathematical biosciences.

[28]  Panos M. Pardalos,et al.  MODELING AND SOLVING STRING SELECTION PROBLEMS , 2005 .

[29]  Ugur Dogrusoz,et al.  Combinatorial Pattern Matching: 15th Annual Symposium, CPM 2004, Istanbul, Turkey, July 5-7, 2004, Proceedings (Lecture Notes in Computer Science) , 2004 .

[30]  Panos M. Pardalos,et al.  Optimal Solutions for the Closest-String Problem via Integer Programming , 2004, INFORMS J. Comput..

[31]  Rolf Niedermeier,et al.  Exact Solutions for CLOSEST STRING and Related Problems , 2001, ISAAC.

[32]  Bin Ma,et al.  Distinguishing string selection problems , 2003, SODA '99.

[33]  Rolf Niedermeier,et al.  Closest Strings, Primer Design, and Motif Search , 2010 .

[34]  Bryant A. Julstrom,et al.  A data-based coding of candidate strings in the closest string problem , 2009, GECCO '09.

[35]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .