Predicting Metaheuristic Performance on Graph Coloring Problems Using Data Mining

This chapter illustrates the benefits of using data mining methods to gain greater understanding of the strengths and weaknesses of a metaheuristic across the whole of instance space. Using graph coloring as a case study, we demonstrate how the relationships between the features of instances and the performance of algorithms can be learned and visualized. The instance space (in this case, the set of all graph coloring instances) is characterized as a high-dimensional feature space, with each instance summarized by a set of metrics selected as indicative of instance hardness. We show how different instance generators produce instances with various properties, and how the performance of algorithms depends on these properties. Based on a set of tested instances, we reveal the generalized boundary in instance space where an algorithm can be expected to perform well. This boundary is called the algorithm footprint in instance space. We show how data mining methods can be used to visualize the footprint and relate its boundary to properties of the instances. In this manner, we can begin to develop a good understanding of the strengths and weaknesses of a set of algorithms, and identify opportunities to develop new hybrid approaches that exploit the combined strength and improve the performance across a broad instance space.

[1]  John N. Hooker,et al.  Needed: An Empirical Science of Algorithms , 1994, Oper. Res..

[2]  Jano I. van Hemert,et al.  Understanding TSP Difficulty by Learning from Evolved Instances , 2010, LION.

[3]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[4]  Peter Merz,et al.  Advanced Fitness Landscape Analysis and the Performance of Memetic Algorithms , 2004, Evolutionary Computation.

[5]  Alain Hertz,et al.  Using tabu search techniques for graph coloring , 1987, Computing.

[6]  Jens Gottlieb,et al.  Evolutionary Computation in Combinatorial Optimization , 2006, Lecture Notes in Computer Science.

[7]  Yoav Shoham,et al.  A portfolio approach to algorithm select , 2003, IJCAI 2003.

[8]  Kate Smith-Miles,et al.  Generalising Algorithm Performance in Instance Space: A Timetabling Case Study , 2011, LION.

[9]  Charles Fleurent,et al.  Genetic and hybrid algorithms for graph coloring , 1996, Ann. Oper. Res..

[10]  David W. Corne,et al.  Towards Landscape Analyses to Inform the Design of Hybrid Local Search for the Multiobjective Quadratic Assignment Problem , 2002, HIS.

[11]  Jano I. van Hemert,et al.  Phase Transition Properties of Clustered Travelling Salesman Problem Instances Generated with Evolutionary Computation , 2004, PPSN.

[12]  Nicolas Zufferey,et al.  A graph coloring heuristic using partial solutions and a reactive tabu scheme , 2008, Comput. Oper. Res..

[13]  B. Mohar THE LAPLACIAN SPECTRUM OF GRAPHS y , 1991 .

[14]  Ross J. W. James,et al.  A Knowledge Discovery Approach to Understanding Relationships between Scheduling Problem Structure and Heuristic Performance , 2009, LION.

[15]  Nicolas Barnier,et al.  Solving the Kirkman's schoolgirl problem in a few seconds , 2002 .

[16]  Assaf Naor,et al.  Rigorous location of phase transitions in hard optimization problems , 2005, Nature.

[17]  Graham Kendall,et al.  Hyper-Heuristics: An Emerging Direction in Modern Search Technology , 2003, Handbook of Metaheuristics.

[18]  John N. Hooker,et al.  Testing heuristics: We have it all wrong , 1995, J. Heuristics.

[19]  Kevin Leyton-Brown,et al.  : The Design and Analysis of an Algorithm Portfolio for SAT , 2007, CP.

[20]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[21]  David H. Wolpert,et al.  What makes an optimization problem hard? , 1995, Complex..

[22]  Bernd Freisleben,et al.  Fitness landscape analysis and memetic algorithms for the quadratic assignment problem , 2000, IEEE Trans. Evol. Comput..

[23]  Kate Smith-Miles,et al.  Cross-disciplinary perspectives on meta-learning for algorithm selection , 2009, CSUR.

[24]  Kate Smith-Miles,et al.  Towards insightful algorithm selection for optimisation using meta-learning concepts , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[25]  Jin-Kao Hao,et al.  Hybrid Evolutionary Algorithms for Graph Coloring , 1999, J. Comb. Optim..

[26]  Robert Schaefer Parallel Problem Solving from Nature - PPSN XI, 11th International Conference, Kraków, Poland, September 11-15, 2010. Proceedings, Part II , 2010, PPSN.

[27]  Yoav Shoham,et al.  Learning the Empirical Hardness of Optimization Problems: The Case of Combinatorial Auctions , 2002, CP.

[28]  N. Biggs Algebraic Graph Theory , 1974 .

[29]  John R. Rice,et al.  The Algorithm Selection Problem , 1976, Adv. Comput..

[30]  Alain Hertz,et al.  A survey of local search methods for graph coloring , 2004, Comput. Oper. Res..

[31]  Christian Blum,et al.  Hybrid Metaheuristics: An Introduction , 2008, Hybrid Metaheuristics.

[32]  Christian Bierwirth,et al.  Landscape Regularity and Random Walks for the Job-Shop Scheduling Problem , 2004, EvoCOP.

[33]  Cecilia R. Aragon,et al.  Optimization by Simulated Annealing: An Experimental Evaluation; Part II, Graph Coloring and Number Partitioning , 1991, Oper. Res..

[34]  Marc Schoenauer,et al.  Artificial Evolution , 2000, Lecture Notes in Computer Science.

[35]  Joseph C. Culberson,et al.  On the Futility of Blind Search: An Algorithmic View of No Free Lunch , 1998, Evolutionary Computation.

[36]  Jano I. van Hemert,et al.  Discovering the suitability of optimisation algorithms by learning from evolved instances , 2011, Annals of Mathematics and Artificial Intelligence.

[37]  El-Ghazali Talbi,et al.  A Taxonomy of Hybrid Metaheuristics , 2002, J. Heuristics.

[38]  David W. Corne,et al.  Optimisation and Generalisation: Footprints in Instance Space , 2010, PPSN.

[39]  Thomas Stützle,et al.  A review of metrics on permutations for search landscape analysis , 2007, Comput. Oper. Res..

[40]  Daniel Brélaz,et al.  New methods to color the vertices of a graph , 1979, CACM.

[41]  Christian Bessière Principles and Practice of Constraint Programming - CP 2007, 13th International Conference, CP 2007, Providence, RI, USA, September 23-27, 2007, Proceedings , 2007, CP.

[42]  P. Pardalos,et al.  The Graph Coloring Problem: A Bibliographic Survey , 1998 .

[43]  Peter C. Cheeseman,et al.  Where the Really Hard Problems Are , 1991, IJCAI.

[44]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[45]  Jin-Kao Hao,et al.  Scatter Search for Graph Coloring , 2001, Artificial Evolution.

[46]  Kate Smith-Miles,et al.  Generating Applicable Synthetic Instances for Branch Problems , 2013, Oper. Res..

[47]  Charles H. Reilly,et al.  The Effects of Coefficient Correlation Structure in Two-Dimensional Knapsack Problems on Solution Procedure Performance , 2000 .

[48]  Kate Smith-Miles,et al.  Measuring instance difficulty for combinatorial optimization problems , 2012, Comput. Oper. Res..