Discovering Action Regions for Solving the Bin Packing Problem through Hyper-heuristics

Hyper-heuristics represent an innovative method for solving hard combinatorial problems such as the Bin Packing Problem. In this work, we propose a solution model that incorporates insights from unsupervised learning to produce solvers that base their decisions on maximizing a reward. The solution model takes the form of a hyper-heuristic. As the search progresses, this strategy chooses among different heuristics, adapting the solution process to the current problem state under exploration. The proposed model relies on the k-means clustering algorithm to identify the centroids of what we define as “action regions” (regions where we can recommend one particular heuristic). To recommend a heuristic in such action regions, we use a simple yet useful reward-based approach that analyzes the performance of individual heuristics. Then, the hyper-heuristic (a collection of action regions) decides which heuristic is the most suitable one to apply given one specific problem state. The experimental setup was carried out on a total of 580 bin packing instances with promising results.

[1]  Gabriela Ochoa,et al.  A unified hyper-heuristic framework for solving bin packing problems , 2014, Expert Syst. Appl..

[2]  Ender Özcan,et al.  Exploring heuristic interactions in constraint satisfaction problems: A closer look at the hyper-heuristic space , 2013, 2013 IEEE Congress on Evolutionary Computation.

[3]  Jiong Tang,et al.  A Reinforcement Learning Hyper-Heuristic in Multi-Objective Single Point Search with Application to Structural Fault Identification , 2018, ArXiv.

[4]  Manuel Iori,et al.  Bin packing and cutting stock problems: Mathematical models and exact algorithms , 2016, Eur. J. Oper. Res..

[5]  Ben Paechter,et al.  A Hyper-Heuristic Classifier for One Dimensional Bin Packing Problems: Improving Classification Accuracy by Attribute Evolution , 2012, PPSN.

[6]  Carlos A. Coello Coello,et al.  Tailoring Instances of the 1D Bin Packing Problem for Assessing Strengths and Weaknesses of Its Solvers , 2018, PPSN.

[7]  Zhi-Hua Zhou,et al.  Selection Hyper-heuristics Can Provably Be Helpful in Evolutionary Multi-objective Optimization , 2016, PPSN.

[8]  Zsolt Tuza,et al.  Online Results for Black and White Bin Packing , 2014, Theory of Computing Systems.

[9]  P. Tetali,et al.  Multidimensional Bin Packing and Other Related Problems : A Survey ∗ , 2016 .

[10]  Hugo Terashima-Marín,et al.  Evolving feature selection for characterizing and solving the 1D and 2D bin packing problem , 2013, 2013 IEEE Congress on Evolutionary Computation.

[11]  Edmund K. Burke,et al.  A Reinforcement Learning - Great-Deluge Hyper-Heuristic for Examination Timetabling , 2010, Int. J. Appl. Metaheuristic Comput..

[12]  Ender Özcan,et al.  CHAMP: Creating heuristics via many parameters for online bin packing , 2016, Expert Syst. Appl..

[13]  Michel Gendreau,et al.  Hyper-heuristics: a survey of the state of the art , 2013, J. Oper. Res. Soc..

[14]  Sanja Petrovic,et al.  Case-based heuristic selection for timetabling problems , 2006, J. Sched..

[15]  Graham Kendall,et al.  Hyper-Heuristics: An Emerging Direction in Modern Search Technology , 2003, Handbook of Metaheuristics.

[16]  Gerhard Wäscher,et al.  An improved typology of cutting and packing problems , 2007, Eur. J. Oper. Res..

[17]  Hugo Terashima-Marín,et al.  Evolutionary hyper-heuristics for tackling bi-objective 2D bin packing problems , 2018, Genetic Programming and Evolvable Machines.

[18]  Michele Garraffa,et al.  The one-dimensional cutting stock problem with sequence-dependent cut losses , 2016, Int. Trans. Oper. Res..

[19]  Mohamed Abdel-Basset,et al.  An improved nature inspired meta-heuristic algorithm for 1-D bin packing problems , 2018, Personal and Ubiquitous Computing.

[20]  Mehdi Serairi,et al.  Heuristics for the variable sized bin-packing problem , 2009, Comput. Oper. Res..

[21]  Nikos A. Vlassis,et al.  The global k-means clustering algorithm , 2003, Pattern Recognit..

[22]  Aurora Trinidad Ramirez Pozo,et al.  Hyper-Heuristic Based Product Selection for Software Product Line Testing , 2017, IEEE Computational Intelligence Magazine.

[23]  Andrea Lodi,et al.  MIPLIB 2010 , 2011, Math. Program. Comput..

[24]  Mark Sh. Levin Towards Bin Packing (preliminary problem survey, models with multiset estimates) , 2016, ArXiv.

[25]  Ender Özcan,et al.  A Learning Automata-Based Multiobjective Hyper-Heuristic , 2019, IEEE Transactions on Evolutionary Computation.

[26]  Mitra Montazeri,et al.  Hyper-heuristic Image Enhancement (HHIE): A Reinforcement Learning Method for Image Contrast Enhancement , 2020 .

[27]  Edmund K. Burke,et al.  A Classification of Hyper-Heuristic Approaches: Revisited , 2018, Handbook of Metaheuristics.

[28]  Hugo Terashima-Marín,et al.  Learning vector quantization for variable ordering in constraint satisfaction problems , 2013, Pattern Recognit. Lett..

[29]  Shokri Z. Selim,et al.  K-Means-Type Algorithms: A Generalized Convergence Theorem and Characterization of Local Optimality , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Per Kristian Lehre,et al.  Limits to Learning in Reinforcement Learning Hyper-heuristics , 2016, EvoCOP.

[31]  Yinghui Xu,et al.  Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method , 2017, ArXiv.

[32]  Ben Paechter,et al.  A Lifelong Learning Hyper-heuristic Method for Bin Packing , 2015, Evolutionary Computation.

[33]  Andrei Lissovoi,et al.  On the Time Complexity of Algorithm Selection Hyper-Heuristics for Multimodal Optimisation , 2019, AAAI.

[34]  Chee Peng Lim,et al.  Automatic design of hyper-heuristic based on reinforcement learning , 2018, Inf. Sci..

[35]  Edmund K. Burke,et al.  Recent advances in selection hyper-heuristics , 2020, Eur. J. Oper. Res..

[36]  Emanuel Falkenauer,et al.  A hybrid grouping genetic algorithm for bin packing , 1996, J. Heuristics.

[37]  Ender Özcan,et al.  Sparse, Continuous Policy Representations for Uniform Online Bin Packing via Regression of Interpolants , 2017, EvoCOP.

[38]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[39]  Manuel Iori,et al.  BPPLIB: a library for bin packing and cutting stock problems , 2018, Optim. Lett..

[40]  Edmund K. Burke,et al.  A methodology for determining an effective subset of heuristics in selection hyper-heuristics , 2017, Eur. J. Oper. Res..