论文信息 - Bandit-Based Monte-Carlo Planning for the Single-Machine Total Weighted Tardiness Scheduling Problem

Bandit-Based Monte-Carlo Planning for the Single-Machine Total Weighted Tardiness Scheduling Problem

The balance of exploration and exploitation is the essence of any successful meta-heuristic. The Multi-armed Bandit Problem represents a simple form of this general dilemma. This paper describes two heuristic optimization methods that use a simple yet efficient allocation strategy for the bandit problem called UCB1 to control the optimization process. The algorithms are applied to the well known Single Machine Total Weighted Tardiness Problem and the results compared to the results of other successful meta-heuristics for this scheduling problem.

Gabriel Kronberger | Roland Braune

[1] Matthijs den Besten,et al. Ant Colony Optimization for the Total Weighted Tardiness Problem , 2000, PPSN.

[2] Xin Yao,et al. Parallel Problem Solving from Nature PPSN VI , 2000, Lecture Notes in Computer Science.

[3] Olivier Teytaud,et al. Modification of UCT with Patterns in Monte-Carlo Go , 2006 .

[4] Neil D. Lawrence,et al. Missing Data in Kernel PCA , 2006, ECML.

[5] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[6] R. Storer,et al. A problem space algorithm for single machine weighted tardiness problems , 2003 .

[7] Csaba Szepesvári,et al. Bandit Based Monte-Carlo Planning , 2006, ECML.

[8] Chris N. Potts,et al. An Iterated Dynasearch Algorithm for the Single-Machine Total Weighted Tardiness Scheduling Problem , 2002, INFORMS J. Comput..

[9] Furkan Kiraç,et al. A tabu search algorithm for the single machine total weighted tardiness problem , 2007, Eur. J. Oper. Res..

[10] John E. Beasley,et al. OR-Library: Distributing Test Problems by Electronic Mail , 1990 .