论文信息 - Dynamic sample budget allocation in model-based optimization

Dynamic sample budget allocation in model-based optimization

Model-based search methods are a class of optimization techniques that search the solution space by sampling from an underlying probability distribution “model,” which is updated iteratively after evaluating the performance of the samples at each iteration. This paper aims to improve the sampling efficiency of model-based methods by considering a generalization where a population of distribution models is maintained and subsequently propagated from generation to generation. A key issue in the proposed approach is how to efficiently allocate the sampling budget among the population of models to maximize the algorithm performance. We formulate this problem as a generalized max k-armed bandit problem, and derive an efficient dynamic sample allocation scheme based on Markov decision theory to adaptively allocate computational resources. The proposed allocation scheme is then further used to update the current population to produce an improving population of models. Our preliminary numerical results indicate that the proposed procedure may considerably reduce the number of function evaluations needed to obtain high quality solutions, and thus further enhance the value of model-based methods for optimization problems that require expensive function evaluations for performance evaluation.

[1] J. A. Lozano,et al. Towards a New Evolutionary Computation: Advances on Estimation of Distribution Algorithms (Studies in Fuzziness and Soft Computing) , 2006 .

[2] L. F. Perrone,et al. APPLYING MODEL REFERENCE ADAPTIVE SEARCH TO AMERICAN-STYLE OPTION PRICING , 2006 .

[3] Kellen Petersen August. Real Analysis , 2009 .

[4] Stephen F. Smith,et al. The Max K-Armed Bandit: A New Model of Exploration Applied to Search Heuristic Selection , 2005, AAAI.

[5] Michael C. Fu,et al. A Model Reference Adaptive Search Method for Global Optimization , 2007, Oper. Res..

[6] János D. Pintér,et al. Global optimization in action , 1995 .

[7] Dirk P. Kroese,et al. The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation and Machine Learning , 2004 .

[8] Pedro Larrañaga,et al. Estimation of Distribution Algorithms , 2002, Genetic Algorithms and Evolutionary Computation.

[9] H. Mühlenbein,et al. From Recombination of Genes to the Estimation of Distributions I. Binary Parameters , 1996, PPSN.

[10] Paolo Rapisarda,et al. Proceedings 15th International Symposium on Mathematical Theory of Networks and Systems , 2002 .

[11] Leyuan Shi,et al. Nested Partitions Method for Global Optimization , 2000, Oper. Res..

[12] Mehmet Gonullu,et al. Department of Computer Science and Engineering , 2011 .

[13] Michela Milano. Principles and Practice of Constraint Programming , 2012, Lecture Notes in Computer Science.

[14] Stephen F. Smith,et al. A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem , 2006, CP.

[15] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[16] Z. Tang. Adaptive partitioned random search to global optimization , 1994, IEEE Trans. Autom. Control..

[17] Lalit M. Patnaik,et al. Genetic algorithms: a survey , 1994, Computer.

[18] Fred Glover,et al. Tabu Search: A Tutorial , 1990 .

[19] Tito Homem-de-Mello,et al. Solving the Vehicle Routing Problem with Stochastic Demands using the Cross-Entropy Method , 2005, Ann. Oper. Res..

[20] S. Marcus,et al. Model-Based Randomized Methods for Global Optimization , 2006 .

[21] Hans-Paul Schwefel,et al. Parallel Problem Solving from Nature — PPSN IV , 1996, Lecture Notes in Computer Science.

[22] Mauro Birattari,et al. Model-Based Search for Combinatorial Optimization: A Critical Survey , 2004, Ann. Oper. Res..

[23] Sheldon M. Ross,et al. Stochastic Processes , 2018, Gauge Integral Structures for Stochastic Calculus and Quantum Electrodynamics.

[24] Dirk P. Kroese,et al. The Cross Entropy Method: A Unified Approach To Combinatorial Optimization, Monte-carlo Simulation (Information Science and Statistics) , 2004 .

[25] Luca Maria Gambardella,et al. Ant colony system: a cooperative learning approach to the traveling salesman problem , 1997, IEEE Trans. Evol. Comput..

[26] C. D. Gelatt,et al. Optimization by Simulated Annealing , 1983, Science.

[27] J. A. Lozano,et al. Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .

[28] Reuven Y. Rubinstein,et al. Optimization of computer simulation models with rare events , 1997 .