A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem
暂无分享,去创建一个
[1] Eric P. Smith,et al. An Introduction to Statistical Modeling of Extreme Values , 2002, Technometrics.
[2] Rolf H. Möhring,et al. Solving Project Scheduling Problems by Minimum Cut Computations , 2002, Manag. Sci..
[3] P. W. Jones,et al. Bandit Problems, Sequential Allocation of Experiments , 1987 .
[4] Leslie Pack Kaelbling,et al. Learning in embedded systems , 1993 .
[5] Stephen F. Smith,et al. Heuristic Selection for Stochastic Search Optimization: Modeling Solution Quality by Extreme Value Theory , 2004, CP.
[6] H. Robbins. Some aspects of the sequential design of experiments , 1952 .
[7] Stephen F. Smith,et al. The Max K-Armed Bandit: A New Model of Exploration Applied to Search Heuristic Selection , 2005, AAAI.
[8] Stephen F. Smith,et al. An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem , 2006, AAAI.
[9] Professor Dr. Klaus Neumann,et al. Project Scheduling with Time Windows and Scarce Resources , 2003, Springer Berlin Heidelberg.
[10] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.
[11] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[12] T. Lai. Adaptive treatment allocation and the multi-armed bandit problem , 1987 .
[13] Christoph Schwindt,et al. Generation of Resource-Constrained Project Scheduling Problems with Minimal and Maximal Time Lags , 1998 .