The conjunction of the knowledge gradient and the economic approach to simulation selection

This paper deals with the selection of the best of a finite set of systems, where best is defined with respect to the maximum mean simulated performance. We extend the ideas of the knowledge gradient, which accounts for the expected value of one stage of simulation, by accounting for the future value of the option to simulate over multiple stages. We extend recent work on the economics of simulation, which studied discounted rewards, by balancing undiscounted simulation costs and the expected value of information from simulation runs. This contribution results in a diffusion model for comparing a single simulated system with a standard that has a known expected reward, and new stopping rules for fully sequential procedures when there are multiple systems. These stopping rules are more closely aligned with the expected opportunity cost allocations that are effective in numerical tests. We demonstrate an improvement in performance over previous methods.

[1]  S. Gupta,et al.  Bayesian look ahead one-stage sampling allocations for selection of the best population , 1996 .

[2]  Warren B. Powell,et al.  A Knowledge-Gradient Policy for Sequential Information Collection , 2008, SIAM J. Control. Optim..

[3]  Jürgen Branke,et al.  Selecting a Selection Procedure , 2007, Manag. Sci..

[4]  Stephen E. Chick,et al.  Economic Analysis of Simulation Selection Problems , 2009, Manag. Sci..

[5]  Chun-Hung Chen A lower bound for the correct subset-selection probability and its application to discrete-event system simulations , 1996, IEEE Trans. Autom. Control..

[6]  Barry L. Nelson,et al.  Chapter 17 Selecting the Best System , 2006, Simulation.

[7]  John Bather Optimal stopping problems for Brownian motion , 1970, Advances in Applied Probability.

[8]  M. Degroot Optimal Statistical Decisions , 1970 .

[9]  Jürgen Branke,et al.  Sequential Sampling to Myopically Maximize the Expected Value of Information , 2010, INFORMS J. Comput..

[10]  Stephen E. Chick,et al.  Chapter 9 Subjective Probability and Bayesian Methodology , 2006, Simulation.

[11]  Warren B. Powell,et al.  The knowledge-gradient stopping rule for ranking and selection , 2008, 2008 Winter Simulation Conference.

[12]  Stephen E. Chick,et al.  New Two-Stage and Sequential Procedures for Selecting the Best Simulated System , 2001, Oper. Res..

[13]  David Lindley,et al.  Optimal Statistical Decisions , 1971 .

[14]  Herman Chernoff,et al.  Sequential Tests for the Mean of a Normal Distribution , 1965 .