Analyzing and provably improving fixed budget ranking and selection algorithms

This paper studies the fixed budget formulation of the Ranking and Selection (R&S) problem with independent normal samples, where the goal is to investigate different algorithms' convergence rate in terms of their resulting probability of false selection (PFS). First, we reveal that for the well-known Optimal Computing Budget Allocation (OCBA) algorithm and its two variants, a constant initial sample size (independent of the total budget) only amounts to a sub-exponential (or even polynomial) convergence rate. After that, a modification is proposed to achieve an exponential convergence rate, where the improvement is shown by a finite-sample bound on the PFS as well as numerical results. Finally, we focus on a more tractable two-design case and explicitly characterize the large deviations rate of PFS for some simplified algorithms. Our analysis not only develops insights into the algorithms' properties, but also highlights several useful techniques for analyzing the convergence rate of fixed budget R\&S algorithms.

[1]  Peter I. Frazier,et al.  Sequential Sampling with Economics of Selection Procedures , 2012, Manag. Sci..

[2]  Diego Klabjan,et al.  Improving the Expected Improvement Algorithm , 2017, NIPS.

[3]  Shane G. Henderson,et al.  An efficient fully sequential selection procedure guaranteeing probably approximately correct selection , 2017, 2017 Winter Simulation Conference (WSC).

[4]  A. Tamhane Design and Analysis of Experiments for Statistical Selection, Screening, and Multiple Comparisons , 1995 .

[5]  Loo Hay Lee,et al.  Optimal computing budget allocation for multi-objective simulation models , 2004, Proceedings of the 2004 Winter Simulation Conference, 2004..

[6]  Peter W. Glynn,et al.  A large deviations perspective on ordinal optimization , 2004, Proceedings of the 2004 Winter Simulation Conference, 2004..

[7]  Joseph Lipka,et al.  A Table of Integrals , 2010 .

[8]  Averill M. Law,et al.  Simulation Modeling and Analysis , 1982 .

[9]  R. Bechhofer A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances , 1954 .

[10]  Di Wu,et al.  PROVABLY IMPROVING THE OPTIMAL COMPUTING BUDGET ALLOCATION ALGORITHM , 2018, 2018 Winter Simulation Conference (WSC).

[11]  Daniel Russo,et al.  Simple Bayesian Algorithms for Best Arm Identification , 2016, COLT.

[12]  Jürgen Branke,et al.  New developments in ranking and selection: an empirical comparison of the three main approaches , 2005, Proceedings of the Winter Simulation Conference, 2005..

[13]  Hui Xiao,et al.  Robust ranking and selection with optimal computing budget allocation , 2017, Autom..

[14]  R. Munos,et al.  Best Arm Identification in Multi-Armed Bandits , 2010, COLT.

[15]  P. Massart,et al.  Adaptive estimation of a quadratic functional by model selection , 2000 .

[16]  Victor Kowalenko,et al.  Exactification of Stirling's Approximation for the Logarithm of the Gamma Function , 2014, 1408.1881.

[17]  I. S. Gradshteyn,et al.  Table of Integrals, Series, and Products , 1976 .

[18]  Loo Hay Lee,et al.  Stochastic Simulation Optimization - An Optimal Computing Budget Allocation , 2010, System Engineering and Operations Research.

[19]  L. Jeff Hong,et al.  Fully sequential indifference‐zone selection procedures with variance‐dependent sampling , 2006 .

[20]  Barry L. Nelson,et al.  A fully sequential procedure for indifference-zone selection in simulation , 2001, TOMC.

[21]  Alexandra Carpentier,et al.  Tight (Lower) Bounds for the Fixed Budget Best Arm Identification Bandit Problem , 2016, COLT.

[22]  Ilya O. Ryzhov,et al.  On the Convergence Rates of Expected Improvement Methods , 2016, Oper. Res..

[23]  Qing-Shan Jia,et al.  Efficient computing budget allocation for finding simplest good designs , 2013, IIE transactions : industrial engineering research & development.

[24]  Leyuan Shi,et al.  A New Budget Allocation Framework for the Expected Opportunity Cost , 2017, Oper. Res..

[25]  Barry L. Nelson,et al.  Recent advances in ranking and selection , 2007, 2007 Winter Simulation Conference.

[26]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[27]  Alexander Shapiro,et al.  Lectures on Stochastic Programming: Modeling and Theory , 2009 .

[28]  Peter I. Frazier,et al.  A Fully Sequential Elimination Procedure for Indifference-Zone Ranking and Selection with Tight Bounds on Probability of Correct Selection , 2014, Oper. Res..

[29]  Chun-Hung Chen,et al.  Ranking and Selection as Stochastic Control , 2017, IEEE Transactions on Automatic Control.