Simulation selection for empirical model comparison

We propose an efficient statistical method for the empirical model comparison, which is typically referred to as a simulation procedure to evaluate multiple statistical learning algorithms. First, we use experimental designs to appropriately construct the training and test sets for estimating the empirical performances of these models using the mean square errors. Second, we apply the idea of Bayesian fully sequential ranking and selection to optimally allocate the simulation budget according to the value of information. To make the procedure computationally tractable, we assume a normal-Wishart prior distribution, and propose a new approximation scheme for the posterior distribution by matching it with a normal-Wishart distribution using the first-order moment. Numerical experiments are conducted to show the superiority of the proposed approach on empirical model comparison problems.

[1]  Jun Luo,et al.  Fully Sequential Procedures for Large-Scale Ranking-and-Selection Problems in Parallel Computing Environments , 2015, Oper. Res..

[2]  Stephen E. Chick,et al.  Bayesian Ideas and Discrete Event Simulation: Why, What and How , 2006, Proceedings of the 2006 Winter Simulation Conference.

[3]  Huashuai Qu,et al.  Sequential Selection with Unknown Correlation Structures , 2015, Oper. Res..

[4]  M. Degroot Optimal Statistical Decisions , 1970 .

[5]  Sonja Kuhnt,et al.  Design and analysis of computer experiments , 2010 .

[6]  Runze Li,et al.  Design and Modeling for Computer Experiments , 2005 .

[7]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[8]  Barry L. Nelson,et al.  A brief introduction to optimization via simulation , 2009, Proceedings of the 2009 Winter Simulation Conference (WSC).

[9]  Warren B. Powell,et al.  Optimal Learning , 2022, Encyclopedia of Machine Learning and Data Mining.

[10]  Peter I. Frazier,et al.  Sequential Bayes-Optimal Policies for Multiple Comparisons with a Known Standard , 2013, Oper. Res..

[11]  Peter Z. G. Qian Sliced Latin Hypercube Designs , 2012 .

[12]  Barry L. Nelson,et al.  Recent advances in ranking and selection , 2007, 2007 Winter Simulation Conference.

[13]  Peter Z. G. Qian,et al.  Making corrections to your proof , 2012 .

[14]  Warren B. Powell,et al.  The Knowledge-Gradient Policy for Correlated Normal Beliefs , 2009, INFORMS J. Comput..

[15]  Barry L. Nelson,et al.  A fully sequential procedure for indifference-zone selection in simulation , 2001, TOMC.

[16]  Barry L. Nelson,et al.  On the Asymptotic Validity of Fully Sequential Selection Procedures for Steady-State Simulation , 2006, Oper. Res..

[17]  A. Rukhin Matrix Variate Distributions , 1999, The Multivariate Normal Distribution.

[18]  F. Al-Shamali,et al.  Author Biographies. , 2015, Journal of social work in disability & rehabilitation.

[19]  Peter I. Frazier,et al.  Sequential Sampling with Economics of Selection Procedures , 2012, Manag. Sci..

[20]  T. J. Mitchell,et al.  Bayesian design and analysis of computer experiments: Use of derivatives in surface prediction , 1993 .

[21]  Thomas J. Santner,et al.  The Design and Analysis of Computer Experiments , 2003, Springer Series in Statistics.