论文信息 - Bias-Variance Techniques for Monte Carlo Optimization: Cross-validation for the CE Method

Bias-Variance Techniques for Monte Carlo Optimization: Cross-validation for the CE Method

In this paper, we examine the CE method in the broad context of Monte Carlo Optimization (MCO) and Parametric Learning (PL), a type of machine learning. A well-known overarching principle used to improve the performance of many PL algorithms is the bias-variance tradeoff. This tradeoff has been used to improve PL algorithms ranging from Monte Carlo estimation of integrals, to linear estimation, to general statistical estimation. Moreover, as described by, MCO is very closely related to PL. Owing to this similarity, the bias-variance tradeoff affects MCO performance, just as it does PL performance. In this article, we exploit the bias-variance tradeoff to enhance the performance of MCO algorithms. We use the technique of cross-validation, a technique based on the bias-variance tradeoff, to significantly improve the performance of the Cross Entropy (CE) method, which is an MCO algorithm. In previous work we have confirmed that other PL techniques improve the perfomance of other MCO algorithms. We conclude that the many techniques pioneered in PL could be investigated as ways to improve MCO algorithms in general, and the CE method in particular.

David H. Wolpert | Dev G. Rajnarayan

[1] Wray L. Buntine,et al. Bayesian Back-Propagation , 1991, Complex Syst..

[2] Yuri Ermoliev,et al. Monte Carlo Optimization and Path Dependent Nonstationary Laws of Large Numbers , 1998 .

[3] Dana Angluin,et al. Computational learning theory: survey and selected bibliography , 1992, STOC '92.

[4] G. Lepage. A new algorithm for adaptive multidimensional integration , 1978 .

[5] Leo Breiman,et al. Stacked regressions , 2004, Machine Learning.

[6] Dirk P. Kroese,et al. The Cross-Entropy Method for Continuous Multi-Extremal Optimization , 2006 .

[7] V. Vapnik. Estimation of Dependences Based on Empirical Data , 2006 .

[8] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.

[9] David H. Wolpert,et al. On Bias Plus Variance , 1997, Neural Computation.

[10] J. Berger. Statistical Decision Theory and Bayesian Analysis , 1988 .

[11] Hoon Kim,et al. Monte Carlo Statistical Methods , 2000, Technometrics.

[12] Vladimir Vapnik,et al. The Nature of Statistical Learning , 1995 .

[13] David H. Wolpert,et al. Parametric Learning and Monte Carlo Optimization , 2007, ArXiv.

[14] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[15] David J. C. MacKay,et al. Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[16] H. Sebastian Seung,et al. Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.