Adaptive Antithetic Sampling for Variance Reduction

Variance reduction is crucial in stochastic estimation and optimization problems. Antithetic sampling reduces the variance of a Monte Carlo estimator by drawing correlated, rather than independent, samples. However, designing an effective correlation structure is challenging and application-specific, which limits the practical use of these methods. In this paper, we propose a general-purpose adaptive antithetic sampling framework. We provide gradient-based and gradient-free methods to train the samplers so that they reduce variance while keeping the underlying Monte Carlo estimator provably unbiased. We demonstrate the effectiveness of our approach on Bayesian inference and generative model training, where it reduces variance and improves task performance with little computational overhead.
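To illustrate the basic idea behind antithetic sampling (not the adaptive samplers proposed in this paper), the classical construction pairs each uniform draw u with its reflection 1 - u; for a monotone integrand the two evaluations are negatively correlated, so their average has lower variance than two independent draws. A minimal sketch, using E[exp(U)] = e - 1 for U ~ Uniform(0, 1) as the target:

```python
import math
import random

def mc_estimate(f, n, rng):
    """Plain Monte Carlo: average f over n independent uniforms."""
    return sum(f(rng.random()) for _ in range(n)) / n

def antithetic_estimate(f, n, rng):
    """Antithetic Monte Carlo: pair each uniform u with 1 - u.

    For monotone f, f(u) and f(1 - u) are negatively correlated, so
    their average has lower variance than two independent evaluations,
    while the estimator stays unbiased (each marginal is still uniform).
    """
    pairs = n // 2
    total = 0.0
    for _ in range(pairs):
        u = rng.random()
        total += 0.5 * (f(u) + f(1.0 - u))
    return total / pairs

def empirical_variance(estimator, f, n, trials, seed=0):
    """Variance of the estimator across repeated runs with n samples each."""
    rng = random.Random(seed)
    estimates = [estimator(f, n, rng) for _ in range(trials)]
    mean = sum(estimates) / trials
    return sum((e - mean) ** 2 for e in estimates) / trials

f = math.exp  # E[exp(U)] = e - 1 ≈ 1.71828
var_plain = empirical_variance(mc_estimate, f, 200, 500)
var_anti = empirical_variance(antithetic_estimate, f, 200, 500)
print(var_plain, var_anti)  # antithetic variance is far smaller
```

The reflection u → 1 - u is effective here only because exp is monotone; for general integrands the right correlation structure is not obvious, which is the gap the adaptive framework in this paper targets.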
