论文信息 - Neural Adaptive Sequential Monte Carlo

Neural Adaptive Sequential Monte Carlo

Sequential Monte Carlo (SMC), or particle filtering, is a popular class of methods for sampling from an intractable target distribution using a sequence of simpler intermediate distributions. Like other importance sampling-based methods, performance is critically dependent on the proposal distribution: a bad proposal can lead to arbitrarily inaccurate estimates of the target distribution. This paper presents a new method for automatically adapting the proposal using an approximation of the Kullback-Leibler divergence between the true posterior and the proposal distribution. The method is very flexible, applicable to any parameterized proposal distribution and it supports online and batch variants. We use the new framework to adapt powerful proposal distributions with rich parameterizations based upon neural networks leading to Neural Adaptive Sequential Monte Carlo (NASMC). Experiments indicate that NASMC significantly improves inference in a non-linear state space model outperforming adaptive proposal methods including the Extended Kalman and Unscented Particle Filters. Experiments also indicate that improved inference translates into improved parameter learning when NASMC is used as a subroutine of Particle Marginal Metropolis Hastings. Finally we show that NASMC is able to train a latent variable recurrent neural network (LV-RNN) achieving results that compete with the state-of-the-art for polymorphic music modelling. NASMC can be seen as bridging the gap between adaptive SMC methods and the recent work in scalable, black-box variational inference.

Richard E. Turner | Zoubin Ghahramani | Shixiang Gu | S. Gu | Zoubin Ghahramani

[1] N. Gordon,et al. Novel approach to nonlinear/non-Gaussian Bayesian state estimation , 1993 .

[2] S. Srihari. Mixture Density Networks , 1994 .

[3] Geoffrey E. Hinton,et al. The "wake-sleep" algorithm for unsupervised neural networks. , 1995, Science.

[4] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[5] Nando de Freitas,et al. The Unscented Particle Filter , 2000, NIPS.

[6] Tom Minka,et al. Expectation Propagation for approximate Bayesian inference , 2001, UAI.

[7] Nando de Freitas,et al. Sequential Monte Carlo Methods in Practice , 2001, Statistics for Engineering and Information Science.

[8] David J. C. MacKay,et al. Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[9] Yuhong Yang,et al. Information Theory, Inference, and Learning Algorithms , 2005 .

[10] P. Moral,et al. On Adaptive Sequential Monte Carlo Methods , 2008 .

[11] Alex Graves,et al. Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.

[12] Razvan Pascanu,et al. Theano: A CPU and GPU Math Compiler in Python , 2010, SciPy.

[13] A. Doucet,et al. Particle Markov chain Monte Carlo methods , 2010 .

[14] Richard E. Turner,et al. Two problems with variational expectation maximisation for time-series models , 2011 .

[15] Sumeetpal S. Singh,et al. Particle approximations of the score and observed information matrix in state space models with application to parameter estimation , 2011 .

[16] Yoshua Bengio,et al. Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription , 2012, ICML.

[17] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[18] Razvan Pascanu,et al. Advances in optimizing recurrent networks , 2012, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19] Karol Gregor,et al. Neural Variational Inference and Learning in Belief Networks , 2014, ICML.

[20] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[21] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[22] Christian Osendorfer,et al. Learning Stochastic Recurrent Networks , 2014, NIPS 2014.

[23] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[24] Carl E. Rasmussen,et al. Variational Gaussian Process State-Space Models , 2014, NIPS.

[25] Yoshua Bengio,et al. Reweighted Wake-Sleep , 2014, ICLR.

[26] Alex Graves,et al. DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.

[27] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.