Stochastic Gradient MCMC for Nonlinear State Space Models

State space models (SSMs) provide a flexible framework for modeling complex time series via a latent stochastic process. Inference for nonlinear, non-Gaussian SSMs is often tackled with particle methods that do not scale well to long time series. The challenge is two-fold: not only do computations scale linearly with time, as in the linear case, but particle filters additionally suffer from increasing particle degeneracy with longer series. Stochastic gradient MCMC methods have been developed to scale inference for hidden Markov models (HMMs) and linear SSMs using buffered stochastic gradient estimates to account for temporal dependencies. We extend these stochastic gradient estimators to nonlinear SSMs using particle methods. We present error bounds that account for both buffering error and particle error in the case of nonlinear SSMs that are log-concave in the latent process. We evaluate our proposed particle buffered stochastic gradient using SGMCMC for inference on both long sequential synthetic and minute-resolution financial returns data, demonstrating the importance of this class of methods.

[1]  R. Langrock Some applications of nonlinear and non-Gaussian state–space modelling by means of hidden Markov models , 2011 .

[2]  Jun S. Liu,et al.  Sequential Monte Carlo methods for dynamic systems , 1997 .

[3]  Arnak S. Dalalyan,et al.  User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient , 2017, Stochastic Processes and their Applications.

[4]  Genshiro Kitagawa,et al.  Monte Carlo Smoothing and Self-Organising State-Space Model , 2001, Sequential Monte Carlo Methods in Practice.

[5]  Christopher Nemeth,et al.  Stochastic Gradient Markov Chain Monte Carlo , 2019, Journal of the American Statistical Association.

[6]  T. Bollerslev,et al.  Generalized autoregressive conditional heteroskedasticity , 1986 .

[7]  C. Villani Optimal Transport: Old and New , 2008 .

[8]  Sumeetpal S. Singh,et al.  Particle approximations of the score and observed information matrix in state space models with application to parameter estimation , 2011 .

[9]  Haikady N. Nagaraja,et al.  Inference in Hidden Markov Models , 2006, Technometrics.

[10]  Gregor Kastner,et al.  Dealing with Stochastic Volatility in Time Series Using the R Package stochvol , 2016, 1906.12134.

[11]  Scott W. Linderman,et al.  Variational Sequential Monte Carlo , 2017, AISTATS.

[12]  Arnaud Doucet,et al.  On Particle Methods for Parameter Estimation in State-Space Models , 2014, 1412.8695.

[13]  Paul Fearnhead,et al.  Particle Filters and Data Assimilation , 2017, 1709.04196.

[14]  Maria Colombo,et al.  Lipschitz Changes of Variables between Perturbations of Log-concave Measures , 2015, 1510.03687.

[15]  J. Wellner,et al.  Log-Concavity and Strong Log-Concavity: a review. , 2014, Statistics surveys.

[16]  Nicholas G. Polson,et al.  Tracking Epidemics With Google Flu Trends Data and a State-Space SEIR Model , 2012, Journal of the American Statistical Association.

[17]  Lester W. Mackey,et al.  Measuring Sample Quality with Kernels , 2017, ICML.

[18]  A. Doucet,et al.  A Tutorial on Particle Filtering and Smoothing: Fifteen years later , 2008 .

[19]  Yee Whye Teh,et al.  Bayesian Learning via Stochastic Gradient Langevin Dynamics , 2011, ICML.

[20]  Yee Whye Teh,et al.  Exploration of the (Non-)Asymptotic Bias and Variance of Stochastic Gradient Langevin Dynamics , 2016, J. Mach. Learn. Res..

[21]  Yee Whye Teh,et al.  Filtering Variational Objectives , 2017, NIPS.

[22]  Emily B. Fox,et al.  Stochastic Gradient MCMC for State Space Models , 2018, SIAM J. Math. Data Sci..

[23]  David B. Dunson,et al.  Clustering-Enhanced Stochastic Gradient MCMC for Hidden Markov Models with Rare States , 2018, ArXiv.

[24]  Johan Dahlin,et al.  Particle Metropolis–Hastings using gradient and Hessian information , 2013, Statistics and Computing.

[25]  G. Kitagawa Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models , 1996 .

[26]  A. Doucet,et al.  Particle Markov chain Monte Carlo methods , 2010 .

[27]  Christopher Nemeth,et al.  Control variates for stochastic gradient MCMC , 2017, Statistics and Computing.

[28]  N. Gordon,et al.  Novel approach to nonlinear/non-Gaussian Bayesian state estimation , 1993 .

[29]  M. Girolami,et al.  Riemann manifold Langevin and Hamiltonian Monte Carlo methods , 2011, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[30]  Tianqi Chen,et al.  A Complete Recipe for Stochastic Gradient MCMC , 2015, NIPS.

[31]  Mark W. Schmidt,et al.  Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields , 2015, AISTATS.

[32]  P. Fearnhead,et al.  Particle Approximations of the Score and Observed Information Matrix for Parameter Estimation in State–Space Models With Linear Computational Cost , 2013, 1306.0735.

[33]  H. Robbins A Stochastic Approximation Method , 1951 .

[34]  Qiang Liu,et al.  A Kernelized Stein Discrepancy for Goodness-of-fit Tests , 2016, ICML.

[35]  Ryan Babbush,et al.  Bayesian Sampling Using Stochastic Gradient Thermostats , 2014, NIPS.

[36]  Leonard Hasenclever,et al.  The True Cost of Stochastic Gradient Langevin Dynamics , 2017, 1706.02692.

[37]  Tianqi Chen,et al.  Stochastic Gradient Hamiltonian Monte Carlo , 2014, ICML.

[38]  Yee Whye Teh,et al.  Consistency and Fluctuations For Stochastic Gradient Langevin Dynamics , 2014, J. Mach. Learn. Res..

[39]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[40]  Yee Whye Teh,et al.  Stochastic Gradient Riemannian Langevin Dynamics on the Probability Simplex , 2013, NIPS.

[41]  Lawrence Carin,et al.  On the Convergence of Stochastic Gradient MCMC Algorithms with High-Order Integrators , 2015, NIPS.

[42]  J. Olsson,et al.  Efficient particle-based online smoothing in general hidden Markov models : the PaRIS algorithm , 2014 .

[43]  N. Shephard Stochastic Volatility: Selected Readings , 2005 .

[44]  Michael I. Jordan,et al.  On the Theory of Variance Reduction for Stochastic Gradient Monte Carlo , 2018, ICML.

[45]  Emily B. Fox,et al.  Stochastic Gradient MCMC Methods for Hidden Markov Models , 2017, ICML.