Inference Machines for Nonparametric Filter Learning

Data-driven approaches for learning dynamic models for Bayesian filtering often try to maximize the data likelihood given parametric forms for the transition and observation models. However, this objective is usually nonconvex in the parametrization and can only be locally optimized. Furthermore, learning algorithms typically do not provide performance guarantees on the desired Bayesian filtering task. In this work, we propose using inference machines to directly optimize the filtering performance. Our procedure is capable of learning partially-observable systems when the state space is either unknown or known in advance. To accomplish this, we adapt PREDICTIVE STATE INFERENCE MACHINES (PSIMS) by introducing the concept of hints, which incorporate prior knowledge of the state space to accompany the predictive state representation. This allows PSIM to be applied to the larger class of filtering problems which require prediction of a specific parameter or partial component of state. Our PSIM+HINTS adaptation enjoys theoretical advantages similar to the original PSIM algorithm, and we showcase its performance on a variety of robotics filtering problems.

[1]  Jonathan D. Cryer,et al.  Time Series Analysis , 1986, Encyclopedia of Big Data.

[2]  M. V. Rossum,et al.  In Neural Computation , 2022 .

[3]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[4]  Zoubin Ghahramani,et al.  A Unifying Review of Linear Gaussian Models , 1999, Neural Computation.

[5]  Rudolph van der Merwe,et al.  The unscented Kalman filter for nonlinear estimation , 2000, Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No.00EX373).

[6]  Richard S. Sutton,et al.  Predictive Representations of State , 2001, NIPS.

[7]  Sebastian Thrun,et al.  Probabilistic robotics , 2002, CACM.

[8]  Martin Zinkevich,et al.  Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Pieter Abbeel,et al.  Learning first-order Markov models for control , 2004, NIPS.

[11]  Michael R. James,et al.  Predictive State Representations: A New Theory for Modeling Dynamical Systems , 2004, UAI.

[12]  Leonardo A. B. Tôrres,et al.  Using data-driven discrete-time models and the unscented Kalman filter to estimate unobserved variables of nonlinear systems. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Robert E. Mahony,et al.  Attitude estimation on SO[3] based on direct inertial measurements , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[14]  Benjamin Recht,et al.  Random Features for Large-Scale Kernel Machines , 2007, NIPS.

[15]  Dieter Fox,et al.  GP-BayesFilters: Bayesian filtering using Gaussian process prediction and observation models , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  John Langford,et al.  Learning nonlinear dynamic models , 2009, ICML '09.

[17]  Alexander J. Smola,et al.  Super-Samples from Kernel Herding , 2010, UAI.

[18]  Byron Boots,et al.  Closing the learning-planning loop with predictive state representations , 2009, Int. J. Robotics Res..

[19]  Byron Boots,et al.  Reduced-Rank Hidden Markov Models , 2009, AISTATS.

[20]  J. Bagnell,et al.  Learning Deep Inference Machines , 2010 .

[21]  Le Song,et al.  Hilbert Space Embeddings of Hidden Markov Models , 2010, ICML.

[22]  Byron Boots,et al.  Closing the learning-planning loop with predictive state representations , 2009, Int. J. Robotics Res..

[23]  Martial Hebert,et al.  Learning message-passing inference machines for structured prediction , 2011, CVPR 2011.

[24]  Geoffrey J. Gordon,et al.  A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[25]  Bart De Moor,et al.  Subspace Identification for Linear Systems: Theory ― Implementation ― Applications , 2011 .

[26]  Byron Boots,et al.  Spectral Approaches to Learning Predictive Representations , 2011 .

[27]  양정삼 [해외 대학 연구센터 소개] Carnegie Mellon University , 2012 .

[28]  Byron Boots,et al.  Hilbert Space Embeddings of Predictive State Representations , 2013, UAI.

[29]  M. Panella Associate Editor of the Journal of Computer and System Sciences , 2014 .

[30]  Dean Alderucci A SPECTRAL ALGORITHM FOR LEARNING HIDDEN MARKOV MODELS THAT HAVE SILENT STATES , 2015 .

[31]  Martial Hebert,et al.  Improving Multi-Step Prediction of Learned Time Series Models , 2015, AAAI.

[32]  Geoffrey J. Gordon,et al.  Supervised Learning for Dynamical System Learning , 2015, NIPS.

[33]  Byron Boots,et al.  Learning to Filter with Predictive State Inference Machines , 2015, ICML.

[34]  Le Song,et al.  The Nonparametric Kernel Bayes Smoother , 2016, AISTATS.