Paper information - Inference Machines for Nonparametric Filter Learning

Inference Machines for Nonparametric Filter Learning

Data-driven approaches for learning dynamic models for Bayesian filtering often try to maximize the data likelihood given parametric forms for the transition and observation models. However, this objective is usually nonconvex in the parametrization and can only be locally optimized. Furthermore, such learning algorithms typically provide no performance guarantees on the desired Bayesian filtering task. In this work, we propose using inference machines to directly optimize the filtering performance. Our procedure can learn partially observable systems whether the state space is unknown or known in advance. To accomplish this, we adapt Predictive State Inference Machines (PSIMs) by introducing the concept of hints, which incorporate prior knowledge of the state space to accompany the predictive state representation. This allows PSIM to be applied to the larger class of filtering problems that require prediction of a specific parameter or partial component of the state. Our PSIM+Hints adaptation enjoys theoretical advantages similar to those of the original PSIM algorithm, and we showcase its performance on a variety of robotics filtering problems.
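To make the idea of learning the filter directly more concrete, the following is a minimal sketch loosely in the spirit of an inference machine with hints. It is not the authors' algorithm. Everything in it is an assumption made for illustration: the toy linear-Gaussian system, the choice of the next `k` observations as predictive-state features, the use of the true state as the hint, the linear ridge regressor (a stand-in for a nonparametric learner), and all function names. The filter is a single regressor that maps the previous message and the current observation to the next message, and it is retrained on data aggregated from its own rollouts.

```python
import numpy as np

def simulate(n_traj, T, rng):
    """Toy 2-D linear-Gaussian system with a noisy 1-D observation (assumed)."""
    A = np.array([[0.95, 0.10], [-0.10, 0.95]])
    C = np.array([[1.0, 0.0]])
    states = np.zeros((n_traj, T, 2))
    obs = np.zeros((n_traj, T, 1))
    x = rng.normal(size=(n_traj, 2))
    for t in range(T):
        states[:, t] = x
        obs[:, t] = x @ C.T + 0.1 * rng.normal(size=(n_traj, 1))
        x = x @ A.T + 0.05 * rng.normal(size=(n_traj, 2))
    return states, obs

def make_targets(states, obs, k):
    """Regression target at time t: next k observations, then the hint (state)."""
    Tv = obs.shape[1] - k  # last time step that still has k future observations
    future = np.concatenate([obs[:, j + 1:j + 1 + Tv] for j in range(k)], axis=-1)
    return np.concatenate([future, states[:, :Tv]], axis=-1)

def features(prev_msg, y):
    """Filter input: previous message, current observation, bias term."""
    return np.hstack([prev_msg, y, np.ones((len(prev_msg), 1))])

def fit_ridge(X, Y, lam):
    """Closed-form ridge regression."""
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ Y)

def rollout(W, obs, d):
    """Run the learned filter forward, feeding back its own messages."""
    n, T, _ = obs.shape
    msgs = np.zeros((n, T, d))
    prev = np.zeros((n, d))  # initial message before any observation
    for t in range(T):
        prev = features(prev, obs[:, t]) @ W
        msgs[:, t] = prev
    return msgs

def train(states, obs, k=2, n_iters=5, lam=1.0):
    """Data-aggregation loop: refit on inputs produced by earlier filters."""
    targets = make_targets(states, obs, k)
    n, Tv, d = targets.shape
    obs = obs[:, :Tv]
    Y = targets.reshape(-1, d)
    msgs = np.zeros((n, Tv, d))  # iteration 0: a filter that always outputs zero
    X_agg, Y_agg = [], []
    W = None
    for _ in range(n_iters):
        # Message available at time t is the one produced at t-1 (zero at t=0).
        prev = np.concatenate([np.zeros((n, 1, d)), msgs[:, :-1]], axis=1)
        X_agg.append(features(prev.reshape(-1, d), obs.reshape(-1, obs.shape[-1])))
        Y_agg.append(Y)
        W = fit_ridge(np.vstack(X_agg), np.vstack(Y_agg), lam)
        msgs = rollout(W, obs, d)
    return W

rng = np.random.default_rng(0)
train_states, train_obs = simulate(200, 60, rng)
test_states, test_obs = simulate(50, 60, rng)

k = 2
W = train(train_states, train_obs, k=k)
d = k + train_states.shape[-1]
est = rollout(W, test_obs, d)[:, :, k:]  # hint block = state estimate

mse_filter = np.mean((est - test_states) ** 2)
mse_zero = np.mean(test_states ** 2)  # baseline: always predict zero
print("filter MSE:", mse_filter, "zero-baseline MSE:", mse_zero)
```

In this sketch the hint block of the message is read off directly as the state estimate, which is one way to interpret predicting "a specific parameter or partial component of the state". Aggregating data across iterations is a design choice made here because the filter consumes its own outputs at test time, so it should be trained on inputs of that kind.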
