A New View of Predictive State Methods for Dynamical System Learning

Recently there has been substantial interest in predictive state methods for learning dynamical systems: these algorithms are popular since they often offer a good tradeoff between computational speed and statistical efficiency. Despite their desirable properties, though, predictive state methods can sometimes be difficult to use in practice. E.g., in contrast to the rich literature on supervised learning methods, which allows us to choose from an extensive menu of models and algorithms to suit the prior beliefs we have about properties of the function to be learned, predictive state dynamical system learning methods are comparatively inflexible: it is as if we were restricted to use only linear regression instead of being allowed to choose decision trees, nonparametric regression, or the lasso. To address this problem, we propose a new view of predictive state methods in terms of instrumentalvariable regression. This view allows us to construct a wide variety of dynamical system learners simply by swapping in different supervised learning methods. We demonstrate the effectiveness of our proposed methods by experimenting with non-linear regression to learn a hidden Markov model, showing that the resulting algorithm outperforms its linear counterpart; the correctness of this algorithm follows directly from our general analysis.

[1]  A. Izenman Reduced-rank regression for the multivariate linear model , 1975 .

[2]  H. Theil Introduction to econometrics , 1978 .

[3]  Shien-Ming Wu,et al.  Time series and system analysis with applications , 1983 .

[4]  Bagrat Amirikian,et al.  Directional tuning profiles of motor cortical cells , 2000, Neuroscience Research.

[5]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[6]  Herbert Jaeger,et al.  Observable Operator Models for Discrete Stochastic Time Series , 2000, Neural Computation.

[7]  Dawn M. Taylor,et al.  Direct Cortical Control of 3D Neuroprosthetic Devices , 2002, Science.

[8]  John R. Anderson,et al.  Knowledge tracing: Modeling the acquisition of procedural knowledge , 2005, User Modeling and User-Adapted Interaction.

[9]  John P. Cunningham,et al.  Gaussian-process factor analysis for low-dimensional single-trial analysis of neural population activity , 2008, NIPS.

[10]  Amir F. Atiya,et al.  Cerberus: Applying Supervised and Reinforcement Learning Techniques to Capture the Flag Games , 2008, AIIDE.

[11]  Alexander J. Smola,et al.  Hilbert space embeddings of conditional distributions with applications to dynamical systems , 2009, ICML '09.

[12]  R. Gentner,et al.  A novel virtual reality based finger movement training system to investigate mechanisms of training induced plasticity , 2009 .

[13]  John Langford,et al.  Learning nonlinear dynamic models , 2009, ICML '09.

[14]  Byron Boots,et al.  Closing the learning-planning loop with predictive state representations , 2009, Int. J. Robotics Res..

[15]  Byron Boots,et al.  Reduced-Rank Hidden Markov Models , 2009, AISTATS.

[16]  Le Song,et al.  Hilbert Space Embeddings of Hidden Markov Models , 2010, ICML.

[17]  Kenneth R. Koedinger,et al.  A Data Repository for the EDM Community: The PSLC DataShop , 2010 .

[18]  Amir F. Atiya,et al.  A New Monte Carlo-Based Error Rate Estimator , 2010, ANNPR.

[19]  Bart De Moor,et al.  Subspace Identification for Linear Systems: Theory ― Implementation ― Applications , 2011 .

[20]  Kareem Darwish,et al.  Is a Query Worth Translating: Ask the Users! , 2011, ECIR.

[21]  Hany Hassan,et al.  Incremental Combinatory Categorial Grammar and Its Derivations , 2011, CICLing.

[22]  Byron Boots,et al.  An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems , 2011, AAAI.

[23]  Daniel J. Hsu,et al.  Tail inequalities for sums of random matrices that depend on the intrinsic dimension , 2012 .

[24]  Byron Boots,et al.  Spectral Approaches to Learning Predictive Representations , 2011 .

[25]  J. Tropp User-Friendly Tools for Random Matrices: An Introduction , 2012 .

[26]  Sham M. Kakade,et al.  Random Design Analysis of Ridge Regression , 2012, COLT.

[27]  Byron Boots,et al.  Two Manifold Problems with Applications to Nonlinear System Identification , 2012, ICML.

[28]  Eric P. Xing,et al.  A Nonparametric Mixture Model for Topic Modeling over Time , 2012, SDM.

[29]  J. Wooldridge Introduction to Econometrics , 2013 .

[30]  Byron Boots,et al.  Hilbert Space Embeddings of Predictive State Representations , 2013, UAI.

[31]  Le Song,et al.  Kernel Bayes' rule: Bayesian inference with positive definite kernels , 2013, J. Mach. Learn. Res..

[32]  Dean Alderucci A SPECTRAL ALGORITHM FOR LEARNING HIDDEN MARKOV MODELS THAT HAVE SILENT STATES , 2015 .