论文信息 - Predictive linear-Gaussian models of controlled stochastic dynamical systems

Predictive linear-Gaussian models of controlled stochastic dynamical systems

We introduce the controlled predictive linear-Gaussian model (cPLG), a model that uses predictive state to model discrete-time dynamical systems with real-valued observations and vector-valued actions. This extends the PLG, an uncontrolled model recently introduced by Rudary et al. (2005). We show that the cPLG subsumes controlled linear dynamical systems (LDS, also called Kalman filter models) of equal dimension, but requires fewer parameters. We also introduce the predictive linear-quadratic Gaussian problem, a cost-minimization problem based on the cPLG that we show is equivalent to linear-quadratic Gaussian problems (LQG, sometimes called LQR). We present an algorithm to estimate cPLG parameters from data, and show that our algorithm is a consistent estimation procedure. Finally, we present empirical results suggesting that our algorithm performs favorably compared to expectation maximization on controlled LDS models.

Satinder P. Singh | Matthew R. Rudary | Satinder Singh

[1] Walter L. Smith. Probability and Statistics , 1959, Nature.

[2] Lennart Ljung,et al. System Identification: Theory for the User , 1987 .

[3] D. Catlin. Estimation, Control, and the Discrete Kalman Filter , 1988 .

[4] Geoffrey E. Hinton,et al. Parameter estimation for linear dynamical systems , 1996 .

[5] H. Jaeger,et al. Observable operator models II: Interpretable models and model induction , 1997 .

[6] J. L. Roux. An Introduction to the Kalman Filter , 2003 .

[7] Michael R. James,et al. Predictive State Representations: A New Theory for Modeling Dynamical Systems , 2004, UAI.

[8] Matthew R. Rudary,et al. Predictive Linear-Gaussian Models of Stochastic Dynamical Systems , 2005, UAI.

[9] Michael R. James,et al. Learning predictive state representations in dynamical systems without reset , 2005, ICML.