论文信息 - Learning and discovery of predictive state representations in dynamical systems with reset

Learning and discovery of predictive state representations in dynamical systems with reset

Predictive state representations (PSRs) are a recently proposed way of modeling controlled dynamical systems. PSR-based models use predictions of observable outcomes of tests that could be done on the system as their state representation, and have model parameters that define how the predictive state representation changes over time as actions are taken and observations noted. Learning PSR-based models requires solving two subproblems: 1) discovery of the tests whose predictions constitute state, and 2) learning the model parameters that define the dynamics. So far, there have been no results available on the discovery subproblem while for the learning subproblem an approximate-gradient algorithm has been proposed (Singh et al., 2003) with mixed results (it works on some domains and not on others). In this paper, we provide the first discovery algorithm and a new learning algorithm for linear PSRs for the special class of controlled dynamical systems that have a reset operation. We provide experimental verification of our algorithms. Finally, we also distinguish our work from prior work by Jaeger (2000) on observable operator models (OOMs).

Michael R. James | Satinder P. Singh | Satinder Singh

[1] Ronald L. Rivest,et al. Diversity-based inference of finite automata , 1994, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[2] W. Lovejoy. A survey of algorithmic methods for partially observed Markov decision processes , 1991 .

[3] Michael L. Littman,et al. Algorithms for Sequential Decision Making , 1996 .

[4] Leslie Pack Kaelbling,et al. Learning Topological Maps with Weak Local Odometric Information , 1997, IJCAI.

[5] Observable Operator Processes and Conditioned Continuation Representations 1 , 1997 .

[6] Herbert Jaeger,et al. Observable Operator Models for Discrete Stochastic Time Series , 2000, Neural Computation.

[7] Richard S. Sutton,et al. Predictive Representations of State , 2001, NIPS.

[8] Neil D. Lawrence,et al. Advances in Neural Information Processing Systems 14 , 2002 .

[9] H. Jaeger. Discrete-time, discrete-valued observable operator models: a tutorial , 2003 .

[10] Peter Stone,et al. Learning Predictive State Representations , 2003, ICML.

[11] Michael R. James,et al. Predictive State Representations: A New Theory for Modeling Dynamical Systems , 2004, UAI.