Learning and discovery of predictive state representations in dynamical systems with reset
暂无分享,去创建一个
[1] Ronald L. Rivest,et al. Diversity-based inference of finite automata , 1994, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).
[2] W. Lovejoy. A survey of algorithmic methods for partially observed Markov decision processes , 1991 .
[3] Michael L. Littman,et al. Algorithms for Sequential Decision Making , 1996 .
[4] Leslie Pack Kaelbling,et al. Learning Topological Maps with Weak Local Odometric Information , 1997, IJCAI.
[5] Observable Operator Processes and Conditioned Continuation Representations 1 , 1997 .
[6] Herbert Jaeger,et al. Observable Operator Models for Discrete Stochastic Time Series , 2000, Neural Computation.
[7] Richard S. Sutton,et al. Predictive Representations of State , 2001, NIPS.
[8] Neil D. Lawrence,et al. Advances in Neural Information Processing Systems 14 , 2002 .
[9] H. Jaeger. Discrete-time, discrete-valued observable operator models: a tutorial , 2003 .
[10] Peter Stone,et al. Learning Predictive State Representations , 2003, ICML.
[11] Michael R. James,et al. Predictive State Representations: A New Theory for Modeling Dynamical Systems , 2004, UAI.