Partially Observable Markov Decision Processes for Artificial Intelligence
暂无分享,去创建一个
Leslie Pack Kaelbling | Michael L. Littman | Anthony R. Cassandra | M. Littman | A. Cassandra | L. Kaelbling
[1] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..
[2] Leslie Pack Kaelbling,et al. Planning under Time Constraints in Stochastic Domains , 1993, Artif. Intell..
[3] Chelsea C. White,et al. A survey of solution techniques for the partially observed Markov decision process , 1991, Ann. Oper. Res..
[4] Leslie Pack Kaelbling,et al. Learning Policies for Partially Observable Environments: Scaling Up , 1997, ICML.
[5] P. Tseng. Solving H-horizon, stationary Markov decision problems in time proportional to log(H) , 1990 .
[6] Karl Johan Åström,et al. Optimal control of Markov processes with incomplete state information , 1965 .
[7] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs , 1978, Oper. Res..
[8] W. Lovejoy. A survey of algorithmic methods for partially observed Markov decision processes , 1991 .
[9] G. Monahan. State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms , 1982 .
[10] M. Littman. The Witness Algorithm: Solving Partially Observable Markov Decision Processes , 1994 .
[11] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .