Point-Based Policy Iteration
暂无分享,去创建一个
Hui Li | Lawrence Carin | Shihao Ji | Ronald Parr | Xuejun Liao | Ronald E. Parr | L. Carin | X. Liao | Hui Li | Shihao Ji
[1] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..
[2] M. Puterman,et al. Modified Policy Iteration Algorithms for Discounted Markov Decision Problems , 1978 .
[3] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs , 1978, Oper. Res..
[4] Michael L. Littman,et al. Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes , 1997, UAI.
[5] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[6] Eric A. Hansen,et al. Solving POMDPs by Searching in Policy Space , 1998, UAI.
[7] Kee-Eung Kim,et al. Solving POMDPs by Searching the Space of Finite Policies , 1999, UAI.
[8] Joelle Pineau,et al. Point-based value iteration: An anytime algorithm for POMDPs , 2003, IJCAI.
[9] Craig Boutilier,et al. Bounded Finite State Controllers , 2003, NIPS.
[10] Reid G. Simmons,et al. Heuristic Search Value Iteration for POMDPs , 2004, UAI.
[11] Nikos A. Vlassis,et al. Perseus: Randomized Point-based Value Iteration for POMDPs , 2005, J. Artif. Intell. Res..
[12] P. Poupart. Exploiting structure to efficiently solve large scale partially observable Markov decision processes , 2005 .
[13] Reid G. Simmons,et al. Point-Based POMDP Algorithms: Improved Analysis and Implementation , 2005, UAI.