Exact solutions of interactive POMDPs using behavioral equivalence
暂无分享,去创建一个
Prashant Doshi | Piotr J. Gmytrasiewicz | Bharaneedharan Rathnasabapathy | P. Gmytrasiewicz | Prashant Doshi | Bharaneedharan Rathnasabapathy
[1] Victor R. Lesser,et al. Decentralized Markov decision processes with event-driven interactions , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..
[2] Shlomo Zilberstein,et al. Bounded Policy Iteration for Decentralized POMDPs , 2005, IJCAI.
[3] Shlomo Zilberstein,et al. Dynamic Programming for Partially Observable Stochastic Games , 2004, AAAI.
[4] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[5] Milind Tambe,et al. Multiagent teamwork: analyzing the optimality and complexity of key theories and models , 2002, AAMAS '02.
[6] J. Harsanyi. Games with Incomplete Information Played by 'Bayesian' Players, Part III. The Basic Probability Distribution of the Game , 1968 .
[7] Prashant Doshi,et al. Approximating state estimation in multiagent settings using particle filters , 2005, AAMAS '05.
[8] Craig Boutilier,et al. Value-Directed Compression of POMDPs , 2002, NIPS.
[9] Neil Immerman,et al. The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.
[10] Robert Givan,et al. Equivalence notions and model minimization in Markov decision processes , 2003, Artif. Intell..
[11] Michael L. Littman,et al. Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes , 1997, UAI.
[12] Prashant Doshi,et al. A Particle Filtering Based Approach to Approximating Interactive POMDPs , 2005, AAAI.
[13] J. M. Porta,et al. Value iteration for continuous-state POMDPs , 2004 .
[14] E. J. Sondik,et al. The Optimal Control of Partially Observable Markov Decision Processes. , 1971 .
[15] Sebastian Thrun,et al. Monte Carlo POMDPs , 1999, NIPS.
[16] Amos Storkey,et al. Advances in Neural Information Processing Systems 20 , 2007 .
[17] Makoto Yokoo,et al. Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings , 2003, IJCAI.
[18] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..
[19] P. J. Gmytrasiewicz,et al. A Framework for Sequential Planning in Multi-Agent Settings , 2005, AI&M.