Symmetric Primal-Dual Approximate Linear Programming for Factored MDPs
暂无分享,去创建一个
[1] D. Koller,et al. Planning under uncertainty in complex structured environments , 2003 .
[2] Daphne Koller,et al. Computing Factored Value Functions for Policies in Structured MDPs , 1999, IJCAI.
[3] P. Schweitzer,et al. Generalized polynomial approximations in Markovian decision processes , 1985 .
[4] Benjamin Van Roy,et al. On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming , 2004, Math. Oper. Res..
[5] Edmund H. Durfee,et al. Graphical models in local, asymmetric multi-agent Markov decision processes , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..
[6] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[7] Yuval Rabani,et al. Linear Programming , 2007, Handbook of Approximation Algorithms and Metaheuristics.
[8] Edmund H. Durfee,et al. Optimal Resource Allocation and Policy Formulation in Loosely-Coupled Markov Decision Processes , 2004, ICAPS.
[9] Keiji Kanazawa,et al. A model for reasoning about persistence and causation , 1989 .
[10] Shobha Venkataraman,et al. Efficient Solution Algorithms for Factored MDPs , 2003, J. Artif. Intell. Res..
[11] Craig Boutilier,et al. Exploiting Structure in Policy Construction , 1995, IJCAI.
[12] John S. Edwards,et al. Linear Programming and Finite Markovian Control Problems , 1983 .
[13] Benjamin Van Roy,et al. The Linear Programming Approach to Approximate Dynamic Programming , 2003, Oper. Res..
[14] Richard Bellman,et al. Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.
[15] E. Altman. Constrained Markov Decision Processes , 1999 .
[16] Craig Boutilier,et al. Greedy linear value-approximation for factored Markov decision processes , 2002, AAAI/IAAI.
[17] Craig Boutilier,et al. Piecewise linear value function approximation for factored MDPs , 2002, AAAI/IAAI.
[18] Umberto Bertelè,et al. Nonserial Dynamic Programming , 1972 .