Max-norm Projections for Factored MDPs
暂无分享,去创建一个
[1] Sandra L. Berger. Massachusetts , 1896, The Journal of comparative medicine and veterinary archives.
[2] E. Stiefel. Note on Jordan elimination, linear programming and Tchebycheff approximation , 1960 .
[3] R. Bellman,et al. Polynomial approximation—a new computational technique in dynamic programming: Allocation processes , 1963 .
[4] Herbert A. Simon,et al. The Sciences of the Artificial , 1970 .
[5] Umberto Bertelè,et al. Nonserial Dynamic Programming , 1972 .
[6] G. Alexits. Approximation theory , 1983 .
[7] Keiji Kanazawa,et al. A model for reasoning about persistence and causation , 1989 .
[8] Ronald J. Williams,et al. Tight Performance Bounds on Greedy Policies Based on Imperfect Value Functions , 1993 .
[9] Frank Jensen,et al. From Influence Diagrams to junction Trees , 1994, UAI.
[10] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[11] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .
[12] Craig Boutilier,et al. Abstraction and Approximate Decision-Theoretic Planning , 1997, Artif. Intell..
[13] Rina Dechter,et al. Bucket Elimination: A Unifying Framework for Reasoning , 1999, Artif. Intell..
[14] Daphne Koller,et al. Computing Factored Value Functions for Policies in Structured MDPs , 1999, IJCAI.
[15] Craig Boutilier,et al. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage , 1999, J. Artif. Intell. Res..
[16] Daphne Koller,et al. Policy Iteration for Factored MDPs , 2000, UAI.