Fast Exact Planning in Markov Decision Processes
暂无分享,去创建一个
[1] I. Duff,et al. Direct Methods for Sparse Matrices , 1987 .
[2] William H. Press,et al. Numerical Recipes in C, 2nd Edition , 1992 .
[3] William H. Press,et al. The Art of Scientific Computing Second Edition , 1998 .
[4] C. Atkeson,et al. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time , 1993, Machine Learning.
[5] Richard Barrett,et al. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods , 1994, Other Titles in Applied Mathematics.
[6] Thomas G. Dietterich,et al. Explanation-Based Learning and Reinforcement Learning: A Unified View , 1995, Machine-mediated learning.
[7] Thomas Dean,et al. Decomposition Techniques for Planning in Stochastic Domains , 1995, IJCAI.
[8] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[9] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..
[10] R. K. Shyamasundar,et al. Introduction to algorithms , 1996 .
[11] David Andre,et al. Generalized Prioritized Sweeping , 1997, NIPS.
[12] Marco Wiering,et al. Explorations in efficient reinforcement learning , 1999 .
[13] Shlomo Zilberstein,et al. LAO*: A heuristic search algorithm that finds solutions with loops , 2001, Artif. Intell..
[14] Anshul Gupta,et al. Recent advances in direct methods for solving unsymmetric sparse systems of linear equations , 2002, TOMS.
[15] William H. Press,et al. Numerical recipes in C , 2002 .
[16] Blai Bonet,et al. Faster Heuristic Search Algorithms for Planning with Uncertainty and Full Feedback , 2003, IJCAI.
[17] Blai Bonet,et al. Labeled RTDP: Improving the Convergence of Real-Time Dynamic Programming , 2003, ICAPS.
[18] Andrew W. Moore,et al. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time , 1993, Machine Learning.
[19] Geoffrey J. Gordon,et al. Generalizing Dijkstra's Algorithm and Gaussian Elimination for Solving MDPs , 2005 .