Solving Markov Decision Processes with Reachability Characterization from Mean First Passage Times
[1] Ronald A. Howard, et al. Dynamic Programming and Markov Processes, 1960.
[2] J. Shanthikumar, et al. First-passage times with PF_r densities, 1985, Journal of Applied Probability.
[3] Dimitri P. Bertsekas, et al. Dynamic Programming: Deterministic and Stochastic Models, 1987.
[4] Richard S. Sutton, et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming, 1990, ML.
[5] D. J. White, et al. A Survey of Applications of Markov Decision Processes, 1993.
[6] Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.
[7] Peter Norvig, et al. Artificial Intelligence: A Modern Approach, 1995.
[8] Andrew G. Barto, et al. Learning to Act Using Real-Time Dynamic Programming, 1995, Artif. Intell.
[9] Gene H. Golub, et al. Matrix computations (3rd ed.), 1996.
[10] David Andre, et al. Generalized Prioritized Sweeping, 1997, NIPS.
[11] Craig Boutilier, et al. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage, 1999, J. Artif. Intell. Res.
[12] Craig Boutilier, et al. Stochastic dynamic programming with factored representations, 2000, Artif. Intell.
[13] David Andre, et al. State abstraction for programmable reinforcement learning agents, 2002, AAAI/IAAI.
[14] Blai Bonet, et al. Labeled RTDP: Improving the Convergence of Real-Time Dynamic Programming, 2003, ICAPS.
[15] Andrew W. Moore, et al. Prioritized sweeping: Reinforcement learning with less data and less time, 2004, Machine Learning.
[16] Kevin D. Seppi, et al. Prioritization Methods for Accelerating MDP Solvers, 2005, J. Mach. Learn. Res.
[17] Thomas J. Walsh, et al. Towards a Unified Theory of State Abstraction for MDPs, 2006, AI&M.
[18] Groupe Pdmia. Markov Decision Processes In Artificial Intelligence, 2009.
[19] Bart De Schutter, et al. Reinforcement Learning and Dynamic Programming Using Function Approximators, 2010.
[20] Marco Wiering, et al. Reinforcement Learning and Markov Decision Processes, 2012, Reinforcement Learning.