Labeled RTDP: Improving the Convergence of Real-Time Dynamic Programming