MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES
暂无分享,去创建一个
Michael I. Jordan | Tommi S. Jaakkola | Matthew J. Beal | Satinder P. Singh | Satinder Singh | T. Jaakkola | Zoubin Ghahramani | C. Rasmussen
[1] F. Downton. Stochastic Approximation , 1969, Nature.
[2] M. T. Wasan. Stochastic Approximation , 1969 .
[3] Peter W. Glynn,et al. Optimization of stochastic systems , 1986, WSC '86.
[4] Dimitri P. Bertsekas,et al. Dynamic Programming: Deterministic and Stochastic Models , 1987 .
[5] Richard S. Sutton,et al. Sequential Decision Problems and Neural Networks , 1989, NIPS 1989.
[6] John N. Tsitsiklis,et al. Parallel and distributed computation , 1989 .
[7] John N. Tsitsiklis,et al. Parallel and distributed computation , 1989 .
[8] P. Dayan,et al. TD ( X ) Converges with Probability 1 , 1994 .
[9] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[10] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..