Temporal-difference methods and Markov models
暂无分享,去创建一个
[1] P. Billingsley,et al. Statistical Methods in Markov Chains , 1961 .
[2] A G Barto,et al. Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.
[3] Fernando J. Pineda,et al. Dynamics and architecture for neural computation , 1988, J. Complex..
[4] D. Ballard,et al. A Role for Anticipation in Reactive Systems that Learn , 1989, ML.
[5] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.
[6] Gerald Tesauro,et al. Practical Issues in Temporal Difference Learning , 1992, Mach. Learn..