Reinforcement Learning in Markovian and Non-Markovian Environments
[1] Charles W. Anderson, et al. Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning), 1986.
[2] Paul J. Werbos, et al. Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research, 1987, IEEE Transactions on Systems, Man, and Cybernetics.
[3] Michael I. Jordan. Supervised learning and systems with excess degrees of freedom, 1988.
[4] R. J. Williams, et al. On the use of backpropagation in associative reinforcement learning, 1988, IEEE 1988 International Conference on Neural Networks.
[5] Frank Fallside, et al. Dynamic reinforcement driven error propagation networks with application to game playing, 1989.
[6] Ronald J. Williams, et al. Experimental Analysis of the Real-time Recurrent Learning Algorithm, 1989.
[7] Michael I. Jordan, et al. Learning to Control an Unstable System with Forward Modeling, 1989, NIPS.
[8] Jürgen Schmidhuber, et al. Networks adjusting networks, 1990, Forschungsberichte, TU Munich.
[9] S. Piche, et al. First-Order Gradient Descent Training of Adaptive Discrete-Time Dynamic Networks, 1991.