Shaping as a method for accelerating reinforcement learning
暂无分享,去创建一个
[1] B. Skinner,et al. Science and human behavior , 1953 .
[2] W. K. Honig,et al. Handbook of Operant Behavior , 2022 .
[3] J. Staddon. Adaptive behavior and learning , 1983 .
[4] Richard S. Sutton,et al. Training and Tracking in Robotics , 1985, IJCAI.
[5] Steven J. Nowlan,et al. Gain Variation in Recurrent Error Propagation Networks , 1988, Complex Syst..
[6] Russell Leighton,et al. Shaping schedules as a method for accelerated learning , 1988, Neural Networks.
[7] Robert B. Allen,et al. Adaptive training for connectionist state machines , 1989, CSC '89.
[8] Vijaykumar Gullapalli,et al. A stochastic reinforcement learning algorithm for learning real-valued functions , 1990, Neural Networks.
[9] Alexis P. Wieland,et al. Evolving Controls for Unstable Systems , 1991 .
[10] A. P. Wieland,et al. Evolving neural network controllers for unstable systems , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.
[11] Vijaykumar Gullapalli,et al. Reinforcement learning and its application to control , 1992 .