论文信息 - Shaping as a method for accelerating reinforcement learning

Shaping as a method for accelerating reinforcement learning

Learning complex control behavior by building some initial control knowledge into the learning controller through shaping is addressed. The principle underlying shaping is that learning to solve complex problems can be facilitated by first learning to solve related simpler problems. The authors present experimental results illustrating the utility of shaping in training controllers by means of reinforcement learning methods. Shaping a reinforcement learning controller's behavior over time by gradually increasing the complexity of the control task as the controller learns makes it possible to scale reinforcement learning methods to more complex tasks. This is illustrated by an example.<<ETX>>

Andrew G. Barto | Vijaykumar Gullapalli | A. Barto | V. Gullapalli

[1] B. Skinner,et al. Science and human behavior , 1953 .

[2] W. K. Honig,et al. Handbook of Operant Behavior , 2022 .

[3] J. Staddon. Adaptive behavior and learning , 1983 .

[4] Richard S. Sutton,et al. Training and Tracking in Robotics , 1985, IJCAI.

[5] Steven J. Nowlan,et al. Gain Variation in Recurrent Error Propagation Networks , 1988, Complex Syst..

[6] Russell Leighton,et al. Shaping schedules as a method for accelerated learning , 1988, Neural Networks.

[7] Robert B. Allen,et al. Adaptive training for connectionist state machines , 1989, CSC '89.

[8] Vijaykumar Gullapalli,et al. A stochastic reinforcement learning algorithm for learning real-valued functions , 1990, Neural Networks.

[9] Alexis P. Wieland,et al. Evolving Controls for Unstable Systems , 1991 .

[10] A. P. Wieland,et al. Evolving neural network controllers for unstable systems , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[11] Vijaykumar Gullapalli,et al. Reinforcement learning and its application to control , 1992 .