To swing up an inverted Pendulum using stochastic real-valued Reinforcement Learning
暂无分享,去创建一个
This paper deals with the problem of learning to swing up an inverted pendulum, which belongs to the class of highly nonlinear, non-minimum phase control problems without a general control methodology. It is thus a challenge for reinforcement learning over time (Sutton, 1988).
[1] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[2] Vijaykumar Gullapalli,et al. A stochastic reinforcement learning algorithm for learning real-valued functions , 1990, Neural Networks.