Paper information - To swing up an inverted Pendulum using stochastic real-valued Reinforcement Learning

This paper addresses the problem of learning to swing up an inverted pendulum, a member of the class of highly nonlinear, non-minimum-phase control problems for which no general control methodology exists. It is thus a challenge for reinforcement learning over time (Sutton, 1988).

R. E. Eckmiller | A. Standfuss

[1] R. J. Williams, et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, 2004, Machine Learning.

[2] Vijaykumar Gullapalli, et al. A stochastic reinforcement learning algorithm for learning real-valued functions, 1990, Neural Networks.