To Swing Up an Inverted Pendulum Using Stochastic Real-Valued Reinforcement Learning

This paper addresses the problem of learning to swing up an inverted pendulum. The task belongs to the class of highly nonlinear, non-minimum-phase control problems for which no general control methodology exists, and it is therefore a challenge for reinforcement learning over time (Sutton, 1988).